fenglinbei / lm_server

An API service for LLM, embedding, and rerank models, implemented in Python

Python 55.90% Shell 0.18% Dockerfile 0.42% C++ 13.45% CMake 0.47% Makefile 0.09% Jupyter Notebook 29.50%

lm_server's Introduction

LLM SERVER

The current branch uses the chatglm3-6b-f16-ggml model.

🐳 Environment Setup

Starting the service

First start

Run docker-compose up -d in the project's main directory to build and start the Docker image.

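Subsequent starts

Run bash autorestart.sh. Because GPU memory is limited, an overly long context can exhaust GPU memory and make the service unavailable. The script automatically checks the service status and restarts the service if it is unavailable.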
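The check-and-restart behaviour described above can be sketched in Python. This is only an illustration of the idea, not the contents of autorestart.sh: the function names, the probed URL, the polling interval, and the use of docker-compose restart as the restart action are all assumptions.

```python
import subprocess
import time
import urllib.error
import urllib.request
from typing import Callable


def is_service_up(url: str, timeout: float = 5.0) -> bool:
    """Return True if an HTTP GET to `url` gets a non-5xx response."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError as exc:
        # The server answered; treat only 5xx statuses as "unavailable".
        return exc.code < 500
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False


def restart_with_compose() -> None:
    """Assumed restart action: restart the containers of the compose project."""
    subprocess.run(["docker-compose", "restart"], check=True)


def watch_once(probe: Callable[[], bool], restart: Callable[[], None]) -> bool:
    """Run one health check; call `restart` if it fails.

    Returns True if a restart was triggered.
    """
    if probe():
        return False
    restart()
    return True


def watch_forever(url: str, interval: float = 30.0) -> None:
    """Poll `url` every `interval` seconds and restart on failure."""
    while True:
        watch_once(lambda: is_service_up(url), restart_with_compose)
        time.sleep(interval)
```

With these hypothetical names, calling watch_forever("http://localhost:8000/") would poll a service on port 8000; the port should match the PORT value in .env. Passing the probe and the restart action as callables keeps the loop easy to test without a running container.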

🤖 Usage

Parameter configuration

Edit the following parameters in the .env file in the project root:

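PORT: the server port
MODEL_NAME: for a ggml model, the path to the model's .bin file; for a Hugging Face (HF) model, the model folder that contains config.json
MODEL_PATH: for a ggml model, the original model's folder that contains config.json; for an HF model, the model folder that contains config.json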
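As a rough illustration of how these three parameters fit together, the sketch below parses .env-style text and runs a few sanity checks. The helper names, the rule that a MODEL_NAME ending in .bin means a ggml model, and the example paths are assumptions made for this sketch; they are not taken from the project's code.

```python
from typing import Dict, List

REQUIRED_KEYS = ("PORT", "MODEL_NAME", "MODEL_PATH")


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def model_kind(model_name: str) -> str:
    """Guess the model type: a .bin file is treated as ggml, anything else as HF."""
    return "ggml" if model_name.endswith(".bin") else "hf"


def check_config(values: Dict[str, str]) -> List[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = [f"{key} is missing" for key in REQUIRED_KEYS if not values.get(key)]
    port = values.get("PORT", "")
    if port:
        try:
            valid = 0 < int(port) < 65536
        except ValueError:
            valid = False
        if not valid:
            problems.append("PORT must be an integer between 1 and 65535")
    return problems


# Hypothetical example values for a ggml setup.
example = """
PORT=8000
MODEL_NAME=/models/chatglm3-ggml.bin
MODEL_PATH=/models/chatglm3-6b
"""
config = parse_env(example)
print(model_kind(config["MODEL_NAME"]), check_config(config))
```

For an HF model, MODEL_NAME and MODEL_PATH would both point at the folder that contains config.json, and model_kind would return "hf" under the assumption above.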
