cbflivuni / llm_server Goto Github PK
View Code? Open in Web Editor NEWa client/server interface for llama.cpp
a client/server interface for llama.cpp
You will need to use a conda environment Install conda first, e.g. Download the latest miniconda installer from https://docs.conda.io/en/latest/miniconda.html and Run the installer example / slexample.sh has an example of all the steps that are needed source ./slexample.sh should run completely on a slurm cluster and setup everything - but beware that slurm_start.sh includes parameters that are specific to the cluster it is running on The conda environment and all other required folders will be created in the current directory (This will be fairly big, you should use a volume with enough storage - such as a "volatile" folder) When you run the script, it will create a local conda environment and install all the necessary packages It will also install and build llama.cpp, which will be needed for conversion and quantisation Then it will run the conversion and quantisation for an example model (A Llama variant from nvidia) This can then be executed - slexample.sh calls sbatch to run the specified model on a slurm cluster example.sh will run it without slurm One installed and running you can use the following scripts to interact with the server get_model.py <model_name> will download a model and create quantised versions of it sbatch slurm_start.sh <model_name> will start the server with the model prompt.sh will generate text using the prompt or promptb64.sh will take a base64 encoded prompt (To handle special characters inside the prompt) exit.sh will tell the server to shutdown
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.