kylejmorris / demo-yarn-llama-2-7b-128k-gptq Goto Github PK
View Code? Open in Web Editor NEWThis project forked from yachty66/demo-yarn-llama-2-7b-128k-gptq
This is a Yarn-Llama-2-7B-128K-GPTQ model starter template from Banana.dev that allows on-demand serverless GPU inference.