Comments (2)
Can you open the UI, normally the command line will stop at starting xxx
.
from inference.
Can you open the UI, normally the command line will stop at
starting xxx
.
yes,i can open the UI and fastAPI pages, but when i want to test the api, it shows "500:Internal Server Error, content-length: 21
content-type: text/plain; charset=utf-8
date: Fri,24 May 2024 05:32:41 GMT
server: uvicorn "
from inference.
Related Issues (20)
- Qwen1.5-14b-chat-gptq-int4 推理速度 HOT 1
- Failed to do inference with latest GLM-4 chat 9b model HOT 2
- v1/completions接口无法使用,返回空字符串 HOT 1
- 显示启动模型失败,load失败 HOT 2
- Failed to register model, Invalid model URI D:/Pretrainedmodels3/ZhipuAI/chat4/glm-4-9b-chat. HOT 1
- 建议新增对图embedding模型的 HOT 1
- 使用xinference的api服务调用,当过多请求的时候,xinference本地api会直接卡死 HOT 7
- Attention mask size mismatch error and question about input choice HOT 1
- 关于注册自定义模型的prompt_style参数说明 HOT 1
- ui界面可以支持audio模型 指定worker启动吗 HOT 1
- 增加embedding多卡分布式部署能力 HOT 1
- k8s拉起xinference能够pod,running,但是内置的模型,不能运行起来;但是手动进入pod里面,执行命令后,能够把模型运行起来,显存成功占用,是为什么 HOT 1
- 偶发性报错 xinference.api.restful_api 164241 ERROR Remote server 0.0.0.0:44339 closed HOT 4
- 【Reranker建议】xinference的页面端建议支持半精度启动reranker HOT 5
- xinference 在cuda118环境下可以安装使用吗? HOT 2
- Failed to launch custom model, 显示No such file or directory: '/D:/Pretrainedmodels3/ZhipuAI/chat4/glm-4-9b-chat' -> '/home/chat4/glm-4-9b-chat' HOT 1
- xinference api调用报错 Fast Chat AI error:{"detail":"Method Not Allowed"} HOT 1
- xinference启动一段时间后embedding API访问异常"detail":"[address=0.0.0.0:37167, pid=193948] [Errno 5] Input/output error" HOT 2
- rerank模型启动时,页面支持选择use_fp16参数 HOT 2
- uvicorn.error HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from inference.