Comments (8)
It means the GPU is already in use by other launched models, so there is no slot left.
from inference.
The problem persists when running it as follows:
docker run -dit -p 8080:9997 -v /data/ModelFiles:/workspace --gpus all xprobe/xinference:v0.8.2 xinference-local -H 0.0.0.0
Have you launched any models before?
Yes, I have already loaded one model, ChatGLM3-6B-32k, and this error appears whenever I try to load another model.
So one GPU can only run one model? That seems like a waste of resources.
That is how it works at the moment...
It is hard to believe that a GPU can only run one model.
It is indeed somewhat wasteful.
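For readers trying to picture the behaviour described in this thread, here is a toy sketch of a "one model per GPU" slot rule. It is not Xinference's actual code: the class and method names (`OneModelPerGpuAllocator`, `launch`, `terminate`, `NoFreeGpuSlot`) and the model name `another-model` are hypothetical and exist only to illustrate why a second launch fails once the single GPU is occupied.

```python
from typing import Dict, Optional


class NoFreeGpuSlot(RuntimeError):
    """Raised when every GPU already hosts a model (hypothetical error type)."""


class OneModelPerGpuAllocator:
    """Toy allocator: each GPU index holds at most one model.

    Illustrative assumption only, not the real Xinference scheduler.
    """

    def __init__(self, gpu_count: int) -> None:
        # Map GPU index -> name of the model occupying it, or None if free.
        self._slots: Dict[int, Optional[str]] = {i: None for i in range(gpu_count)}

    def launch(self, model_name: str) -> int:
        """Place the model on the first free GPU and return that GPU's index."""
        for gpu_idx, occupant in self._slots.items():
            if occupant is None:
                self._slots[gpu_idx] = model_name
                return gpu_idx
        raise NoFreeGpuSlot(f"no GPU slot left for {model_name!r}")

    def terminate(self, model_name: str) -> None:
        """Free the GPU held by the given model."""
        for gpu_idx, occupant in self._slots.items():
            if occupant == model_name:
                self._slots[gpu_idx] = None
                return
        raise KeyError(model_name)


if __name__ == "__main__":
    allocator = OneModelPerGpuAllocator(gpu_count=1)
    allocator.launch("ChatGLM3-6B-32k")      # takes the only slot
    try:
        allocator.launch("another-model")    # fails: the slot is occupied
    except NoFreeGpuSlot as exc:
        print(exc)
```

Under this rule, the only ways to load a second model are to terminate the first one or to add another GPU, which matches what the reporter observed.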