Comments (6)
2024-02-15 12:30:55,450 xinference.api.restful_api 772 ERROR [address=127.0.0.1:46891, pid=784] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:46891, pid=784] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
from inference.
If I launch model with xinference-local, why I get those kind of distributed deployment stuff.
@log_async(logger=logger)
async def describe_model(self, model_uid: str) -> Dict[str, Any]:
replica_info = self._model_uid_to_replica_info.get(model_uid, None)
if replica_info is None:
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
# Use rep id 0 to instead of next(replica_info.scheduler) to avoid
# consuming the generator.
replica_model_uid = build_replica_model_uid(model_uid, replica_info.replica, 0)
worker_ref = self._replica_model_uid_to_worker.get(replica_model_uid, None)
if worker_ref is None:
raise ValueError(
f"Model not found in the model list, uid: {replica_model_uid}"
)
info = await worker_ref.describe_model(model_uid=replica_model_uid)
info["replica"] = replica_info.replica
return info
from inference.
from xinference.client import RESTfulClient
client = RESTfulClient("http://127.0.0.1:9997")
print(client.list_models())
error out info:
{'zephyr-7b-beta': {'model_type': 'LLM', 'address': '127.0.0.1:39873', 'accelerators': ['0', '1'], 'model_name': 'zephyr-7b-beta', 'model_lang': ['en'], 'model_ability': ['chat'], 'model_description': 'Zephyr-7B-β is the second model in the series, and is a fine-tuned version of mistralai/Mistral-7B-v0.1', 'model_format': 'pytorch', 'model_size_in_billions': 7, 'model_family': 'zephyr-7b-beta', 'quantization': '4-bit', 'model_hub': 'huggingface', 'revision': '3bac358730f8806e5c3dc7c7e19eb36e045bf720', 'context_length': 8192}}
2024-02-16 14:59:33,086 xinference.core.supervisor 6525 DEBUG Enter list_models, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>,), kwargs: {}
2024-02-16 14:59:33,087 xinference.core.worker 6525 DEBUG Enter list_models, args: (<xinference.core.worker.WorkerActor object at 0x7d24c5baa610>,), kwargs: {}
2024-02-16 14:59:33,087 xinference.core.worker 6525 DEBUG Leave list_models, elapsed time: 0 s
2024-02-16 14:59:33,087 xinference.core.supervisor 6525 DEBUG Leave list_models, elapsed time: 0 s
2024-02-16 14:59:47,890 xinference.core.supervisor 6525 DEBUG Enter describe_model, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>, 'e1613eb0-9f2f-11ee-afd3-573c38e3e261'), kwargs: {}
2024-02-16 14:59:47,892 xinference.api.restful_api 6509 ERROR [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: e1613eb0-9f2f-11ee-afd3-573c38e3e261
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: e1613eb0-9f2f-11ee-afd3-573c38e3e261
2024-02-16 14:59:48,127 xinference.core.supervisor 6525 DEBUG Enter describe_model, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>, '3dc22fb0-740c-11ee-be13-452b73d8be98'), kwargs: {}
2024-02-16 14:59:48,130 xinference.api.restful_api 6509 ERROR [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
2024-02-16 15:00:14,962 xinference.core.supervisor 6525 DEBUG Enter describe_model, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>, 'e1613eb0-9f2f-11ee-afd3-573c38e3e261'), kwargs: {}
2024-02-16 15:00:14,964 xinference.api.restful_api 6509 ERROR [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: e1613eb0-9f2f-11ee-afd3-573c38e3e261
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: e1613eb0-9f2f-11ee-afd3-573c38e3e261
2024-02-16 15:00:15,224 xinference.core.supervisor 6525 DEBUG Enter describe_model, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>, '3dc22fb0-740c-11ee-be13-452b73d8be98'), kwargs: {}
2024-02-16 15:00:15,226 xinference.api.restful_api 6509 ERROR [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
2024-02-16 15:00:55,887 xinference.core.supervisor 6525 DEBUG Enter describe_model, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>, 'e1613eb0-9f2f-11ee-afd3-573c38e3e261'), kwargs: {}
2024-02-16 15:00:55,890 xinference.api.restful_api 6509 ERROR [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: e1613eb0-9f2f-11ee-afd3-573c38e3e261
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: e1613eb0-9f2f-11ee-afd3-573c38e3e261
2024-02-16 15:00:56,130 xinference.core.supervisor 6525 DEBUG Enter describe_model, args: (<xinference.core.supervisor.SupervisorActor object at 0x7d25f736cb30>, '3dc22fb0-740c-11ee-be13-452b73d8be98'), kwargs: {}
2024-02-16 15:00:56,132 xinference.api.restful_api 6509 ERROR [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/xinference/api/restful_api.py", line 542, in describe_model
data = await (await self._get_supervisor_ref()).describe_model(model_uid)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 227, in send
return self._process_result_message(result)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/context.py", line 102, in _process_result_message
raise message.as_instanceof_cause()
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 657, in send
result = await self._run_coro(message.message_id, coro)
File "/opt/conda/lib/python3.10/site-packages/xoscar/backends/pool.py", line 368, in _run_coro
return await coro
File "/opt/conda/lib/python3.10/site-packages/xoscar/api.py", line 384, in on_receive
return await super().on_receive(message) # type: ignore
File "xoscar/core.pyx", line 558, in on_receive
raise ex
File "xoscar/core.pyx", line 520, in xoscar.core._BaseActor.on_receive
async with self._lock:
File "xoscar/core.pyx", line 521, in xoscar.core._BaseActor.on_receive
with debug_async_timeout('actor_lock_timeout',
File "xoscar/core.pyx", line 526, in xoscar.core._BaseActor.on_receive
result = await result
File "/opt/conda/lib/python3.10/site-packages/xinference/core/utils.py", line 45, in wrapped
ret = await func(*args, **kwargs)
File "/opt/conda/lib/python3.10/site-packages/xinference/core/supervisor.py", line 880, in describe_model
raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: [address=127.0.0.1:54988, pid=6525] Model not found in the model list, uid: 3dc22fb0-740c-11ee-be13-452b73d8be98
from inference.
I also encountered the same issue, and this problem causes the model not to return any results. Awaiting solutions online, thank you. The content below is the output information from the log.
File "/usr/local/lib/python3.10/dist-packages/xinference/core/worker.py", line 654, in describe_model raise ValueError(f"Model not found in the model list, uid: {model_uid}")
from inference.
same issue, and the rerank model
from inference.
same issue, and the rerank model
怎么解决的呢?
from inference.
Related Issues (20)
- huggingface源和modelscope源下载的模型cache不能共存?
- BUG RuntimeError: [address=0.0.0.0:33341, pid=464586] The expanded size of the tensor (913) must match the existing size (514) at non-singleton dimension 1. Target sizes: [3, 913]. Tensor sizes: [1, 514] HOT 3
- qwen2.0-7b-instruct cannot use file as request body.(不能上传文件)
- BUG: quantization for glm-4v
- BUG super-worker模式有健壮性问题。 HOT 1
- glm4-chat的tool call 在dify下,无法使用 HOT 8
- BUG: docker image of 0.12.1 launch failed HOT 6
- FP8量化支持
- Image upgrade to 0.12.1, running Qwen1.5-14B-Chat-GPTQ-Int4 is much slower compared to 0.11.0
- BUG fix security vulnerability HOT 1
- Feat: Support download model only. HOT 1
- FEAT: support embedding model Alibaba-NLP/gte-Qwen2-7B-instruct
- Please support the configuration of chattts in the “Register Model”
- worker,ValueError: [address=0.0.0.0:45416, pid=47] Model not found,
- [QUESTION] i pulled qwen2 awq 7b 4bit quantized model but it is giving me gibberish text which has no meaning
- Support Optional Configurations for Embedding models HOT 3
- BUG 20G以上模型导致机器重启 HOT 3
- BUG When I reasoning the model Qwen-VL-Chat-Int4 and Yi-VL-6B, the Model Engine cannot be found HOT 2
- Suggestions about transcription enhancement HOT 1
- 多卡运行模型启动报错,单卡运行正常
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from inference.