Comments (9)
你是如何运行的?
from inference.
你是如何运行的?
你好,我是使用xinference的UI来运行的,然后在one-api上自定义模型,然后调用one-api去请求这个向量模型,然后就报错了
from inference.
我看你走到了 chat 的接口,embedding 模型不能走 chat。
from inference.
好的,经过排查是下游调用的问题,感谢
from inference.
好的,经过排查是下游调用的问题,感谢
你好,请问你是怎么解决的?
from inference.
from inference.
好的,经过排查是下游调用的问题,感谢
我也碰到了这个问题,请问是重装xinference吗?
from inference.
请问怎么弄好的,重装了哪个库啊?
from inference.
from inference.
Related Issues (20)
- It seems that this preserves the ReAct "thought" process, but this is not user-friendly. Is it possible to use the method of the official Qwen function call? HOT 6
- [FER] Intel NPU support HOT 1
- BUG-运行Qwen2-MoE-14B-GPTQ4 模型出错
- [Suggestion]Image and audio models should download necessary files only.
- 支持昇腾处理器 HOT 2
- Failed to build chatglm-cpp llama-cpp-python pynini HOT 2
- BUG - Using xinference to reason the qwen1.5 model, the react prompt that hides the "thought" reasoning process leads to a reasoning bug, which prevents the upper-level application layers of the reasoning engine from using the agent normally. HOT 1
- 是否考虑支持Sqlcoder HOT 1
- 下载 glm4-chat 提示 `Distant resource does not have a Content-Length` HOT 1
- BUG
- qwen2工具调用,对话中输出了部分思考过程 HOT 8
- 运行速度很慢,为什么? HOT 3
- 支持将第三方llm平台转为openai-like接口嘛,比如火山方舟、百度千帆 HOT 1
- BUG xinference、glm4工具调用报错400
- v0.12.2没有上传到dockerhub HOT 6
- [QUESTION] can't install "xinference[all]", No module named 'Cython' HOT 2
- [QUESTION] Failed to import module 'SentenceTransformer' HOT 1
- docker部署时:Unexpected error from cudaGetDeviceCount() HOT 6
- xinference 格式初始化llm,sse 流式输出有问题 HOT 1
- CURL没返回值 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from inference.