Comments (6)
wsl2 ubuntu 22.0.4遇到了一样的错误,去google了一些,可以这样解决这两的编译错误:
激活进入到你的conda环境里面(我用的是python3.10)后:
sudo add-apt-repository ppa:ubuntu-toolchain-r/test
sudo apt update
sudo apt install gcc-11 g++-11
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-11 60 --slave /usr/bin/g++ g++ /usr/bin/g++-11
pip install --upgrade pip
pip install --upgrade setuptools wheel
sudo apt-get install build-essential
跑完这些,中间install那一步gcc-11没见装新版本,然后再跑pip install "xinference[all]"就正常通过了:
Building wheels for collected packages: chatglm-cpp, llama-cpp-python
Building wheel for chatglm-cpp (pyproject.toml) ... done
Created wheel for chatglm-cpp: filename=chatglm_cpp-0.3.1-cp310-cp310-linux_x86_64.whl size=825124 sha256=d79a39e161ba40d61f3bffa10d4864aa2e1579f3a8a41e137ee16eebbe9d8a01
Stored in directory: /home/leo/.cache/pip/wheels/c2/d5/b5/013d7e9b0893e485df77a9eabb231d54a0e803438550f0d75c
Building wheel for llama-cpp-python (pyproject.toml) ... done
Created wheel for llama-cpp-python: filename=llama_cpp_python-0.2.62-cp310-cp310-manylinux_2_35_x86_64.whl size=3127781 sha256=2da1cc274937e433aad637621c260716b0156d1f210f788cde73cea5b9a79f12
Stored in directory: /home/leo/.cache/pip/wheels/c0/81/de/d4cc8f152d89865379dbf28ca672358c667192ee55deaca7cb
Successfully built chatglm-cpp llama-cpp-python
from inference.
@cq134cq .安装chatglm-cpp 和 llama-cpp-python时报错了。这两个库在window上安装需要正确的CMake编译环境,可以到对应的github repo中查询解决方案。
from inference.
window上安装需要正确的CMake编译环境
我是一个小白,请问window上安装需要正确的CMake编译环境装什么
from inference.
window上安装需要正确的CMake编译环境
我是一个小白,请问window上安装需要正确的CMake编译环境装什么
https://github.com/abetlen/llama-cpp-python 对于llama-cpp-python,参考这个仓库的readme看下windows环境下需要什么条件。他这里也给出来一些错误的解决方案:
参考这些信息先尝试单独安装llama-cpp-python。
from inference.
这两个包直接pip大概率出错,可以用下面链接里的,找到自己对应版本,手动pip安装就行了,win和linux都适用,不需要再去弄cmake啥的。
https://github.com/li-plus/chatglm.cpp/releases
https://github.com/abetlen/llama-cpp-python/releases
from inference.
己
这两个包直接pip大概率出错,可以用下面链接里的,找到自己对应版本,手动pip安装就行了,win和linux都适用,不需要再去弄cmake啥的。
谢谢
from inference.
Related Issues (20)
- Any instruction for deploying faster-whisper
- windows环境启动qwen2-instruct报错KeyError HOT 4
- BUG 使用glm4的tool call 当重新开始会问,提示报错 HOT 3
- BUG embedding和rerank模型持续显存占用 HOT 2
- [Suggestion]Enable Custom Model Store via loading `llm_family.json` from url. HOT 1
- BUG Service Unresponsive with High Request Volume for Unavailable Models
- QUESTION: when temperature=0(greedy),model answers are still random HOT 1
- xinference 后端驱动chatglm4-9b-chat,接入到dify/lobe-chat时出错,不能正常回答 HOT 7
- can not launch chattts
- Some models require login to Hugging Face for download HOT 2
- Gemma2 Supported ? HOT 1
- launch beg-reranker-base error trust_remote_code HOT 1
- Chat completion stream got an error: _get_logits_warper() missing 1 required positional argument: 'device' HOT 4
- Can't deploy bge-reranker-v2-m3 HOT 2
- deepseek-coder-6.7b-instruct 部署出错 HOT 7
- embedding模型中是否可以在多个worker节点运行 HOT 2
- 本地启动无法用gpu拉起自定义路径的本地模型 ValueError: [address=0.0.0.0:11083, pid=11576] Worker 0.0.0.0:11083 cannot use the GPUs with these indexes: [2]. Worker 0.0.0.0:11083 can only see these GPUs: []. HOT 1
- 找不到GPU
- 使用官方镜像v0.12.3,运行自定义embedding,还是去连接huggingface是为啥 HOT 1
- 能否复用通过"huggingface-cli download"下载的的模型,而不是重新在xinference下载? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from inference.