Giter Site home page Giter Site logo

glm4 9b 1m 启动报错 about glm-4 HOT 12 CLOSED

brightzhu2020 avatar brightzhu2020 commented on August 10, 2024
glm4 9b 1m 启动报错

from glm-4.

Comments (12)

bridgearchway avatar bridgearchway commented on August 10, 2024 3

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).
there is no clear basic environment recommendations. too bad

I just successfully fixed it. The version of transformer seems to be 4.40.0, which can be found in requirements.txt in basic_demo. Once I renew the version, it works. Hopefully this can help you as well.

from glm-4.

huaizhe2012 avatar huaizhe2012 commented on August 10, 2024 1

情况一样,tokenizer加载报错
transformer 升级为4.40.0确实好了

from glm-4.

zRzRzRzRzRzRzR avatar zRzRzRzRzRzRzR commented on August 10, 2024 1

请严格按照req安装依赖哦,如果是windows系统,不能装vLLM,使用transformers后端

from glm-4.

JiaweiMorris avatar JiaweiMorris commented on August 10, 2024

使用huggingface model页的多模态demo代码出现同样报错
return self.mergeable_ranks[token]
KeyError: '<|endoftext|>'

from glm-4.

bridgearchway avatar bridgearchway commented on August 10, 2024

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).

from glm-4.

brightzhu2020 avatar brightzhu2020 commented on August 10, 2024

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).
there is no clear basic environment recommendations. too bad

from glm-4.

brightzhu2020 avatar brightzhu2020 commented on August 10, 2024

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).
there is no clear basic environment recommendations. too bad

I just successfully fixed it. The version of transformer seems to be 4.40.0, which can be found in requirements.txt in basic_demo. Once I renew the version, it works. Hopefully this can help you as well.

would you please shall your full enviroment details?
i had other issues....
CUDA
Pytorch
etc..?

Thanks!

from glm-4.

bridgearchway avatar bridgearchway commented on August 10, 2024

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).
there is no clear basic environment recommendations. too bad

I just successfully fixed it. The version of transformer seems to be 4.40.0, which can be found in requirements.txt in basic_demo. Once I renew the version, it works. Hopefully this can help you as well.

would you please shall your full enviroment details? i had other issues.... CUDA Pytorch etc..?

Thanks!

Sure, my envs are:

win 10,
cuda 12.1,
python 3.10,
gpu 3090*2,
transformer==4.40.0,
torch==2.1.0

That's all for inference. Btw, I noticed that the README in basic_demo suggests "GPUs above A100, V100, 20 and older GPU architectures are not supported". I hope this may help.

from glm-4.

brightzhu2020 avatar brightzhu2020 commented on August 10, 2024

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).
there is no clear basic environment recommendations. too bad

I just successfully fixed it. The version of transformer seems to be 4.40.0, which can be found in requirements.txt in basic_demo. Once I renew the version, it works. Hopefully this can help you as well.

would you please shall your full enviroment details? i had other issues.... CUDA Pytorch etc..?
Thanks!

Sure, my envs are:

win 10, cuda 12.1, python 3.10, gpu 3090*2, transformer==4.40.0, torch==2.1.0

That's all for inference. Btw, I noticed that the README in basic_demo suggests "GPUs above A100, V100, 20 and older GPU architectures are not supported". I hope this may help.

Thanks!
it shall be a amper GPU, not a tuning GPUT

from glm-4.

brightzhu2020 avatar brightzhu2020 commented on August 10, 2024

请严格按照req安装依赖哦,如果是windows系统,不能装vLLM,使用transformers后端
非常感谢
我是从GLM3的环境继承的,全局的CUDA版本也不一样
还是希望官方出一般最低要求或者兼容性比较好的要求。一台本地机器上能跑GLM3 6b,也能跑GLM4-9b
期待越来越好,越来越完善。
等待你们的更新

from glm-4.

zRzRzRzRzRzRzR avatar zRzRzRzRzRzRzR commented on August 10, 2024

或许可以使用我们trans_cli_demo,环境还是要重新装的,现在的依赖默认不会装vLLM,但是如果用trans后段能推理的长度非常非常短,8K差不多就到消费卡24G显存的上限

from glm-4.

brightzhu2020 avatar brightzhu2020 commented on August 10, 2024

Me win 10, cuda 12.1, python 3.10, gpu 3090*2, also encountered the same problem with KeyError: '<|endoftext|>'. It seems that the problem comes from AutoTokenizer.from_pretrained(...).
there is no clear basic environment recommendations. too bad

I just successfully fixed it. The version of transformer seems to be 4.40.0, which can be found in requirements.txt in basic_demo. Once I renew the version, it works. Hopefully this can help you as well.

would you please shall your full enviroment details? i had other issues.... CUDA Pytorch etc..?
Thanks!

Sure, my envs are:

win 10, cuda 12.1, python 3.10, gpu 3090*2, transformer==4.40.0, torch==2.1.0

That's all for inference. Btw, I noticed that the README in basic_demo suggests "GPUs above A100, V100, 20 and older GPU architectures are not supported". I hope this may help.

May I ask what is the motherboard and CPU? can two GPU run maxium performance, as the PCIE channel limts of the CPU vs. two 3090. Thanks!

from glm-4.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.