Giter Site home page Giter Site logo

Comments (6)

zhangyichang avatar zhangyichang commented on May 22, 2024

你好,目前有推断的硬件资源需求,https://github.com/QwenLM/Qwen-7B#quantization

Precision MMLU Memory
BF16 56.7 16.2G
Int8 52.8 10.1G
NF4 48.9 7.4G

训练的硬件需求我们会后续更新。

from qwen.

BeastyZ avatar BeastyZ commented on May 22, 2024

我是A40 48G 显存,采用官方默认的加载精度(fp32),显存占用31G左右。

from qwen.

txy6666yr avatar txy6666yr commented on May 22, 2024

你好,目前有推断的硬件资源需求,https://github.com/QwenLM/Qwen-7B#quantization

Precision MMLU Memory
BF16 56.7 16.2G
Int8 52.8 10.1G
NF4 48.9 7.4G
训练的硬件需求我们会后续更新。
好的 谢谢

from qwen.

txy6666yr avatar txy6666yr commented on May 22, 2024

我是A40 48G 显存,采用官方默认的加载精度(fp32),显存占用31G左右。

好的,我算力不够,第一天量化int8加载报错,我今天改成fp16再试试

from qwen.

CN-COTER avatar CN-COTER commented on May 22, 2024

你好,目前有推断的硬件资源需求,https://github.com/QwenLM/Qwen-7B#quantization

Precision MMLU Memory
BF16 56.7 16.2G
Int8 52.8 10.1G
NF4 48.9 7.4G
训练的硬件需求我们会后续更新。

您好,请问能提供一个int8的Qwen-chat-7B下载链接吗?

from qwen.

CN-COTER avatar CN-COTER commented on May 22, 2024

你好,目前有推断的硬件资源需求,https://github.com/QwenLM/Qwen-7B#quantization
Precision MMLU Memory
BF16 56.7 16.2G
Int8 52.8 10.1G
NF4 48.9 7.4G
训练的硬件需求我们会后续更新。

您好,请问能提供一个int8的Qwen-chat-7B下载链接吗?

我看了一下Quant用的是AutoGPTQ,如果不方便提供int8的模型的话,可以提供一下GPTQ量化时使用到的datasets吗?

from qwen.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.