Comments (8)
Fixed it, will do a PR right now.
from transformers.
Awesome, thanks!
Hi @mxjmtxrm
Can you share which model are you trying to quantize?
It is my own model based on the HF Llama 2 7B. I just modified the qkv projections to use bias=True, so the pretrained checkpoint contains .bias tensors, and then the above error arose.
I use the distil-whisper model and I'm getting the same error. The BitsAndBytesConfig quantization method works, but the HQQ method raises the error.
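For reference, a minimal sketch of the bitsandbytes path that works here (assumes a CUDA GPU and the bitsandbytes package installed; the 4-bit setting is illustrative, not the only working option):

```python
from transformers import AutoModelForSpeechSeq2Seq, BitsAndBytesConfig

# Illustrative 4-bit bitsandbytes quantization config.
quant_config = BitsAndBytesConfig(load_in_4bit=True)

# Loading distil-whisper with bitsandbytes instead of HQQ
# (requires a CUDA device and the bitsandbytes package).
model = AutoModelForSpeechSeq2Seq.from_pretrained(
    "distil-whisper/distil-large-v2",
    quantization_config=quant_config,
    device_map="cuda",
)
```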
Can you check?
Can you share a code snippet to reproduce this please?
Thanks everyone! Indeed I was able to repro with:
from transformers import AutoModelForSpeechSeq2Seq, HqqConfig
model_id = "distil-whisper/distil-large-v2"
quant_config = HqqConfig(nbits=1, group_size=64, quant_zero=False, quant_scale=False, axis=0)
model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id, quantization_config=quant_config, device_map="cuda")
print(model)