System Info transformers ve

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

cc <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

So this issue seems to be documented in the code itself <a href="https://github.com/hu

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Error while moving model to GPU `NotImplementedError: Cannot copy out of meta tensor; no data!` about transformers HOT 6 CLOSED

goelayu commented on June 1, 2024

Error while moving model to GPU `NotImplementedError: Cannot copy out of meta tensor; no data!`

from transformers.

Comments (6)

SunMarc commented on June 1, 2024 2

Hi @goelayu, this is expected since with torch.device('meta') also puts the buffers on the meta device. However, non persistant buffers are not saved in the state_dict. So, in the case of a llama model where we do have non persistant buffers, you get an error after loading the weights With init_empty_weights, by default, we don't put the buffer on the meta device. This is why it is working. Hope it is clearer !

from transformers.

ArthurZucker commented on June 1, 2024 1

cc @muellerzr for the accelerate related stuff rather than Sylvain!

from transformers.

goelayu commented on June 1, 2024

To add to the above, if i use init_empty_weights from accelerate I can skip the initialization without any errors.

Wondering what is the difference between the two? Also if it is possible to achieve the same using the torch.device('meta') context manager.

from transformers.

ArthurZucker commented on June 1, 2024

Mmmm could you make sure that the map_location is correct?
This might be expected, cc @SunMarc WDYT?

from transformers.

goelayu commented on June 1, 2024

So this issue seems to be documented in the code itself big_modeling.py, turns out you can't run model.to when using the meta device. I was hoping for some kind of explanation as to why is that the case?
(hence tagged @sgugger since the big_modeling.py file seems to be often modified by them)

Also if you notice my comment from above, replacing torch.device('meta') with init_empty_weights from the accelerate package seems to resolve the issue.

from transformers.

goelayu commented on June 1, 2024

@SunMarc thanks for the response, that answers my question.

from transformers.

Error while moving model to GPU `NotImplementedError: Cannot copy out of meta tensor; no data!` about transformers HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent