Comments (3)
Hi @minsuk00
you can also try the release_memory
utility method from accelerate.utils
- cc @muellerzr
from transformers.
@younesbelkada -cc @muellerz
Thanks for the suggestion, but it doesn't seem to work.
clip_text_model = accelerate.utils.release_memory(clip_text_model)
does not free any GPU memory.
Additionally, calling clip_text_model.cpu()
or torch.cuda.empty_cache()
simply results in the behavior described above.
from transformers.
cc @muellerzr regarding the accelerate behaviour.
Regarding torch.cuda.empty_cache()
it's recommended that this function is not manually used c.f. a related issue, and this discussion in the pytorch forum
from transformers.
Related Issues (20)
- Whisper generate return a slice of result if result have more than one added token HOT 2
- Gob
- LR = 0 when using DeepSpeed Config and LORA on Trainer. HOT 3
- Cannot export sdxl encoder to onnx when transformers[torch] >= 4.43.0 (Occurred when translating scaled_dot_product_attention).
- [i18n-<languageCode>] Translating docs to <languageName>spañol HOT 1
- apply_chat_template method not working correctly for llama 3 tokenizer
- Trainer has stuck during the code block of "Trainer.train" in Jupyter Notebook
- llama3 position_ids error with left padding HOT 1
- Mode-aware chat templates for distinct training and inference behaviors HOT 1
- XLMRobertaTokenizer attribute has disappeared from transformers.models.xlm_roberta
- how to fine tune TrOCR on specifique langage guide. HOT 1
- Incorrect logits shape for GIT model (microsoft/git-base-textvqa) HOT 2
- ValueError: Unrecognized model. Should have a model_type key in its config.json
- Can not detect bitsandbytes-windows HOT 4
- Using multi GPU fails with AutoModelForCausalLM quantization_config=quantization_config HOT 2
- Add multi image prompts to multimodal LLMs that support it (PaliGemma) HOT 2
- How to get the score of each token when using pipeline
- Covert chemaleon weights to hf, ImportError HOT 2
- Clarification on Classification Token. HOT 1
- ValueError: No columns in the dataset match the model's forward method signature when using SFTTrainer and DataParallel. HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from transformers.