sleekmike / finetune_gpt-j_6b_8-bit
Fine-tuning GPT-J-6B on Colab or an equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)
License: MIT License
Hi @sleekmike,
Great work on the notebook. I just wanted to ask about the possibility of fine-tuning Pythia 12B or any smaller variant.
I have some specific use cases I'd like to explore with Pythia 12B.
Do you have any idea whether the code you published in this repo would also work with the Pythia variants? If not, could you help by creating a blog post or notebook for that as well?
Heya,
I almost got this working with DirectML on Windows, but bitsandbytes requires CUDA on Linux. I'm going to run out of VRAM if I don't do the monkey patching seen in the repo, so I was wondering if you had any ideas for removing that dependency.
Thanks!
When I run this example in JupyterLab and start fine-tuning the CodeParrot example, I get the following error message:
RuntimeError: Output 0 of DequantizeAndLinearBackward is a view and is being modified inplace. This view was created inside a custom Function (or because an input was returned as-is) and the autograd logic to handle view+inplace would override the custom backward associated with the custom Function, leading to incorrect gradients. This behavior is forbidden. You can fix this by cloning the output of the custom Function.
Can you please help me?
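(Not the repo author, but) this error usually means a custom `torch.autograd.Function` returned a view (or an input as-is) and that tensor was later modified in place. A minimal self-contained sketch of the fix the error message suggests, using a toy Function standing in for `DequantizeAndLinearBackward` (names and shapes here are illustrative, not from the repo):

```python
import torch
import torch.nn.functional as F

class ExampleLinearFn(torch.autograd.Function):
    """Toy custom Function standing in for DequantizeAndLinear."""

    @staticmethod
    def forward(ctx, x, weight):
        ctx.save_for_backward(x, weight)
        # If forward returns a view (or an input as-is), any later
        # in-place op on the output raises the reported RuntimeError.
        # .clone() detaches the result from any view relationship.
        return F.linear(x, weight).clone()

    @staticmethod
    def backward(ctx, grad_out):
        x, weight = ctx.saved_tensors
        grad_x = grad_out @ weight          # dL/dx
        grad_w = grad_out.t() @ x           # dL/dweight
        return grad_x, grad_w

x = torch.randn(2, 4, requires_grad=True)
w = torch.randn(3, 4, requires_grad=True)
out = ExampleLinearFn.apply(x, w)
out += 1.0            # in-place op that the clone makes safe
out.sum().backward()  # gradients flow without the view+inplace error
```

In the repo's monkey-patched forward, the equivalent workaround is to replace an in-place `output += adapter(input)` with an out-of-place `output = output + adapter(input)`, or to clone the Function's output before mutating it.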
Really awesome work! I was wondering how you converted the slim weights (float16, as provided by EleutherAI) to 8-bit. Could you please give some direction if one wants to do this for other models?
Thanks
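(Not the author, but) the 8-bit weights in this repo come from block-wise quantization as implemented in the bitsandbytes library. A simplified NumPy sketch of the underlying idea, using plain linear absmax int8 per block; note the real library uses a non-linear dynamic-tree codebook rather than this linear scheme:

```python
import numpy as np

def quantize_absmax(w_fp16, block_size=256):
    """Block-wise absmax quantization: fp16 -> int8 plus a per-block scale."""
    flat = w_fp16.astype(np.float32).ravel()
    pad = (-len(flat)) % block_size           # pad so length divides evenly
    flat = np.pad(flat, (0, pad))
    blocks = flat.reshape(-1, block_size)
    absmax = np.abs(blocks).max(axis=1, keepdims=True)
    absmax[absmax == 0] = 1.0                 # avoid division by zero
    q = np.round(blocks / absmax * 127).astype(np.int8)
    return q, absmax

def dequantize_absmax(q, absmax, shape):
    """Invert the quantization back to float32 with the stored scales."""
    deq = (q.astype(np.float32) / 127.0) * absmax
    return deq.ravel()[: int(np.prod(shape))].reshape(shape)

w = np.random.randn(8, 300).astype(np.float16)
q, scale = quantize_absmax(w)
w_hat = dequantize_absmax(q, scale, w.shape)
max_err = np.abs(w.astype(np.float32) - w_hat).max()
```

The per-block scale keeps the quantization error bounded by roughly `absmax / 254` per block, which is why block-wise schemes lose much less accuracy than quantizing a whole tensor with a single scale.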
Hello!
After some epochs I get
RuntimeError: probability tensor contains either inf, nan or element < 0
and the saved model stops working.
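(Not the author, but) this error during generation usually means the logits contain NaN/Inf, which most often traces back to training divergence; lowering the learning rate or adding gradient clipping during fine-tuning may prevent it. As a generation-time workaround only, the logits can be sanitized before sampling. A sketch, with the helper name being mine, not from the repo:

```python
import torch

def safe_sample(logits):
    """Sanitize logits before multinomial sampling to avoid the
    'probability tensor contains either inf, nan or element < 0' crash.
    This is a workaround sketch, not a fix for the diverged weights."""
    logits = torch.nan_to_num(logits, nan=0.0, posinf=1e4, neginf=-1e4)
    probs = torch.softmax(logits, dim=-1)
    return torch.multinomial(probs, num_samples=1)

# Logits with exactly the values that trigger the crash:
bad = torch.tensor([[0.5, float("nan"), float("inf"), -2.0]])
tok = safe_sample(bad)  # samples without raising
```

This does not recover a model whose weights went to NaN; if the checkpoint itself is broken, retraining from an earlier checkpoint with a smaller learning rate is the real fix.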
Hi,
I get this error when fine tuning the model:
RuntimeError: Output 0 of DequantizeAndLinearBackward is a view and is being modified inplace. This view was created inside a custom Function (or because an input was returned as-is) and the autograd logic to handle view+inplace would override the custom backward associated with the custom Function, leading to incorrect gradients. This behavior is forbidden. You can fix this by cloning the output of the custom Function.
Do I need to save a checkpoint to avoid this? I'm not sure how that would work.
Thanks,
Able to fine-tune the 6B on an RTX 3080 with this! Had to change the batch size to 32, but that's about it.