llm-trainer's Introduction

Train LLMs with qLoRA!

Introduction

This repository contains scripts and configurations for training and merging models using the qLoRA method for efficient model training.

Prerequisites

Python 3.x
PyTorch
Transformers
BitsAndBytes
pandas
YAML

Install the required libraries using:

pip install -r requirements.txt

Configuration File (lora_config.yaml)

This YAML file contains configuration settings for the training process. Update the auth_token with your token and adjust other parameters as per your requirement.

Training the Model (lora_train.py)

The lora_train.py script trains the model based on the configuration provided in lora_config.yaml. Please make sure the data you have is in the appropriate format and mention the column name that has the data in the config file.

To start the training process, make sure all the values in the lora_config.yaml file are correct and then run the training script:

python lora_train.py

The script will save the trained model in the specified output directory.

Merging LoRA Layers (merge_lora.py)

The merge_lora.py script merges LoRA layers into a base model.

Before running the script, fill in the model_name_or_path, auth_token, out_folder_path, and lora_checkpoint_path in the script.

Troubleshooting

If you encounter any issues, please check if your environment meets all prerequisites. For further assistance, create an issue in this repository.

llm-trainer's People

Contributors

Stargazers

Watchers

llm-trainer's Issues

ValueError: You can't pass `load_in_4bit`or `load_in_8bit` as a kwarg when passing `quantization_config` argument at the same time.

python lora_train.py
Traceback (most recent call last):
File "LLM-Trainer/lora_train.py", line 26, in
model = AutoModelForCausalLM.from_pretrained(
File "/home/chris/miniconda3/envs/lora/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
return model_class.from_pretrained(
File "/home/chris/miniconda3/envs/lora/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2977, in from_pretrained
raise ValueError(
ValueError: You can't pass load_in_4bitor load_in_8bit as a kwarg when passing quantization_config argument at the same time.

Fixed by:
model = AutoModelForCausalLM.from_pretrained(
model_name_or_path,
#load_in_8bit=True,
device_map="cuda:0",
trust_remote_code=True,
token=auth_token,
quantization_config=bnb_config,
)

what are the package requirements versions?

Hi, thanks for the scripts! I want to train the stable3b, but I am having compatibility issues on Ubuntu 22.04.

python lora_train.py 
You are using an old version of the checkpointing format that is deprecated (We will also silently ignore `gradient_checkpointing_kwargs` in case you passed it).Please update to the new format on your modeling file. To use the new format, you need to completely remove the definition of the method `_set_gradient_checkpointing` in your model.
CUDA extension not installed.
CUDA extension not installed.
trainable params: 8710144 || all params: 1535376384 || trainable%: 0.5672969892442998
Traceback (most recent call last):
  File "/home/user/LLM-Trainer/lora_train.py", line 89, in <module>
    data = pd.read_csv(config["data_csv_path"])
                       ~~~~~~^^^^^^^^^^^^^^^^^
TypeError: 'LoraConfig' object is not subscriptable

These are my package versions. Does it work correctly on your end?

Used (https://huggingface.co/datasets/KendrickPham/fine-tuning-csv) this dataset

transformers
Version: 4.36.0.dev0

bitsandbytes
Version: 0.41.2.post2

peft
Version: 0.6.2

Recommend Projects

04rr / llm-trainer Goto Github PK

llm-trainer's Introduction

Train LLMs with qLoRA!

Introduction

Prerequisites

Configuration File (lora_config.yaml)

Training the Model (lora_train.py)

Merging LoRA Layers (merge_lora.py)

Troubleshooting

llm-trainer's People

Contributors

Stargazers

Watchers

Forkers

llm-trainer's Issues

ValueError: You can't pass `load_in_4bit`or `load_in_8bit` as a kwarg when passing `quantization_config` argument at the same time.

what are the package requirements versions?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent