A Pipeline for Fine-Tuning HuggingFace Models

This repo provides the whole pizza for fine-tuning HuggingFace models (e.g. Llama2, DeepSeek, StarCoder, or Code Llama) on any task. It has been built primarily for code generation tasks. The pipeline includes:

  1. Both Constant Length Dataset Loader and Padded Dataset Loader. The constant length one is good for code generation (e.g. Copilot) or "further pre-training", while the padded one is typically better for instruction-tuning.
  2. Scaling laws for computing the correct number of training steps, given the number of GPUs, the effective batch size, and the number of epochs (see the sketch after this list)
  3. LoRA, with 8-bit, 4-bit, and QLoRA (double quantization) support
  4. FlashAttention2 for super-duper fast long sequence training
  5. DeepSpeed support for fine-tuning large models by offloading to multiple GPUs and the CPU
  6. Edu-score filtering to remove non-educational data
  7. Multi-programming-language loss evaluation (using MultiPL-E evaluation datasets)
  8. Custom tokenizer injection
  9. Automatic mixed precision
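
As a rough illustration of item 2, the step "scaling law" boils down to dividing the total number of training samples by the effective batch size. The sketch below shows that arithmetic in generic form; the function and argument names are illustrative and are not taken from main.py.

```python
# Hypothetical sketch of the step arithmetic; names are illustrative,
# not the repo's actual code.
def compute_max_steps(num_examples: int, epochs: float,
                      per_device_batch_size: int, grad_accum_steps: int,
                      num_gpus: int) -> int:
    # Effective batch size = samples consumed per optimizer step across all GPUs.
    effective_batch_size = per_device_batch_size * grad_accum_steps * num_gpus
    # Total optimizer steps needed to see the dataset `epochs` times.
    return int(num_examples * epochs // effective_batch_size)

# 100k examples, 2 epochs, batch 4 per GPU, 8 grad-accum steps, 4 GPUs
# -> effective batch size 128 -> 1562 steps
print(compute_max_steps(100_000, 2, 4, 8, 4))
```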

Generic Usage

This repo is driven by the main.py script. It supports a wide range of arguments, which can be listed using python main.py --help. It may be simpler to look at the scripts in the run_scripts directory, which are used to run training on the different models with different settings.

LoRA

There is built-in support for LoRA, which can be enabled by passing the --lora flag. See run_scripts/run_starcoder_lora.sh for an example. There is additional support for some "lora hacks", like double quant, which can be enabled by passing the --lora_extreme flag.
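
For reference, these flags map onto the standard peft + bitsandbytes setup. The snippet below is a minimal, generic sketch of 4-bit double quantization with a LoRA adapter; the model id and hyperparameters are placeholders, and the repo's actual defaults may differ.

```python
# Generic QLoRA-style setup with peft + bitsandbytes (an assumption about what
# --lora / --lora_extreme roughly translate to, not the repo's exact code).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_use_double_quant=True,   # the "double quant" hack
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "bigcode/starcoderbase",          # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,  # illustrative hyperparameters
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```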

DeepSpeed

We support DeepSpeed, and we recommend it over LoRA for fine-tuning large models. See run_starcoder.sh or run_codellama_34b.sh for examples. Various DeepSpeed configs in this repo can be used right away.
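
A ZeRO-3 configuration with CPU offload is the typical shape of such a config. The dict below is a generic sketch of that shape, passed through transformers' TrainingArguments; it is not a copy of the JSON files shipped in this repo, and the values are placeholders.

```python
# Generic ZeRO-3 + CPU offload DeepSpeed config via the transformers integration;
# a sketch only, see the config files in this repo for the actual settings.
from transformers import TrainingArguments

ds_config = {
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {"device": "cpu"},
        "offload_param": {"device": "cpu"},
    },
    "bf16": {"enabled": "auto"},
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

args = TrainingArguments(
    output_dir="out",                # placeholder
    deepspeed=ds_config,             # a path to a JSON config also works
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    bf16=True,
)
```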

FlashAttention2

If you need to train on long sequences, you can use FlashAttention2. This can be enabled by passing the --fa2 flag. However, this will require you to install the FlashAttention2 package, which is not included in the requirements.
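
In recent transformers versions, FlashAttention2 is requested when the model is loaded; the snippet below illustrates that pattern, assuming the pipeline does something equivalent when --fa2 is passed. The model id is a placeholder, and flash-attn must be installed separately (pip install flash-attn).

```python
# Loading a model with FlashAttention2 enabled (illustrative; requires the
# separately installed flash-attn package and an fp16/bf16 dtype).
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "codellama/CodeLlama-7b-hf",              # placeholder model id
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",
)
```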

Evaluation

The evaluation for the models is done via the multipl_e_eval.sh script, which requires an installation of the MultiPL-E repo. This evaluation covers code generation only. Through this script, you can evaluate several checkpoints at the same time on different GPUs, across multiple languages and datasets (HumanEval or MBPP).

Pushing Checkpoints

There are two scripts that can be used as helpers for pushing checkpoints to HuggingFace (a minimal sketch of the underlying push follows the list):

  1. ./scripts/load_and_push_to_hub.py can be used to push a single checkpoint
  2. ./scripts/push_checkpoints.py can be used to push multiple checkpoints in the given directory
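
Under the hood, pushes like these typically rely on transformers' built-in Hub upload; the sketch below shows the core of pushing a single checkpoint. The checkpoint path and repo id are placeholders, and the actual scripts add argument parsing and multi-checkpoint handling on top of this.

```python
# Minimal sketch of pushing one local checkpoint to the HuggingFace Hub.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint_dir = "checkpoints/checkpoint-1000"   # placeholder path
repo_id = "your-username/your-model"             # placeholder repo id

model = AutoModelForCausalLM.from_pretrained(checkpoint_dir)
tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)
model.push_to_hub(repo_id)
tokenizer.push_to_hub(repo_id)
```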

Citation

If you use this code in your research, please cite it as follows:

@software{cassano2023finetuning,
    author = {Cassano, Federico},
    month = jun,
    title = {{A Pipeline for Fine-Tuning HuggingFace Models}},
    url = {https://github.com/cassanof/finetuning-harness},
    version = {1.0.0},
    year = {2023}
}
