Skip to content

Tools and training scripts for large language models

License

Notifications You must be signed in to change notification settings

gpauloski/llm-pytorch

Repository files navigation

LLM Training Scripts

Docs Tests pre-commit.ci status

Tools and training scripts I have developed for building large language models in PyTorch.

This repository provides:

  • data preprocessing scripts,
  • training scripts, and
  • training guides.

This repository is the successor to my old training tools BERT-PyTorch as the old code had a lot of technical debt and was not well tested. Compared to the old repository, this codebase aims to have better code health and maintainability thanks to tests, type checking, linters, documentation, etc.

Install

See the Installation Guide.

Getting Started

See the available Guides.

About

Tools and training scripts for large language models

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Contributors 4

  •  
  •  
  •  
  •  

Languages