Skip to content

colorzhang/TinyLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

TinyLM

Train a TinyLM from scratch, slides

Task and Scenario

Train a tiny LLM to write poem in Chinese

Data preprocessing

56315 Chinese Tang poem

Models trained

nanoGPT

Modified from https://github.com/karpathy/nanoGPT

size: 0.1-0.7B

Train loop: use naive train loop in nanoGPT

MyMistral

Modified from Mistral-7B

size: 0.4-0.9B

Train loop: use HuggingFace tranformers trainer

Result

Poem

Roadmap

  • Pretrain
  • Single node
  • SFT
  • Alignment
  • Multi-node multi-gpu / FSDP
  • Small model 1-7B
  • MoE
  • Multimodal
  • Fine-tune

Winston Zhang

2024/03/08

About

Train a TinyLM from scratch

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published