Add train_minions utils for training memory estimate #33

Open
RahulSChand wants to merge 4 commits into HazyResearch:main from RahulSChand:train_minions

@RahulSChand RahulSChand commented Mar 22, 2025

This PR adds a train_minions.py file in utils that automatically detects the underlying hardware and selects the best model and training config (Full/LoRA). Several TODOs remain:

  • Support quantization
  • Support FSDP
  • Support QLoRA
  • Support MoE
  • Assumes a sequence length of 512 (should this be taken as input?)
  • Account for the overhead of parallelization methods
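
For reviewers, here is a rough sketch of the kind of estimate involved. It is not the code in this PR: the function names (`full_finetune_bytes`, `lora_bytes`, `pick_config`) are hypothetical, and the byte counts assume mixed-precision training with Adam (fp16 weights and gradients plus fp32 master weights and two optimizer moments, about 16 bytes per trainable parameter). Activation memory, which depends on sequence length and batch size, is deliberately left out, as are quantization and parallelization overheads.

```python
from dataclasses import dataclass
from typing import Optional

GIB = 1024 ** 3


@dataclass(frozen=True)
class Estimate:
    mode: str          # "full" or "lora"
    bytes_needed: int  # estimated weight + gradient + optimizer memory


def full_finetune_bytes(n_params: int) -> int:
    # Assumption: fp16 weights (2 B) + fp16 grads (2 B)
    # + fp32 master weights, Adam first and second moments (12 B).
    return n_params * 16


def lora_bytes(n_params: int, n_trainable: int) -> int:
    # Assumption: frozen base model kept in fp16 (2 B per parameter),
    # with the full 16 B training state only for the adapter parameters.
    return n_params * 2 + n_trainable * 16


def pick_config(n_params: int, n_trainable: int,
                available_bytes: int) -> Optional[Estimate]:
    """Prefer full fine-tuning, fall back to LoRA, else return None."""
    full = full_finetune_bytes(n_params)
    if full <= available_bytes:
        return Estimate("full", full)
    lora = lora_bytes(n_params, n_trainable)
    if lora <= available_bytes:
        return Estimate("lora", lora)
    return None


if __name__ == "__main__":
    # Hypothetical 7B-parameter model with 20M adapter parameters on a 24 GiB GPU.
    print(pick_config(7_000_000_000, 20_000_000, 24 * GIB))
```

Because activations are ignored, the result is a lower bound. A real selector would need headroom on top of it, and the sequence-length question in the list above directly affects how much.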

@RahulSChand RahulSChand changed the title Add train_minions utils Add train_minions utils for training memory estimate Mar 25, 2025