Skip to content

Pull requests: AI-Hypercomputer/torchprime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Implement gradient clipping
#286 opened Jun 6, 2025 by tengyifei Loading…
Add SFT trainer and sft task
#284 opened Jun 5, 2025 by jialei777 Loading…
Distributed Checkpointing
#275 opened Jun 2, 2025 by hlnchen Loading…
Add SFT
#238 opened May 10, 2025 by jialei777 Draft
3 of 10 tasks
Compatibility for 2.7 release
#213 opened Apr 23, 2025 by zpcore Loading…
Add TPU verbose logging flags
#122 opened Feb 24, 2025 by tengyifei Loading…
deepseek r1 running
#102 opened Feb 10, 2025 by qihqi Loading…
init llama infer draft Do not merge if it's a PR
#61 opened Jan 30, 2025 by yaochengji Loading…
ProTip! What’s not been updated in a month: updated:<2025-05-06.