-
Notifications
You must be signed in to change notification settings - Fork 614
Pull requests: huggingface/open-r1
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Adding GRPOTrainer Group-Based Advantage Normalisation (Paper Eq. 3)
#66
opened Jan 27, 2025 by
agulati18
Loading…
chore: update trl to grpo_vllm branch, move lighteval to uv
#30
opened Jan 25, 2025 by
gerred
Loading…
ProTip!
Filter pull requests by the default branch with base:main.