Skip to content

Pull requests: LLM360/Reasoning360

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update verl to nightly (0.4.1.dev+f0964b6)
#117 opened Jul 17, 2025 by ZYHowell Loading…
Update tensordict requirement from <=0.6.2 to <=0.9.1 dependencies Pull requests that update a dependency file python Pull requests that update python code
#116 opened Jul 14, 2025 by dependabot bot Loading…
Bump sglang[all] from 0.4.6.post5 to 0.4.9.post2 dependencies Pull requests that update a dependency file python Pull requests that update python code
#115 opened Jul 14, 2025 by dependabot bot Loading…
[fix] Pre-calculate n_samples * n_rollout
#114 opened Jul 14, 2025 by BlankCheng Loading…
Feature/ifbench
#113 opened Jul 12, 2025 by Jianshu1only Loading…
1 task
[fix] Fix math reward hanging [WIP]
#109 opened Jul 7, 2025 by BlankCheng Loading…
Bump tokenizers from 0.19.1 to 0.21.2 dependencies Pull requests that update a dependency file python Pull requests that update python code
#108 opened Jun 30, 2025 by dependabot bot Loading…
Faster ordering puzzle generation
#50 opened Apr 30, 2025 by nilabjodey Loading…
ProTip! Adding no:label will show everything without a label.