Skip to content

Pull requests: modelscope/ms-swift

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Megatron]Enable CP > 1 for Qwen3.5 and fix missing Megatron logging
#8224 opened Mar 6, 2026 by yangbofun Loading…
2 of 4 tasks
Feature/ms swift custom
#8222 opened Mar 6, 2026 by LEWISZZZcc Loading…
4 tasks
add sft no cot
#8216 opened Mar 6, 2026 by vesdas Loading…
[WIP] Moe kernel for qwen3 omni in ascend
#8214 opened Mar 5, 2026 by jiaqiw09 Loading…
1 of 4 tasks
[docs] support uv
#8190 opened Mar 4, 2026 by Jintao-Huang Loading…
feat: log grpo input images to wandb
#8157 opened Mar 2, 2026 by shunk031 Loading…
1 of 4 tasks
[megatron] qwen3.5 use megatron-core
#8126 opened Feb 27, 2026 by Jintao-Huang Loading…
[feat] support frames packing for minicpmv4_5 video processing
#8046 opened Feb 13, 2026 by fanqiNO1 Loading…
2 of 4 tasks
Add QAT (Quantization-Aware Training) Support Callback
#8042 opened Feb 12, 2026 by y2logic Loading…
1 task done
[v4] refactor v4 dataset sp patch_tasks
#7878 opened Jan 23, 2026 by Jintao-Huang Loading…
fix(megatron): disable checkpointing when calculate KL
#7828 opened Jan 20, 2026 by zzc0430 Loading…
1 of 4 tasks
Update moe.sh
#7375 opened Jan 13, 2026 by Itime-ren Loading…
4 tasks
[grpo] support gigpo with gym
#7364 opened Jan 12, 2026 by londa61 Loading…
3 tasks
[feature] add support for EAFT loss
#7361 opened Jan 12, 2026 by ymxyll Loading…
3 tasks
add sglang reasoning parser
#7171 opened Dec 23, 2025 by eliasyin Loading…
1 of 4 tasks
support cce、tiledmlp、activation cpu offload
#7169 opened Dec 23, 2025 by meichangsu1 Loading…
1 of 4 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.