Pull requests: vllm-project/vllm
Pull requests list
Add Flashinfer trtllm moe to compressed tensor FP4 path
#28090
opened Nov 5, 2025 by
Victor49152
•
Draft
5 tasks
[PERF] Decouple projections from GDN custom op. Attempt 2
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#28083
opened Nov 5, 2025 by
vadiklyutiy
•
Draft
remove resolve_op_overloads and use splitting_ops directly
#28081
opened Nov 5, 2025 by
BoyuanFeng
•
Draft
Add runai model streamer e2e test for GCS
ci/build
#28079
opened Nov 4, 2025 by
amacaskill
5 tasks done
Consolidate Nvidia ModelOpt quant config handling for all quantization methods
#28076
opened Nov 4, 2025 by
shengliangxu
5 tasks
[Core] add support for reasoning parser plugins
deepseek
Related to DeepSeek models
frontend
qwen
ready
structured-output
v1
#28075
opened Nov 4, 2025 by
walterbm
3 of 5 tasks
[Core] MoE degrade detection and graceful degradation. (Phase 0 of RFC #27774)
deepseek
qwen
#28073
opened Nov 4, 2025 by
tzulingk
[Refactor] Optimize select_experts
ready
#28069
opened Nov 4, 2025 by
yewentao256
[WIP] Rebase of https://github.com/vllm-project/vllm/pull/27134 to latest main
#28068
opened Nov 4, 2025 by
alexm-redhat
•
Draft
[Chore] Separate out attention backend constants from vllm.utils
rocm
Related to AMD ROCm
#28066
opened Nov 4, 2025 by
hezyin
[Kernels] Split up fused_moe/layer.py, isolate more modular kernel code
#28064
opened Nov 4, 2025 by
bnellnm
5 tasks
Fix prime rl test
ci/build
ready
#28062
opened Nov 4, 2025 by
rzabarazesh
5 tasks
[build][cmake]: Bundle ACL dynlibs and torch libgomp for CPU extension builds
ci/build
#28059
opened Nov 4, 2025 by
Radu2k
3 of 5 tasks
[wip] Fix torch nightly
ci/build
ready
#28057
opened Nov 4, 2025 by
rzabarazesh
5 tasks
[Chore] Clean up deepseek v2/v3 config copy
deepseek
speculative-decoding
#28055
opened Nov 4, 2025 by
Isotr0py
1 of 5 tasks
[Bugfix] fix confusing OOM errors during v1 init
v1
#28051
opened Nov 4, 2025 by
shivampr
3 of 5 tasks
[Frontend] Fix stream block and log format when enable response logging
frontend
#28049
opened Nov 4, 2025 by
esmeetu
5 tasks
[cudagraph] fix cudagraph warning in deepseekv32
deepseek
#28044
opened Nov 4, 2025 by
ZJY0516
5 tasks