Pull requests: NVIDIA/TensorRT-LLM
[draft][fix] rewrite completion API to avoid repetitive tokens
#5201
opened Jun 13, 2025 by
LinPoly
[TRTLLM-5516] perf: replicate dummy request for cuda graph padding (cherry-pick #4729)
Release Blocker
PRs that blocking the final release build or branching out the release branch
#5190
opened Jun 13, 2025 by
kaiyux
[chore] Linking fixes to NVRTC wrapper
Community want to contribute
PRs initiated from Community
#5189
opened Jun 13, 2025 by
AlessioNetti
[TRTLLM-5653][infra] Run docs build only if PR contains only doc changes
#5184
opened Jun 13, 2025 by
zhanga5
Add debug hook to support dump tensor data and add new debug functions easily
#5182
opened Jun 13, 2025 by
HuiGao-NV
Removed <think> on head of reasoning_content for DeepSeek-R1 model
#5181
opened Jun 13, 2025 by
k-l-lambda
test: Add json_mode_eval for guided decoding evaluation
#5179
opened Jun 13, 2025 by
syuoni
enh: Add script to map tests <-> jenkins stages & vice-versa
#5177
opened Jun 13, 2025 by
venkywonka
Draft
feat: add multi-node support for Triton with pytorch backend
#5172
opened Jun 12, 2025 by
achartier