Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: Enable EPLB to existing MoE models
#5203 opened Jun 13, 2025 by syuoni Loading…
[fix][test] Speedup Nemotron NAS unittests
#5202 opened Jun 13, 2025 by omera-nv Loading…
Test
#5199 opened Jun 13, 2025 by ZhanruiSunCh Draft
Merge current waive list with the ToT waive list
#5198 opened Jun 13, 2025 by yiqingy0 Loading…
tests: add ds r1 tp4 test
#5197 opened Jun 13, 2025 by xinhe-nv Draft
tests: add multi nodes tests
#5196 opened Jun 13, 2025 by xinhe-nv Draft
test: add deepseek rcca cases
#5195 opened Jun 13, 2025 by ruodil Loading…
refactor: dummy request creation
#5192 opened Jun 13, 2025 by lfr-0531 Loading…
[TRTLLM-5516] perf: replicate dummy request for cuda graph padding (cherry-pick #4729) Release Blocker PRs that blocking the final release build or branching out the release branch
#5190 opened Jun 13, 2025 by kaiyux Loading…
[chore] Linking fixes to NVRTC wrapper Community want to contribute PRs initiated from Community
#5189 opened Jun 13, 2025 by AlessioNetti Loading…
optimize memset before alltoall communication
#5188 opened Jun 13, 2025 by dongxuy04 Loading…
test: add llama4 models for perf test
#5187 opened Jun 13, 2025 by ruodil Loading…
add dgx b200 8gpu test case in post merge
#5185 opened Jun 13, 2025 by yuanjingx87 Loading…
feat: MoE trtllm backend kernel update
#5183 opened Jun 13, 2025 by rosenrodt Loading…
[doc] Update Perf-Overview.MD with V0.20 Release Data
#5176 opened Jun 13, 2025 by zbpatel Loading…
[feat] Add progress bar to benchmark
#5173 opened Jun 12, 2025 by arekay Loading…
ProTip! Add no:assignee to see everything that’s not assigned.