-
Notifications
You must be signed in to change notification settings - Fork 61
Pull requests: NVIDIA/Fuser
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Vectorize fp4 even if the last dim is not contiguous
#4669
opened Jun 24, 2025 by
zasdfgbnm
Loading…
Host IR LLVM Lowering 1: Build Config Change & Initial Allocate support
#4651
opened Jun 17, 2025 by
wolfcomos
Loading…
[RFC] Add Cutlass MXFP8 Grouped Gemm to
nvfuser_direct
python bindings
Cutlass
Matmuls
Thunder-Inference-Demo
#4649
opened Jun 17, 2025 by
rdspring1
Loading…
Add fused Embedding and RMSNorm benchmarks
Python Benchmarks
#4637
opened Jun 13, 2025 by
IvanYashchuk
Loading…
Add embedding_indexing benchmark and Llama 4 Maverick configuration
Python Benchmarks
#4636
opened Jun 13, 2025 by
IvanYashchuk
Loading…
[WIP] Always do CGA split in persistent Hopper matmul
#4610
opened Jun 10, 2025 by
jacobhinkle
•
Draft
auto select between warp specialized and multi-wave approaches
#4603
opened Jun 9, 2025 by
liqiangxl
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.