-
Notifications
You must be signed in to change notification settings - Fork 61
Create NVFP4 grouped gemm bindings in direct bindings #4662
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: grouped_gemm
Are you sure you want to change the base?
Conversation
Review updated until commit 78848d9 Description
Changes walkthrough 📝
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
9a69511
to
78848d9
Compare
!test |
The API for MXFP8 and NVFP4 are different in SGLang.
PR Stack:
#4649 MXFP8 Grouped GEMM Cutlass
#4662 NVFP4 Grouped GEMM Cutlass <<< This PR. ---- #4676 NVFP4 GEMM Cutlass
Example:
TODOs: