-
Notifications
You must be signed in to change notification settings - Fork 823
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QST]The best way to get origin coord from a fragment by cute? #1542
Comments
https://github.com/NVIDIA/cutlass/blob/main/media/docs/cute/0y_predication.md This documentation should let you do what you want to do |
The link is 404, can u send a available one? |
comment updated inline |
Hi thakkarV thanks for your kind reply.
The wired thing is:
Does I wrong with this? If yes how can I fix it? |
Switch to cutlass 3.4.1 it will work, 3.5 is not right. |
How can get the origin coord of register framgment for a shared mem matrix. I want use the cute::gemm to calculate matrix multiply and mask Like:
C = Mask(A x B)
So I use the cute api like below:
For now I got the tmp result for C, before write back to global mem, I want do some other calculate for C, like Mask below:
Or
The cute is less of doc, I reading the source code, but can't find a way to do like this...
The text was updated successfully, but these errors were encountered: