Support better tiling algorithms for ANE #9630
Labels
triaged
This issue has been looked at a team member, and triaged and prioritized into an appropriate module
Milestone
🚀 The feature, motivation and pitch
Support better tiling algorithms for ANE in linear, SDPA, matmul, bmm, etc. We noticed that we can boost performance by explicitly splitting up ops in PyTorch, but ideally this would be done by the CoreML compiler.
Alternatives
No response
Additional context
No response
RFC (Optional)
No response
The text was updated successfully, but these errors were encountered: