[IterativeTilingAndFusionPass] Wrap linalg.ops in a loop even if the shape is smaller than min tiling size
#332
When the shape of a linalg operation is smaller than or equal to the minimal tile size (32), the operation is left untouched. This is a problem because our GPU pipeline expects a for-loop (which later describes the launch grid) after the `IterativeTilingAndFusion` pass. If there is no loop, the pipeline breaks. For stability, I would expect such operations to be wrapped in a single-iteration for-loop so that the pipeline keeps working in these corner cases.
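To make the requested behaviour concrete, here is a minimal Python sketch of the per-dimension decision. It is not the pass's actual implementation: `LoopNest`, `plan_tiling`, and the `always_wrap` flag are hypothetical names invented for illustration, and only the tile size of 32 comes from this issue.

```python
from dataclasses import dataclass
from typing import Optional

MIN_TILE_SIZE = 32  # minimal tile size quoted in this issue


@dataclass(frozen=True)
class LoopNest:
    """Hypothetical stand-in for the for-loop a tiling pass would emit."""
    lower: int
    upper: int
    step: int

    @property
    def trip_count(self) -> int:
        # Ceiling division: number of iterations of the loop.
        return max(0, -(-(self.upper - self.lower) // self.step))


def plan_tiling(dim_size: int,
                tile_size: int = MIN_TILE_SIZE,
                always_wrap: bool = True) -> Optional[LoopNest]:
    """Return the loop for one dimension, or None if the op is left untouched.

    always_wrap=False models the behaviour reported in this issue;
    always_wrap=True models the requested behaviour (an assumption about
    how the fix could look, not the real pass option).
    """
    if dim_size <= 0:
        raise ValueError("dim_size must be positive")
    if dim_size <= tile_size:
        if not always_wrap:
            return None  # op stays as-is, so no loop reaches the GPU pipeline
        # Single-iteration loop: the step covers the whole dimension.
        return LoopNest(lower=0, upper=dim_size, step=dim_size)
    return LoopNest(lower=0, upper=dim_size, step=tile_size)
```

With `always_wrap=True`, a dimension of size 16 yields a loop with a trip count of 1, so later stages that map the loop to a launch grid still have something to work with.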
P.S. This is not critical, since real-life workloads are unlikely to contain ops with such small shapes.