You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Synced with @zhczhong, even if we improved K dimension tail handling logic, it will lead to dynamic shape in brgemm, which could not be further lowered. So we need to further restrain the tuning space.
Pull in all the restrictions to eliminate the possible "dynamic shapes" which feeding into brgemm could avoid the above lowering issue, but the config will result in sub-optimal performance.
As the "dynamic shape" here is mainly introduced by the undividable dims during tiling stage, which is actually a limited set of compile-time known shapes, we might need to support such scenario to improve the overall performance.
During the tuning phase, we observed invalid config as follows
In this case, the existing tiling logic does not correctly handle the boundary of K dimension, generating code like
and causing runtime out of bound access.
The text was updated successfully, but these errors were encountered: