You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GEMM kernels with a transposed A matrix do not currently use 2D block loads. The load lowering falls back to the LLVM path. Investigate whether or not we can use a 2D block load, or improve performance of the LLVM scalar/vector load path.