Skip to content

Investigate using block loads for A^T gemm kernels #3327

@alexbaden

Description

@alexbaden

GEMM kernels with a transposed A matrix do not currently use 2D block loads. The load lowering falls back to the LLVM path. Investigate whether or not we can use a 2D block load, or improve performance of the LLVM scalar/vector load path.

Metadata

Metadata

Assignees

Type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions