Consider allocating the acc matrix on each call for XeTLA GEMM benchmarks #3038
This would bring the two implementations closer together, since for Triton this allocation also happens on a per-call basis.