Skip to content

Commit

Permalink
Improve performance of shape 1024x1024x1024 out of box (#2839)
Browse files Browse the repository at this point in the history
Close #2822

After:
90% -> 103%
  • Loading branch information
ESI-SYD authored Nov 28, 2024
1 parent 4d3a94d commit 31fe770
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion benchmarks/triton_kernels_benchmark/gemm_benchmark.py
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@
] + [
triton.Config(
{'BLOCK_SIZE_M': 256, 'BLOCK_SIZE_N': 128, 'BLOCK_SIZE_K': 32, 'GROUP_SIZE_M': 4, 'grf_mode': 'large'},
num_stages=s, num_warps=32) for s in [2, 3]
num_stages=s, num_warps=32) for s in [2, 3, 4]
] + [
triton.Config(
{'BLOCK_SIZE_M': 64, 'BLOCK_SIZE_N': 128, 'BLOCK_SIZE_K': 32, 'GROUP_SIZE_M': 4, 'grf_mode': 'large'},
Expand Down

0 comments on commit 31fe770

Please sign in to comment.