[FA] Update autotune configs on default path (#2475)
This PR extends the autotune configurations on the default path to include ones equivalent to those used on the advanced path, where `BLOCK_M` is 128 and `num_warps` is `8` or `16`. This gives a 4% geomean performance improvement for FA out of the box.

Signed-off-by: Whitney Tsang <[email protected]>
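As a rough illustration of the change described above, here is a minimal sketch of an autotune search space that includes configurations matching the advanced path. It is not the code from this PR: the `Config` dataclass is a plain-Python stand-in for an autotune config object, the helper names are hypothetical, and every value other than `BLOCK_M = 128` and `num_warps` of `8` or `16` (including `BLOCK_N` and the alternative `BLOCK_M`) is an assumption chosen for the example.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Config:
    """Plain-Python stand-in for an autotune config (hypothetical, not the PR's type)."""
    BLOCK_M: int
    BLOCK_N: int
    num_warps: int


def default_path_configs(block_ms=(64, 128), block_ns=(32, 64), warps=(8, 16)):
    # Enumerate the candidate search space as a Cartesian product.
    # BLOCK_M=64 and the BLOCK_N values are illustrative assumptions only.
    return [Config(bm, bn, w) for bm, bn, w in product(block_ms, block_ns, warps)]


def matches_advanced_path(cfg):
    # From the commit message: on the advanced path BLOCK_M is 128
    # and num_warps is 8 or 16.
    return cfg.BLOCK_M == 128 and cfg.num_warps in (8, 16)


configs = default_path_configs()
advanced_like = [c for c in configs if matches_advanced_path(c)]

for c in advanced_like:
    print(c)
```

The autotuner would then benchmark each candidate and keep the fastest one per problem shape, so adding the advanced-path-equivalent entries only widens the set it can choose from.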