-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Summary: Add `--pt2-sdpa` option to test compiled sdpa. Pull Request resolved: #105 Test Plan: ``` $ python run.py --op flash_attention --batch 1 --n-heads 24 --seq-len 4608 --d-head 128 --only cudnn,sdpa,flash_v3 --metrics proton --native-sdpa --pt2-sdpa ``` Time/ns: <img width="1920" alt="image" src="https://github.com/user-attachments/assets/d8ce093c-4aaf-4aff-b36e-3ea97f5c2c3d"> Tflops: <img width="1920" alt="image" src="https://github.com/user-attachments/assets/c7e36b6f-04ae-4dd8-b6b8-1546702e1134"> Reviewed By: FindHao Differential Revision: D66969136 Pulled By: xuzhao9 fbshipit-source-id: 40ed0223d99a10724e495cacaaeba7770adf78a6
- Loading branch information
1 parent
e4f5305
commit d94fae4
Showing
1 changed file
with
27 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters