[FA] add persistent variant #77
Conversation
@manman-ren has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Can you take a look at the CI failure? If the kernel does not work on the Triton versions we test (the PyTorch-bundled Triton and Triton main), we need to add it to the skip list: https://github.com/pytorch-labs/tritonbench/tree/main/test/test_gpu
Force-pushed from ba794c5 to de3628b.
LGTM. We are now fixing H (num_heads) here and only varying seq_len, but that sounds fine to me. For those interested in reproducing the FA3 paper numbers, we can add another option for input data shapes.
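For context, a hypothetical sketch of the kind of shape option discussed above: the benchmark in this PR fixes H (num_heads) and sweeps seq_len, and a second option could generate a different set of shapes. The function name and the concrete values below are illustrative, not taken from tritonbench.

```python
# Hypothetical helper, not tritonbench's actual API: fix num_heads and
# sweep seq_len, yielding (Batch, Heads, SeqLen, Dhead) tuples.
def sweep_seq_len(batch=4, num_heads=32, d_head=128,
                  seq_lens=(512, 1024, 2048, 4096, 8192, 16384)):
    for seq_len in seq_lens:
        yield (batch, num_heads, seq_len, d_head)
```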
@manman-ren has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@manman-ren merged this pull request in 0a82d3d.
Hongtao identified the performance issue with the initial implementation and updated the assignment of tiles to each SM.
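Below is a minimal, illustrative sketch (not the kernel from this PR) of the persistent-scheduling idea referred to above: one Triton program is launched per SM, and each program strides over the flattened list of output tiles, so tiles are assigned round-robin across SMs. The kernel name and variables are hypothetical; a real flash-attention kernel would decode each tile index into (batch, head, query-block) coordinates and do the attention computation in the loop body.

```python
import torch
import triton
import triton.language as tl


@triton.jit
def persistent_tile_demo(out_ptr, num_tiles, NUM_SMS: tl.constexpr):
    pid = tl.program_id(0)
    # One program per SM; each program handles tiles pid, pid + NUM_SMS, ...
    for tile_id in range(pid, num_tiles, NUM_SMS):
        # Stand-in for real work: record which program owned this tile.
        tl.store(out_ptr + tile_id, pid)


num_sms = torch.cuda.get_device_properties(0).multi_processor_count
num_tiles = 4096
owner = torch.empty(num_tiles, dtype=torch.int32, device="cuda")
persistent_tile_demo[(num_sms,)](owner, num_tiles, NUM_SMS=num_sms)
```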
Performance with warp specialization
| (Batch, Heads, SeqLen, Dhead) | triton_tutorial_flash_v2_tma_ws_persistent-tflops | triton_tutorial_flash_v2_tma_ws-tflops | triton_tutorial_flash_v2-tflops |
| --- | --- | --- | --- |