
error with flash attention #85

Open · staple-pi opened this issue Nov 25, 2023 · 9 comments

Comments

staple-pi commented Nov 25, 2023

Hi,
I'm trying to run amg_example.py, but I get a UserWarning: Torch was not compiled with flash attention. (Triggered internally at ..\aten\src\ATen\native\transformers\cuda\sdp_utils.cpp:253.) raised from return torch.nn.functional.scaled_dot_product_attention(q_, k_, v_, attn_mask=attn_bias) in flash_4.py, line 369. How can I solve it?
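
For context, a minimal sketch (not from the original report; it assumes a CUDA device and half-precision inputs, which the flash backend requires) that exercises the same scaled_dot_product_attention code path outside the repo and surfaces the same warning on builds without flash attention:

```python
import torch
import torch.nn.functional as F

# The flash backend requires half-precision CUDA tensors.
q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

# On a torch build without flash attention this emits the same
# "Torch was not compiled with flash attention" UserWarning and
# silently falls back to another SDPA backend.
out = F.scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([1, 8, 128, 64])
```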

staple-pi (Author) commented Nov 25, 2023

I have already set os.environ['SEGMENT_ANYTHING_FAST_USE_FLASH_4'] = '0', but it looks like flash_4 is still being used anyway.
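
One thing worth double-checking (an assumption, not confirmed in this thread): if the flag is read when the package is imported rather than at call time, it has to be set before the import, e.g.:

```python
import os

# Set the flag before importing the package, in case the value is
# read once at import time rather than on every call.
os.environ['SEGMENT_ANYTHING_FAST_USE_FLASH_4'] = '0'

import segment_anything_fast  # package name assumed from the env-var prefix
```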

cpuhrsch (Contributor) commented

Hello @staple-pi, thanks for opening this issue. The error you're reporting doesn't seem related to flash_4, but rather to your installation of torch. Can you share a bit more detail on which version of torch you're using and on what platform?
Thanks!

bigmover commented

> Hello @staple-pi, thanks for opening this issue. The error you're reporting doesn't seem related to flash_4, but rather to your installation of torch. Can you share a bit more detail on which version of torch you're using and on what platform? Thanks!

torch 2.0.1+cu117 will reproduce the error! Is sam-fast specified to require torch 2.2?

staple-pi (Author) commented

> Hello @staple-pi, thanks for opening this issue. The error you're reporting doesn't seem related to flash_4, but rather to your installation of torch. Can you share a bit more detail on which version of torch you're using and on what platform? Thanks!

Thank you for your reply! Actually, I have already installed the preview version of torch (2.2.0.dev20231123+cu121), and I'm using it in an environment created by Anaconda with Python 3.10.13 on Windows.

cpuhrsch (Contributor) commented Dec 1, 2023

@staple-pi Hm, interesting. This might be a general issue with PyTorch on Windows. Can you use scaled_dot_product_attention otherwise or just not in the context of this project?

staple-pi (Author) commented Dec 2, 2023

> @staple-pi Hm, interesting. This might be a general issue with PyTorch on Windows. Can you use scaled_dot_product_attention otherwise or just not in the context of this project?

After testing, scaled_dot_product_attention works elsewhere in the same environment.
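
For completeness, the torch 2.x backend flags can be printed to see which SDPA backends the current process has enabled for dispatch (note these report the enable toggles, not whether the kernels were compiled into the build):

```python
import torch

# Report which scaled_dot_product_attention backends are currently
# enabled for dispatch in this process (toggles, not build support).
print("flash:        ", torch.backends.cuda.flash_sdp_enabled())
print("mem_efficient:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math:         ", torch.backends.cuda.math_sdp_enabled())
```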

cpuhrsch (Contributor) commented Dec 7, 2023

@staple-pi - I landed some optimizations for the AMG. Can you try again?

staple-pi (Author) commented

> @staple-pi - I landed some optimizations for the AMG. Can you try again?

Unfortunately, the error still appears.

drisspg commented Dec 15, 2023

Are you using sdp_kernel to enable only "flash_attention"? Flash Attention 2 is not supported on Windows, and we don't have an ETA on when/if this support will be added.
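
For reference, a sketch of restricting SDPA to the flash backend with the torch 2.x context manager; on a build without flash attention (e.g. Windows at the time of this thread) this raises instead of silently falling back:

```python
import torch
import torch.nn.functional as F

q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

# Allow only the flash backend; builds that lack flash attention
# raise a RuntimeError here rather than falling back to the math
# or memory-efficient backends.
with torch.backends.cuda.sdp_kernel(
    enable_flash=True, enable_math=False, enable_mem_efficient=False
):
    out = F.scaled_dot_product_attention(q, k, v)
```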
