Consider wrapping the call to `self.attention` in `InterpretableMultiHeadAttention` with `torch.backends.cuda.sdp_kernel(enable_flash=True, enable_math=False, enable_mem_efficient=True)` to improve speed and memory efficiency.
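A rough sketch of what that wrapping could look like is below. This is a minimal stand-in, not the repository's actual module: the constructor, the use of `nn.MultiheadAttention` for `self.attention`, and the `forward()` signature are all assumptions for illustration. Two things to keep in mind: the context manager only influences attention that dispatches through `torch.nn.functional.scaled_dot_product_attention`, and newer PyTorch releases deprecate `torch.backends.cuda.sdp_kernel` in favor of `torch.nn.attention.sdpa_kernel`.

```python
import torch
from torch import nn

# Minimal sketch of the suggested change, not the repository's actual code:
# the layer sizes and forward() here are simplified stand-ins, and
# nn.MultiheadAttention substitutes for whatever self.attention really is.
class InterpretableMultiHeadAttention(nn.Module):
    def __init__(self, d_model: int, n_head: int):
        super().__init__()
        self.attention = nn.MultiheadAttention(d_model, n_head, batch_first=True)

    def forward(self, q, k, v, mask=None):
        # Allow only the flash and memory-efficient SDPA backends while the
        # attention call runs; the slower math fallback is disabled.
        with torch.backends.cuda.sdp_kernel(
            enable_flash=True, enable_math=False, enable_mem_efficient=True
        ):
            # need_weights=False lets MultiheadAttention take the code path
            # that dispatches to scaled_dot_product_attention, which is what
            # the context manager above controls.
            out, _ = self.attention(q, k, v, attn_mask=mask, need_weights=False)
        return out


# Example usage (the flash/mem-efficient kernels require a CUDA device):
if torch.cuda.is_available():
    mha = InterpretableMultiHeadAttention(d_model=64, n_head=4).cuda()
    x = torch.randn(8, 32, 64, device="cuda")
    out = mha(x, x, x)
```

One caveat: the flash and memory-efficient kernels never materialize the full attention-weight matrix, so if `InterpretableMultiHeadAttention` exposes those weights for interpretability, the fast kernels may only be usable on code paths that don't need them.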