You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix flash_attn_supported override for cross-attention causal mask
Factor out cross_attn_causal check to avoid no-backend errors when
FA3 is installed but flash attention doesn't support non-bottom-right
causal mask with different Q/KV sequence lengths.
0 commit comments