Skip to content

Commit 09a501a

Browse files
committed
Disable Fused attn when thd is true
1 parent 00deb30 commit 09a501a

1 file changed

Lines changed: 5 additions & 0 deletions

File tree

  • bionemo-recipes/recipes/codonfm_native_te/slurm

bionemo-recipes/recipes/codonfm_native_te/slurm/1b.sh

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -121,6 +121,11 @@ echo "Job ID: ${SLURM_JOB_ID}"
121121
echo "Nodes: ${SLURM_JOB_NUM_NODES}"
122122
echo "========================================="
123123
124+
# cuDNN fused-attn sub-backend 1 OOMs on Blackwell (sm_103) with THD+padding (TE 2.12 / cuDNN 9.19); force flash-attn varlen.
125+
if [ "${USE_SEQUENCE_PACKING}" = "True" ]; then
126+
export NVTE_FUSED_ATTN=0
127+
fi
128+
124129
# Pick training script based on distributed strategy.
125130
case "${DIST_STRATEGY}" in
126131
fsdp)

0 commit comments

Comments
 (0)