You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix(pt2): neutralise shape assertions instead of erasing them
Erasing _assert_scalar nodes from the exported FX graph (and especially
calling eliminate_dead_code afterwards) disturbs the graph structure and
produces NaN gradients for DPA1/se_atten in the NoPBC case on some
Python/torch versions.
Replace each assertion's condition with True so the node stays in the
graph but never fires at runtime. This preserves the graph topology and
avoids the NaN issue across all tested configurations.
0 commit comments