Commit c54c7e5

Update nccl error handling flag
Signed-off-by: Davis Wertheimer <davis.wertheimer@ibm.com>
1 parent 6b2ac79 commit c54c7e5

1 file changed

Lines changed: 1 addition & 1 deletion

fms_fsdp/utils/train_utils.py

@@ -184,7 +184,7 @@ def setup():
 
 def setup_environ_flags():
     os.environ["TORCH_SHOW_CPP_STACKTRACES"] = str(1)
-    os.environ["NCCL_ASYNC_ERROR_HANDLING"] = str(1)
+    os.environ["TORCH_NCCL_ASYNC_ERROR_HANDLING"] = str(1)
 
 
 def get_mixed_precision_policy(cfg, rank):
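
For context, the changed helper can be sketched as a standalone function. This is a minimal illustration, not the repository's code: the `set_legacy_name` parameter is hypothetical and not part of this commit, and it assumes that older PyTorch builds read the unprefixed `NCCL_ASYNC_ERROR_HANDLING` name while newer ones read the `TORCH_`-prefixed name.

```python
import os


def setup_environ_flags(set_legacy_name: bool = False) -> None:
    """Set the debugging and NCCL error-handling flags before process-group init."""
    os.environ["TORCH_SHOW_CPP_STACKTRACES"] = str(1)
    # Name used after this commit.
    os.environ["TORCH_NCCL_ASYNC_ERROR_HANDLING"] = str(1)
    if set_legacy_name:
        # Hypothetical fallback (not in the commit): also export the old,
        # unprefixed name, without overriding a value the user already set.
        os.environ.setdefault("NCCL_ASYNC_ERROR_HANDLING", str(1))


if __name__ == "__main__":
    setup_environ_flags()
    print(os.environ["TORCH_NCCL_ASYNC_ERROR_HANDLING"])
```

Environment variables like these generally need to be set before the distributed process group is created, which is why the helper only touches `os.environ` and has no other side effects.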
