Skip to content

Commit 02e1212

Browse files
fix: Add GROUP_RANK (#448) (#449)
Signed-off-by: oliver könig <okoenig@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com> Co-authored-by: oliver könig <okoenig@nvidia.com>
1 parent 05d08cd commit 02e1212

1 file changed

Lines changed: 1 addition & 0 deletions

File tree

nemo_run/core/execution/templates/dgxc.sh.j2

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,7 @@ export TORCHX_MAX_RETRIES={{max_retries}}
88
{%- for env_var in env_vars %}
99
{{env_var}}
1010
{%- endfor %}
11+
export GROUP_RANK=$(echo $HOSTNAME | grep -oE '[0-9]+$')
1112

1213
{%- if ft_enabled %}
1314
{{ fault_tolerance.ft_launcher_setup(fault_tol_cfg_path, fault_tol_finished_flag_file, fault_tol_job_results_file) }}

0 commit comments

Comments
 (0)