Commit cb34225

chore: bump Megatron-Bridge to latest main (7110a96) (#2223)
Signed-off-by: Yuki Huang <yukih@nvidia.com>
1 parent cf58b93 commit cb34225

25 files changed

Lines changed: 298 additions & 161 deletions
Submodule Megatron-Bridge updated 308 files

3rdparty/Megatron-Bridge-workspace/setup.py

Lines changed: 2 additions & 1 deletion
@@ -25,6 +25,7 @@
 bridge_src_dir = "Megatron-Bridge/src/megatron/bridge"
 bridge_package_name = "megatron.bridge"

+# Default dependencies from pyproject.toml
 CACHED_DEPENDENCIES = [
 "transformers>=5.0.0,<=5.3.0",
 "peft>=0.18.1",
@@ -50,7 +51,7 @@
 # TODO(https://github.com/NVIDIA-NeMo/RL/issues/2111): upgrade to core_cu13 when we move to CUDA 13 base container
 "transformer-engine[pytorch,core_cu12]",
 "mamba-ssm",
-"nvidia-resiliency-ext~=0.5.0",
+"nvidia-resiliency-ext",
 "causal-conv1d",
 "flash-linear-attention",
 "timm",
Submodule Megatron-LM updated 174 files

3rdparty/Megatron-LM-workspace/setup.py

Lines changed: 5 additions & 4 deletions
@@ -50,7 +50,8 @@
 "nvidia-modelopt[torch]; sys_platform != 'darwin'",
 # TODO(https://github.com/NVIDIA-NeMo/RL/issues/2111): upgrade to core_cu13 when we move to CUDA 13 base container
 "transformer-engine[pytorch,core_cu12]",
-"nvidia-resiliency-ext @ git+https://github.com/NVIDIA/nvidia-resiliency-ext.git@v0.5.0",
+# VCS dependency - must match pyproject.toml [tool.uv.sources]
+"nvidia-resiliency-ext @ git+https://github.com/NVIDIA/nvidia-resiliency-ext.git@63154570cea17f8805a7fd15cc3b8cc2919ba575",
 "tqdm",
 "einops~=0.8",
 "tensorstore~=0.1,!=0.1.46,!=0.1.72",
@@ -65,10 +66,10 @@
 "flashinfer-python~=0.5.0",
 "wget",
 "onnxscript",
-# VCS dependency - must match pyproject.toml [tool.uv.sources]
-"emerging_optimizers @ git+https://github.com/NVIDIA-NeMo/Emerging-Optimizers.git@v0.1.0",
-"datasets",
 "fastapi~=0.50",
+"datasets",
+# VCS dependency - must match pyproject.toml [tool.uv.sources]
+"emerging_optimizers @ git+https://github.com/NVIDIA-NeMo/Emerging-Optimizers.git@v0.2.0",
 "hypercorn",
 "quart",
 "openai[aiohttp]",
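
The comment "VCS dependency - must match pyproject.toml [tool.uv.sources]" describes an invariant that is easy to break when bumping a pin by hand. Below is a minimal sketch of how such a check could look. It is not part of this commit: `parse_vcs_requirement`, `find_pin_mismatches`, and the shape of the `uv_sources` dict (a name mapped to a table with `git` plus `rev` or `tag`) are assumptions for illustration, and the example source entries are hypothetical because the `pyproject.toml` contents are not shown in this diff.

```python
import re

# Matches a direct reference of the form "name @ git+<url>@<rev>".
_VCS_RE = re.compile(
    r"^\s*(?P<name>[A-Za-z0-9._-]+)\s*@\s*git\+(?P<url>\S+?)@(?P<rev>[^@\s]+)\s*$"
)


def normalize(name):
    """Normalize a project name so '-', '_' and '.' compare equal."""
    return re.sub(r"[-_.]+", "-", name).lower()


def parse_vcs_requirement(requirement):
    """Return (name, url, rev) for a git direct reference, else None."""
    match = _VCS_RE.match(requirement)
    if match is None:
        return None
    return match.group("name"), match.group("url"), match.group("rev")


def find_pin_mismatches(requirements, uv_sources):
    """List (name, setup_rev, source_rev) where the two pins disagree.

    `uv_sources` is assumed to be the already-parsed [tool.uv.sources]
    table; a missing entry is reported with source_rev set to None.
    """
    sources = {normalize(key): value for key, value in uv_sources.items()}
    mismatches = []
    for requirement in requirements:
        parsed = parse_vcs_requirement(requirement)
        if parsed is None:
            continue  # ordinary (non-VCS) requirement, nothing to compare
        name, _url, rev = parsed
        source = sources.get(normalize(name))
        expected = None if source is None else (source.get("rev") or source.get("tag"))
        if expected != rev:
            mismatches.append((name, rev, expected))
    return mismatches


# Requirement strings copied from the diff above.
requirements = [
    "tqdm",
    "emerging_optimizers @ git+https://github.com/NVIDIA-NeMo/Emerging-Optimizers.git@v0.2.0",
]
# Hypothetical source table; the real pyproject.toml is not part of this page.
uv_sources = {
    "emerging-optimizers": {
        "git": "https://github.com/NVIDIA-NeMo/Emerging-Optimizers.git",
        "tag": "v0.1.0",
    }
}
print(find_pin_mismatches(requirements, uv_sources))
```

With the stale `v0.1.0` tag in the hypothetical source table, the check reports the `emerging_optimizers` entry; once both places name the same revision the list is empty.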

examples/configs/distillation_math.yaml

Lines changed: 1 addition & 0 deletions
@@ -113,6 +113,7 @@ policy: &POLICY_BASE
 moe_enable_deepep: false
 moe_token_dispatcher_type: "alltoall"
 moe_shared_expert_overlap: false
+gradient_accumulation_fusion: false

 optimizer:
 optimizer: "adam"

examples/configs/dpo.yaml

Lines changed: 1 addition & 0 deletions
@@ -147,6 +147,7 @@ policy:
 moe_enable_deepep: false
 moe_token_dispatcher_type: "alltoall"
 moe_shared_expert_overlap: false
+gradient_accumulation_fusion: false

 peft:
 enabled: false

examples/configs/grpo_math_1B.yaml

Lines changed: 1 addition & 0 deletions
@@ -155,6 +155,7 @@ policy:
 moe_enable_deepep: false
 moe_token_dispatcher_type: "alltoall"
 moe_shared_expert_overlap: false
+gradient_accumulation_fusion: false

 peft:
 enabled: false

examples/configs/recipes/llm/grpo-dapomath17k-dsv3-megatron.yaml

Lines changed: 0 additions & 1 deletion
@@ -30,7 +30,6 @@ policy:
 sequence_parallel: true
 moe_permute_fusion: true
 apply_rope_fusion: false
-gradient_accumulation_fusion: false
 moe_enable_deepep: true
 moe_token_dispatcher_type: flex
 optimizer:

examples/configs/recipes/llm/grpo-moonlight-16ba3b-4n8g-megatron-fp8-e2e.yaml

Lines changed: 0 additions & 1 deletion
@@ -26,7 +26,6 @@ policy:
 num_layers_in_first_pipeline_stage: 7
 num_layers_in_last_pipeline_stage: 6
 apply_rope_fusion: false
-gradient_accumulation_fusion: false
 fp8_cfg:
 enabled: true
 optimizer:

examples/configs/recipes/llm/grpo-moonlight-16ba3b-4n8g-megatron.yaml

Lines changed: 0 additions & 1 deletion
@@ -27,7 +27,6 @@ policy:
 num_layers_in_first_pipeline_stage: 7
 num_layers_in_last_pipeline_stage: 6
 apply_rope_fusion: false
-gradient_accumulation_fusion: false
 optimizer:
 lr: 1.0e-06
 scheduler:
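
The base example configs gain `gradient_accumulation_fusion: false` while the three recipe files drop the same line. One plausible reading, which the commit message does not state, is that the recipes now pick the value up from a base config instead of repeating it. The sketch below illustrates that kind of layering with a hypothetical `deep_merge` helper; it is not the repository's actual config loader, and the nesting under `policy` is flattened for illustration because the diff does not show the intermediate keys.

```python
import copy


def deep_merge(base, override):
    """Return a new dict where `override` wins, merging nested dicts."""
    merged = copy.deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


# Values taken from the hunks above; the flat layout is an assumption.
base = {
    "policy": {
        "moe_enable_deepep": False,
        "moe_token_dispatcher_type": "alltoall",
        "gradient_accumulation_fusion": False,
    }
}
recipe = {
    "policy": {
        "moe_enable_deepep": True,
        "moe_token_dispatcher_type": "flex",
    }
}

effective = deep_merge(base, recipe)
print(effective["policy"])
```

Under this assumed layering, the recipe overrides only the two MoE keys it sets and still ends up with `gradient_accumulation_fusion` set to `False` from the base.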
