Skip to content

Commit 3ffd142

Browse files
authored
Update model name in L1_3B_ddp.yaml (#1119)
Change 3B config model from `esm2_t33_650M_UR50D` to `esm2_t36_3B_UR50D` <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit * **New Features** * Updated the training recipe to use the larger ESM2 3B model for improved capability. * **Chores** * Adjusted experiment tracking run name to align with the new model. * Preserved existing batch size and training step settings. <!-- end of auto-generated comment: release notes by coderabbit.ai --> Signed-off-by: Jared Wilber <jwilber@nvidia.com>
1 parent b3638d9 commit 3ffd142

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

recipes/esm2_native_te_nvfsdp_thd/hydra_config/L1_3B_ddp.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ defaults:
22
- defaults
33

44
# Training config
5-
model_name: esm2_t33_650M_UR50D
5+
model_name: esm2_t36_3B_UR50D
66
micro_batch_size: 32
77
num_train_steps: 10_000
88

0 commit comments

Comments
 (0)