We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 0279dcb commit c42e7d6Copy full SHA for c42e7d6
1 file changed
bionemo-recipes/recipes/esm2_native_te/hydra_config/defaults.yaml
@@ -2,7 +2,7 @@
2
model_tag: ??? # E.g., nvidia/esm2_t6_8M_UR50D, facebook/esm2_t6_8M_UR50D, or a local path (e.g ./example_8m_checkpoint)
3
num_train_steps: ???
4
5
-use_meta_device: true
+use_meta_device: false # meta-device init is still not converging
6
7
# Whether to wrap the model in torch.compile. Note, this is currently not supported with mfsdp (BIONEMO-2977).
8
# We leave this off by default since we don't see much of a performance improvement with TE layers.
0 commit comments