Bug fix: 6012573 (#1131)

sugunav14 · web-flow · commit 24ceba61fe9a · 2026-03-29T01:24:13.000+05:30
## Summary by CodeRabbit

* **Chores**
  * Standardized the configuration key for model precision.
  * Model loading now defaults to bfloat16 precision instead of float32, aligning configs and runtime behavior.

---------

Signed-off-by: Suguna Velury <178320438+sugunav14@users.noreply.github.com>
diff --git a/examples/gpt-oss/configs/sft_full.yaml b/examples/gpt-oss/configs/sft_full.yaml
@@ -1,7 +1,7 @@
 # Model
 model_name_or_path: openai/gpt-oss-20b
 attn_implementation: eager
-torch_dtype: bfloat16
+dtype: bfloat16
 
 # Dataset
 dataset_name: HuggingFaceH4/Multilingual-Thinking
diff --git a/examples/gpt-oss/configs/sft_lora.yaml b/examples/gpt-oss/configs/sft_lora.yaml
@@ -1,7 +1,7 @@
 # Model
 model_name_or_path: openai/gpt-oss-20b
 attn_implementation: eager
-torch_dtype: bfloat16
+dtype: bfloat16
 
 # Dataset
 dataset_name: HuggingFaceH4/Multilingual-Thinking
diff --git a/examples/gpt-oss/sft.py b/examples/gpt-oss/sft.py
@@ -72,7 +72,7 @@ def main(script_args, training_args, model_args, quant_args):
         "revision": model_args.model_revision,
         "trust_remote_code": model_args.trust_remote_code,
         "attn_implementation": model_args.attn_implementation,
-        "torch_dtype": getattr(model_args, "dtype", "float32"),
+        "torch_dtype": getattr(model_args, "dtype", "bfloat16"),
         "use_cache": not training_args.gradient_checkpointing,
     }
 

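The fallback changed in `sft.py` can be illustrated in isolation. The following is a minimal sketch, not the script's actual code path: `resolve_dtype` is a hypothetical helper, and the `SimpleNamespace` objects stand in for the parsed `model_args`, whose real class is not shown in this diff.

```python
from types import SimpleNamespace


def resolve_dtype(model_args, default="bfloat16"):
    """Mirror the patched lookup: read `dtype`, fall back to `default` if the attribute is missing."""
    return getattr(model_args, "dtype", default)


# Stand-in for args parsed from a config that uses the new `dtype` key.
with_dtype = SimpleNamespace(dtype="float16")

# Stand-in for an args object that only carries the old `torch_dtype` attribute.
# The lookup never reads `torch_dtype`, so the default applies.
without_dtype = SimpleNamespace(torch_dtype="float32")

# The default is used only when the attribute is absent, not when it is None.
dtype_is_none = SimpleNamespace(dtype=None)

print(resolve_dtype(with_dtype))     # float16
print(resolve_dtype(without_dtype))  # bfloat16
print(resolve_dtype(dtype_is_none))  # None
```

Because `getattr` substitutes the default only for a missing attribute, an args object whose `dtype` is explicitly `None` still passes `None` through; whether that case can occur depends on the argument class in use.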