Bug fix 5875873 (#865)

sugunav14 · web-flow · commit 56e97c809648 · 2026-02-28T23:52:44.000+05:30
## What does this PR do? **Type of change:** Bug fix  **Overview:** Newer version of trl uses dtype instead of torch_dtype. Modified code to set float32 as default for older versions of trl that you torch_dtype. ## Usage  ```python # Add a code snippet demonstrating how to use this ``` ## Testing  ## Before your PR is "*Ready for review*"  - **Make sure you read and follow [Contributor guidelines](https://github.com/NVIDIA/Model-Optimizer/blob/main/CONTRIBUTING.md)** and your commits are signed. - **Is this change backward compatible?**: Yes/No  - **Did you write any new necessary tests?**: Yes/No - **Did you add or update any necessary documentation?**: Yes/No - **Did you update [Changelog](https://github.com/NVIDIA/Model-Optimizer/blob/main/CHANGELOG.rst)?**: Yes/No  ## Additional Information   ## Summary by CodeRabbit * **Bug Fixes** * Enhanced error handling in model training examples to safely manage missing dtype attributes, preventing crashes during initialization when torch_dtype is not configured.  --------- Signed-off-by: Suguna Velury <178320438+sugunav14@users.noreply.github.com>
diff --git a/examples/gpt-oss/sft.py b/examples/gpt-oss/sft.py
@@ -72,7 +72,7 @@ def main(script_args, training_args, model_args, quant_args):
         "revision": model_args.model_revision,
         "trust_remote_code": model_args.trust_remote_code,
         "attn_implementation": model_args.attn_implementation,
-        "torch_dtype": model_args.torch_dtype,
+        "torch_dtype": getattr(model_args, "dtype", "float32"),
         "use_cache": not training_args.gradient_checkpointing,
     }
 

Original file line number	Diff line number	Diff line change
`@@ -72,7 +72,7 @@ def main(script_args, training_args, model_args, quant_args):`
`72`	`72`	`"revision": model_args.model_revision,`
`73`	`73`	`"trust_remote_code": model_args.trust_remote_code,`
`74`	`74`	`"attn_implementation": model_args.attn_implementation,`
`75`		`- "torch_dtype": model_args.torch_dtype,`
	`75`	`+ "torch_dtype": getattr(model_args, "dtype", "float32"),`
`76`	`76`	`"use_cache": not training_args.gradient_checkpointing,`
`77`	`77`	`}`
`78`	`78`