
Commit 65b3f88

h-guo18 and danielkorzekwa authored and committed
Fix: quant config error on quantized offline eagle (#925)
## What does this PR do?

**Type of change:** ?

**Overview:** ?

## Usage

```python
# Add a code snippet demonstrating how to use this
```

## Testing

## Before your PR is "*Ready for review*"

- **Make sure you read and follow [Contributor guidelines](https://github.com/NVIDIA/Model-Optimizer/blob/main/CONTRIBUTING.md)** and your commits are signed.
- **Is this change backward compatible?**: Yes/No
- **Did you write any new necessary tests?**: Yes/No
- **Did you add or update any necessary documentation?**: Yes/No
- **Did you update [Changelog](https://github.com/NVIDIA/Model-Optimizer/blob/main/CHANGELOG.rst)?**: Yes/No

## Additional Information

## Summary by CodeRabbit

## Release Notes

* **Refactor**
  * Enhanced quantization configuration handling for transformer models through improved type validation, ensuring more robust processing of quantized model configurations.

Signed-off-by: h-guo18 <67671475+h-guo18@users.noreply.github.com>
Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
1 parent 481cd83 commit 65b3f88

File tree

1 file changed: +4 additions, −7 deletions

modelopt/torch/speculative/plugins/transformers.py

Lines changed: 4 additions & 7 deletions
```diff
@@ -48,7 +48,7 @@
 )
 from transformers.trainer_pt_utils import LabelSmoother
 from transformers.utils import ModelOutput
-from transformers.utils.quantization_config import QuantizationMethod
+from transformers.utils.quantization_config import CompressedTensorsConfig

 from ..eagle.conversion import EagleDMRegistry
 from ..eagle.eagle_model import EagleModel
@@ -585,12 +585,9 @@ def modify(
         self.eagle_config._attn_implementation = "sdpa"

         # Patch for Kimi-K2-Thinking, avoid quantizing drafter
-        if (
-            hasattr(self.config, "quantization_config")
-            and self.config.quantization_config.quant_method
-            == QuantizationMethod.COMPRESSED_TENSORS
-        ):
-            self.config.quantization_config.quantization_config.ignore.append("re:.*eagle_module.*")
+        quant_config = getattr(self.config, "quantization_config", None)
+        if isinstance(quant_config, CompressedTensorsConfig):
+            quant_config.ignore.append("re:.*eagle_module.*")

         # Set default aux_hidden_state layers
         if (
```
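The fix swaps a `hasattr` check plus a `quant_method` comparison for a single `getattr`/`isinstance` test, which also avoids the double `quantization_config.quantization_config` lookup. Below is a minimal self-contained sketch of that pattern; the stub classes stand in for the real `transformers` quantization configs and the helper name is illustrative, not the actual Model-Optimizer code:

```python
# Stubs standing in for transformers.utils.quantization_config types,
# so this sketch runs without transformers installed.


class CompressedTensorsConfig:
    """Stand-in for the compressed-tensors quantization config."""

    def __init__(self):
        # Module-name patterns excluded from quantization.
        self.ignore = []


class OtherQuantConfig:
    """Any other quantization config type (e.g. bitsandbytes)."""


class ModelConfig:
    """Stand-in for a HF model config; quantization_config is optional."""

    def __init__(self, quantization_config=None):
        if quantization_config is not None:
            self.quantization_config = quantization_config


def exclude_eagle_module(config):
    """Keep the eagle drafter unquantized for compressed-tensors configs.

    getattr(..., None) covers configs with no quantization_config at all,
    and isinstance() skips config types that don't carry an `ignore` list,
    instead of probing attributes like `quant_method` on arbitrary types.
    """
    quant_config = getattr(config, "quantization_config", None)
    if isinstance(quant_config, CompressedTensorsConfig):
        quant_config.ignore.append("re:.*eagle_module.*")
    return config
```

The `isinstance` check makes the intent explicit: only configs whose type is known to expose the `ignore` list are mutated, while unquantized models and other quantization methods pass through untouched.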
