Update openvino_quantizer.rst

anzr299 · web-flow · commit ba70b4ac510e · 2026-05-11T17:11:05.000+04:00
diff --git a/unstable_source/openvino_quantizer.rst b/unstable_source/openvino_quantizer.rst
@@ -118,29 +118,29 @@ After we capture the FX Module to be quantized, we will import the OpenVINOQuant
 
 .. code-block:: python
 
-    from nncf.experimental.torch.fx import OpenVINOQuantizer
+    from executorch.backends.openvino.quantizer import OpenVINOQuantizer
+    from executorch.backends.openvino.quantizer import QuantizationMode
 
     quantizer = OpenVINOQuantizer()
 
 ``OpenVINOQuantizer`` has several optional parameters that allow tuning the quantization process to get a more accurate model.
 Below is the list of essential parameters and their description:
 
 
-* ``preset`` - defines quantization scheme for the model. Two types of presets are available:
+* ``mode`` - defines the quantization scheme for the model. The following modes are supported:
 
-    * ``PERFORMANCE`` (default) - defines symmetric quantization of weights and activations
+    * ``INT8_SYM`` (default) - defines symmetric quantization of weights and activations. This mode gives the best performance.
 
-    * ``MIXED`` - weights are quantized with symmetric quantization and the activations are quantized with asymmetric quantization. This preset is recommended for models with non-ReLU and asymmetric activation functions, e.g. ELU, PReLU, GELU, etc.
+    * ``INT8_MIXED`` - weights are quantized symmetrically and activations are quantized asymmetrically. This mode is recommended for models with non-ReLU and asymmetric activation functions, e.g. ELU, PReLU, GELU, etc.
 
-    .. code-block:: python
-
-        OpenVINOQuantizer(preset=nncf.QuantizationPreset.MIXED)
+    * ``INT8_TRANSFORMER`` - a special quantization scheme that preserves accuracy after quantization of Transformer models (BERT, Llama, etc.).
 
-* ``model_type`` - used to specify quantization scheme required for specific type of the model. Transformer is the only supported special quantization scheme to preserve accuracy after quantization of Transformer models (BERT, Llama, etc.). None is default, i.e. no specific scheme is defined.
+    * ``INT8WO_SYM``, ``INT8WO_ASYM``, ``INT4WO_SYM``, ``INT4WO_ASYM`` - weights-only quantization schemes. They apply plain min-max quantization to the model weights, compressing them to INT8 or INT4 with a symmetric or asymmetric scheme.
 
     .. code-block:: python
 
-        OpenVINOQuantizer(model_type=nncf.ModelType.Transformer)
+        OpenVINOQuantizer(mode=QuantizationMode.INT8_SYM)
+
 
 * ``ignored_scope`` - this parameter can be used to exclude some layers from the quantization process to preserve the model accuracy.  For example, when you want to exclude the last layer of the model from quantization.  Below are some examples of how to use this parameter: