--- a/docs/envvars.rst
+++ b/docs/envvars.rst
@@ -287,6 +287,30 @@ Kernel Configuration
    :Default: ``0``
    :Description: Enable row-scaled NVFP4 tensors for forward activation quantizers in the ``NVFP4BlockScaling`` recipe. When set to ``1`` (or when ``NVFP4BlockScaling(row_scaled_activation=True)`` is used), rowwise ``amax`` metadata is stored as one FP32 value per tensor row instead of a single scalar.
 
+.. envvar:: NVTE_NVFP4_4OVER6
+
+   :Type: ``str`` (``none``, ``weights``, ``activations``, or ``all``)
+   :Default: ``none``
+   :Description: Enable 4over6 adaptive NVFP4 block scaling for weights, activations, or both in the ``NVFP4BlockScaling`` recipe. For each selected FP4 block, quantization compares the map-to-4 and map-to-6 candidates and stores the one with the lower error under the configured metric (see :envvar:`NVTE_NVFP4_4OVER6_ERR_MODE`). ``none`` keeps standard NVFP4. Current 4over6 support targets RL and post-training scenarios; pre-training paths that combine 4over6 with RHT are not yet implemented.
+
+.. envvar:: NVTE_NVFP4_4OVER6_E4M3_USE_256
+
+   :Type: ``str`` (``none``, ``weights``, ``activations``, or ``all``)
+   :Default: ``all``
+   :Description: Select which NVFP4 4over6 quantizers use 256 instead of 448 as the global E4M3 scale bound. By default, all 4over6 quantizers use 256. Set the environment variable to ``none`` (or set ``NVFP4BlockScaling(nvfp4_4over6_e4m3_use_256="none")``) to use the standard NVFP4 bound of 448 for all 4over6 quantizers. This option only affects tensor roles that also enable :envvar:`NVTE_NVFP4_4OVER6`.
+
+.. envvar:: NVTE_NVFP4_4OVER6_ERR_MODE
+
+   :Type: ``str`` (``MAE`` or ``MSE``)
+   :Default: ``MAE``
+   :Description: Select the input-domain error metric used by NVFP4 4over6 map-to-4 versus map-to-6 candidate selection in the ``NVFP4BlockScaling`` recipe.
+
+.. envvar:: NVTE_NVFP4_4OVER6_ERR_USE_FAST_MATH
+
+   :Type: ``int`` (0 or 1)
+   :Default: ``0``
+   :Description: Allow the NVFP4 4over6 candidate error computation to use faster, non-strict floating-point expressions. By default, the 4over6 error comparison uses strict expressions; ``NVTE_USE_FAST_MATH`` does not control this error-comparison path.
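
The candidate selection described for `NVTE_NVFP4_4OVER6` and `NVTE_NVFP4_4OVER6_ERR_MODE` can be illustrated with a small pure-Python sketch. This is not the Transformer Engine kernel. It assumes that "map-to-4" and "map-to-6" mean scaling a block so that its absolute maximum lands on the FP4 (E2M1) magnitude 4 or 6, it keeps block scales in full precision instead of quantizing them to E4M3, and it breaks rounding ties toward the smaller magnitude. All function names are hypothetical.

```python
# FP4 E2M1 representable magnitudes.
E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)


def quantize_to_grid(x):
    """Round x to the nearest signed E2M1 value.

    Assumption: ties go to the smaller magnitude (min() keeps the first
    minimum); real hardware rounding may differ.
    """
    mag = min(E2M1_GRID, key=lambda g: abs(abs(x) - g))
    return mag if x >= 0 else -mag


def fake_quantize_block(block, target):
    """Quantize and dequantize a block so that its amax maps to `target`."""
    amax = max(abs(v) for v in block)
    if amax == 0.0:
        return list(block)
    scale = amax / target  # kept in full precision in this sketch
    return [quantize_to_grid(v / scale) * scale for v in block]


def block_error(block, dequant, mode):
    """Input-domain error between the original and dequantized block."""
    diffs = [abs(a - b) for a, b in zip(block, dequant)]
    if mode == "MAE":
        return sum(diffs) / len(diffs)
    if mode == "MSE":
        return sum(d * d for d in diffs) / len(diffs)
    raise ValueError(f"unknown error mode: {mode!r}")


def select_4over6(block, mode="MAE"):
    """Return (target, dequantized_block) for the lower-error candidate.

    Assumption: on a tie, the standard map-to-6 candidate is kept.
    """
    cand6 = fake_quantize_block(block, 6.0)
    cand4 = fake_quantize_block(block, 4.0)
    err6 = block_error(block, cand6, mode)
    err4 = block_error(block, cand4, mode)
    return (4, cand4) if err4 < err6 else (6, cand6)
```

For example, `select_4over6([5.0, 6.0])` picks the map-to-4 candidate, because 5 falls in the wide gap between the grid values 4 and 6 under map-to-6 scaling, while `select_4over6([1.0, 6.0])` keeps map-to-6.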
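
To make the interaction between `NVTE_NVFP4_4OVER6` and `NVTE_NVFP4_4OVER6_E4M3_USE_256` concrete, here is a hedged sketch of how the two role selectors could combine into a per-role E4M3 scale bound. The helper names are hypothetical, and the actual parsing in Transformer Engine (for example case handling or error reporting) may differ; only the variable names, allowed values, and defaults come from the documentation in this diff.

```python
import os

VALID_SELECTORS = ("none", "weights", "activations", "all")
ROLES = ("weights", "activations")


def roles_from_selector(value):
    """Expand a selector string into the set of tensor roles it covers."""
    if value not in VALID_SELECTORS:
        raise ValueError(f"expected one of {VALID_SELECTORS}, got {value!r}")
    if value == "none":
        return frozenset()
    if value == "all":
        return frozenset(ROLES)
    return frozenset({value})


def e4m3_bound_per_role(env=None):
    """Return the global E4M3 scale bound for each tensor role.

    A role uses 256 only if it enables 4over6 AND is selected by the
    USE_256 option; every other role keeps the standard bound of 448.
    """
    env = os.environ if env is None else env
    four_over_six = roles_from_selector(env.get("NVTE_NVFP4_4OVER6", "none"))
    use_256 = roles_from_selector(env.get("NVTE_NVFP4_4OVER6_E4M3_USE_256", "all"))
    return {
        role: 256.0 if (role in four_over_six and role in use_256) else 448.0
        for role in ROLES
    }
```

With the documented defaults (`none` and `all`), no role enables 4over6, so both roles keep the 448 bound; setting only `NVTE_NVFP4_4OVER6=weights` switches the weight quantizers to 256.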