You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
QONNX (Quantized ONNX) introduces three new custom operators -- [`Quant`](docs/qonnx-custom-ops/quant_op.md), [`BipolarQuant`](docs/qonnx-custom-ops/bipolar_quant_op.md), and [`Trunc`](docs/qonnx-custom-ops/trunc_op.md) -- in order to represent arbitrary-precision uniform quantization in [ONNX](onnx.md). This enables:
11
+
QONNX (Quantized ONNX) introduces three new custom operators -- `Quant`, `BipolarQuant`, and `Trunc` -- in order to represent arbitrary-precision uniform quantization in [ONNX](onnx.md). This enables:
12
12
13
13
* Representation of binary, ternary, 3-bit, 4-bit, 6-bit or any other quantization.
14
14
* Quantization is an operator itself, and can be applied to any parameter or layer input.
0 commit comments