Skip to content

Commit 77afaa7

Browse files
author
Abhijeet
committed
adding Automatic mixed precision quantization documentation under Advanced Features
1 parent 6b57dc6 commit 77afaa7

6 files changed

Lines changed: 4 additions & 4 deletions

File tree

128 Bytes
Binary file not shown.
12 Bytes
Binary file not shown.

docs/_build/html/_sources/features/index.rst.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ Fully automatic, Hessian-aware per-layer bit width assignment using a
4343
greedy algorithm. Enabled by setting ``auto_quantization: True``:
4444

4545
* Estimates per-layer sensitivity via Hessian eigenvalue (power iteration)
46-
* Greedy assignment from ``{2, 4, 8}`` bit widths maximising accuracy per bit
46+
* Greedy assignment from ``{2, 4, 8, 32}`` bit widths maximising accuracy per bit
4747
* Automatic average bit width selection via binary search calibration
4848
* Fixes regression tasks where uniform 8-bit QAT fails
4949

docs/_build/html/features/index.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1420,7 +1420,7 @@ <h2><span class="section-number">8.8. </span>Feature Overview<a class="headerlin
14201420
greedy algorithm. Enabled by setting <code class="docutils literal notranslate"><span class="pre">auto_quantization:</span> <span class="pre">True</span></code>:</p>
14211421
<ul class="simple">
14221422
<li><p>Estimates per-layer sensitivity via Hessian eigenvalue (power iteration)</p></li>
1423-
<li><p>Greedy assignment from <code class="docutils literal notranslate"><span class="pre">{2,</span> <span class="pre">4,</span> <span class="pre">8}</span></code> bit widths maximising accuracy per bit</p></li>
1423+
<li><p>Greedy assignment from <code class="docutils literal notranslate"><span class="pre">{2,</span> <span class="pre">4,</span> <span class="pre">8,</span> <span class="pre">32}</span></code> bit widths maximising accuracy per bit</p></li>
14241424
<li><p>Automatic average bit width selection via binary search calibration</p></li>
14251425
<li><p>Fixes regression tasks where uniform 8-bit QAT fails</p></li>
14261426
</ul>

docs/_build/html/searchindex.js

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

docs/source/features/index.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ Fully automatic, Hessian-aware per-layer bit width assignment using a
4343
greedy algorithm. Enabled by setting ``auto_quantization: True``:
4444

4545
* Estimates per-layer sensitivity via Hessian eigenvalue (power iteration)
46-
* Greedy assignment from ``{2, 4, 8}`` bit widths maximising accuracy per bit
46+
* Greedy assignment from ``{2, 4, 8, 32}`` bit widths maximising accuracy per bit
4747
* Automatic average bit width selection via binary search calibration
4848
* Fixes regression tasks where uniform 8-bit QAT fails
4949

0 commit comments

Comments
 (0)