7. Docs: QAT performance analysis and model name corrections by musicalplatypus · Pull Request #12 · TexasInstruments/tinyml-tensorlab

musicalplatypus · 2026-04-07T12:33:17Z

Summary

Two documentation improvements:

1. QAT Training Performance Analysis (`quantization.rst`)

Adds a new section explaining why QAT (Quantization-Aware Training) is significantly slower than float training, with a detailed breakdown of each overhead factor:

FakeQuantize nodes — 2N+1 extra forward/backward ops per batch per layer
Observer min/max reductions — full-tensor reduction per observer per batch
ceil2_tensor GPU syncs — power-of-2 scale snapping forces device-to-host synchronization
Soft-quantize variants (4-bit/2-bit) — triple per-node cost from sigmoid/tanh differentiable rounding
torch.compile ordering — QAT model runs in eager mode because FX graph rewrite discards compiled version

Includes a summary table and source file references for each factor.

2. Model Name Corrections (`DEVICE_TASK_SUPPORT.md`)

Updated model names in the device/task support matrix to match the current model registry (e.g., TimeSeries_Generic_2k → CLS_2k).

Files Changed (2 files)

docs/source/features/quantization.rst
tinyml-modelmaker/DEVICE_TASK_SUPPORT.md

Document the six major overhead sources in quantization-aware training vs float training: FakeQuantize nodes, observer statistics, power-of-2 scale snapping, soft-quantize variants, backward pass complexity, and torch.compile ordering. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Replace outdated TimeSeries_Generic_* names with current registry names: - Classification: CLS_*_NPU, CLS_ResAdd_3k, CLS_ResCat_3k - Regression: REGR_* - Anomaly Detection: AD_* - Forecasting: FCST_* Updated across DEVICE_TASK_SUPPORT.md and all example config.yaml and documentation files. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge in TINYML-ALGO/tinyml-agent-skills from 2026/pranav_a to main * commit 'ff02a0e95ef4841fb883ba841d353a125d4e4447': restructuring agent skill

de8af16d Pull request #45: https://jira.itg.ti.com/browse/TINYML_ALGO-698 REVERT: e48ef1a Pull request #14: TINYML_ALGO-711: fixing readme REVERT: 16fc6a6 TINYML_ALGO-711: fixing readme REVERT: e3639d2 Pull request #13: removing pycache REVERT: f8bb3b7 removing pycache REVERT: dd38428 Pull request #12: restructuring agent skill REVERT: ff02a0e restructuring agent skill REVERT: d26c6a5 Pull request #11: fixing tiny ml name REVERT: 640ffd3 fixing tiny ml name REVERT: 4ee3a19 Pull request #10: 2026/pranav a REVERT: be83fc6 minor fixes REVERT: e3a5700 removed assets, included autoMP quant REVERT: 1af575a Pull request #9: correcting npu devices list REVERT: 31e9eb1 correcting npu devices list REVERT: 59b209b Pull request #8: improving readme REVERT: 8c3260b improving readme REVERT: 668916f Pull request #7: improving readme REVERT: 68686b3 improving readme REVERT: 814316e Pull request #6: fixes to readme and marketplace json REVERT: e4bc0b4 fixes to readme and marketplace json REVERT: 6a64208 Pull request #5: fixes to readme REVERT: 0f9c868 fixes to readme REVERT: 52f95ff Pull request #4: 2026/pranav a REVERT: 443295d fixes to readme REVERT: 1881112 fixes to readme and marketplace json REVERT: 229ab57 Pull request #3: 2026/pranav a REVERT: 6519104 minor readme fix REVERT: 38e9f9f minor readme fix REVERT: db81f81 Pull request #2: minor readme fix REVERT: 1c0737a minor readme fix REVERT: 0a0c02d Pull request #1: minor readme fix REVERT: b682335 minor readme fix REVERT: 062eb39 Initial Commit git-subtree-dir: tinyml-agent-skills git-subtree-split: de8af16d9e23de3e9bda3d811a0ebdece1178260

t5fkg8d44d-beep and others added 2 commits April 7, 2026 07:18

musicalplatypus changed the title ~~Docs: QAT performance analysis and model name corrections~~ 7. Docs: QAT performance analysis and model name corrections Apr 7, 2026

Adithya-Thonse pushed a commit that referenced this pull request Jun 12, 2026

Pull request #12: restructuring agent skill

dd38428

Merge in TINYML-ALGO/tinyml-agent-skills from 2026/pranav_a to main * commit 'ff02a0e95ef4841fb883ba841d353a125d4e4447': restructuring agent skill

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

7. Docs: QAT performance analysis and model name corrections#12

7. Docs: QAT performance analysis and model name corrections#12
musicalplatypus wants to merge 2 commits into
TexasInstruments:mainfrom
musicalplatypus:pr/documentation

musicalplatypus commented Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

musicalplatypus commented Apr 7, 2026

Summary

1. QAT Training Performance Analysis (quantization.rst)

2. Model Name Corrections (DEVICE_TASK_SUPPORT.md)

Files Changed (2 files)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. QAT Training Performance Analysis (`quantization.rst`)

2. Model Name Corrections (`DEVICE_TASK_SUPPORT.md`)