Commit 0bba44a
Github Executorch
[Cortex-M]: Add int8 I/O quantization to Cortex-M export path
Apply QuantizeInputs and QuantizeOutputs passes in the Cortex-M
compilation path to strip the float-in/float-out wrapper from
quantized models. This produces a fully int8 model that accepts
and returns int8 tensors directly.
The passes are applied after to_edge_transform_and_lower but before
CortexMPassManager, since the latter renames quantized_decomposed
ops to cortex_m variants which the I/O passes cannot recognize.1 parent 0907294 commit 0bba44a
1 file changed
Lines changed: 13 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
| 50 | + | |
| 51 | + | |
50 | 52 | | |
51 | 53 | | |
52 | 54 | | |
| |||
860 | 862 | | |
861 | 863 | | |
862 | 864 | | |
| 865 | + | |
| 866 | + | |
| 867 | + | |
| 868 | + | |
| 869 | + | |
| 870 | + | |
| 871 | + | |
| 872 | + | |
| 873 | + | |
| 874 | + | |
| 875 | + | |
863 | 876 | | |
864 | 877 | | |
865 | 878 | | |
| |||
0 commit comments