Gas Sensor: Differences during inference between Native PyTorch quantization and TI-provided quantization Wrappers

-Tushar Sharma, Adithya Thonse, Fasna S

Overview

The Gas Sensor Array under Low Concentration dataset, available from the UCI Machine Learning Repository, provides time-series data recorded from a chemical sensor array exposed to low concentrations of six different gases. It is specifically designed to support research on gas identification and quantification at parts-per-billion (ppb) levels, i.e. below one part per million, which is particularly relevant for applications such as environmental monitoring, air quality assessment, and medical diagnostics.

The dataset comprises recordings from 10 metal-oxide semiconductor (MOS) sensors. Each instance in the dataset contains a 30-minute time-series measurement sampled at 1 Hz, resulting in 9000 sensor readings per sample. These readings are split evenly across the 10 sensors, each contributing 900 data points. Each sample corresponds to a specific gas (ethanol, acetone, toluene, ethyl acetate, isopropanol, or n-hexane) at one of three concentration levels: 50, 100, or 200 ppb. In total, the dataset contains 90 samples, five per gas-concentration pair.

This dataset is ideal for developing and benchmarking machine learning algorithms for classification (gas type recognition), regression (concentration estimation), and time-series analysis. Its challenges lie in the low signal-to-noise ratio and high dimensionality, making it an excellent testbed for advanced feature extraction.

The dataset used has 10 measured variables (Sensor_0, Sensor_1, Sensor_2, Sensor_3, Sensor_4, Sensor_5, Sensor_6, Sensor_7, Sensor_8, Sensor_9), i.e. the readings from the gas sensors, and 1 prediction variable, i.e. the gas present and its concentration.
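
To make the layout concrete, the sketch below splits one flat 9000-value sample into ten per-sensor series of 900 points each. It assumes the readings are stored sensor by sensor (all of Sensor_0 first, then Sensor_1, and so on). That ordering and the helper name split_by_sensor are illustrative assumptions, not taken from the dataset documentation or from gas_sensor.py.

```python
NUM_SENSORS = 10
POINTS_PER_SENSOR = 900


def split_by_sensor(flat_sample):
    """Split one flat sample into a dict of per-sensor series.

    Assumes sensor-major ordering (hypothetical; check the actual file layout).
    """
    expected = NUM_SENSORS * POINTS_PER_SENSOR
    if len(flat_sample) != expected:
        raise ValueError(f"expected {expected} readings, got {len(flat_sample)}")
    return {
        f"Sensor_{i}": flat_sample[i * POINTS_PER_SENSOR:(i + 1) * POINTS_PER_SENSOR]
        for i in range(NUM_SENSORS)
    }


# Synthetic stand-in for a real sample: the values 0..8999.
sample = list(range(NUM_SENSORS * POINTS_PER_SENSOR))
series = split_by_sensor(sample)
print(len(series), len(series["Sensor_0"]))
```

If the file turns out to interleave the sensors instead, only the slicing expression needs to change.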

Downloading dataset

For this demonstration, we'll work with detecting three different gases: ethyl acetate, isopropanol, and n-hexane, each at a 100 ppb concentration level. To get started, execute the gas_sensor.py script, which will generate and compress the necessary data into a zip file called gas_sensor_dataset.zip.

cd examples/gas_sensor
python gas_sensor.py

The path of this zipped dataset file is already set in the YAML configuration; make sure it matches the location of the generated file.

dataset:
    input_data_path: 'examples/gas_sensor/gas_sensor_dataset.zip'
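
Before launching a run, it can help to confirm that the archive named in input_data_path exists and is a readable zip. The following is a minimal sketch using only the Python standard library. The helper check_dataset_archive is hypothetical and not part of ModelMaker, and nothing is assumed about the files inside the archive.

```python
import os
import zipfile

# Path copied from the dataset section of config.yaml; relative to the repository root.
ARCHIVE_PATH = "examples/gas_sensor/gas_sensor_dataset.zip"


def check_dataset_archive(path):
    """Return the archive's member names, or raise if the path is not a readable zip."""
    if not os.path.isfile(path):
        raise FileNotFoundError(f"dataset archive not found: {path}")
    if not zipfile.is_zipfile(path):
        raise ValueError(f"not a valid zip archive: {path}")
    with zipfile.ZipFile(path) as archive:
        return archive.namelist()


try:
    names = check_dataset_archive(ARCHIVE_PATH)
    print(f"{len(names)} entries found in {ARCHIVE_PATH}")
except (FileNotFoundError, ValueError) as err:
    # Typically means gas_sensor.py has not been run yet or the path differs.
    print(f"dataset check failed: {err}")
```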

Usage in Tiny ML ModelZoo

This zipped dataset is designed to work with Tiny ML ModelMaker. Run ModelMaker with the YAML configuration using the command below.

run_tinyml_modelzoo.sh examples/gas_sensor/config.yaml

run_tinyml_modelzoo.sh is the script that runs ModelMaker. It takes two required arguments, TARGET_SOC and CONFIG_FILE.
examples/gas_sensor/config.yaml: path of the YAML configuration file to run

Users can edit the YAML configuration to change parameters related to data preprocessing, feature extraction, training, testing, the model, and model compilation. In this example, we will configure one training parameter, output_int.

training:
    output_int: True #False

Quantization Background

ModelMaker handles several functions, one of which is model quantization. After training the model with floating-point precision, it undergoes either Quantization Aware Training (QAT) or Post Training Quantization (PTQ) calibration to produce a quantized version. This quantized model is then transformed into a different format optimized for use with the hardware accelerators available on TI-MCUs. The converted model is subsequently used to validate accuracy. The illustration below demonstrates the inference process using the converted model, where the input provided is (0.91, 0.87, 0.13).

The expected output from the floating-point model is (0, 0.89, 0.12), which is essentially a scaled version of the quantized output (0, 13, 7). By default, ModelMaker trains the model to produce quantized outputs, as this approach is faster and operates in the integer domain. In contrast, generating dequantized (floating-point) outputs requires operations in the floating-point domain, which can slow down inference.
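
For a classifier, the predicted class is the index of the largest output, so a quantized output vector can be used directly without converting back to floating point. A minimal sketch using the two output vectors quoted above (the argmax helper is illustrative, not part of ModelMaker):

```python
def argmax(values):
    """Index of the largest element (the first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


float_output = (0, 0.89, 0.12)   # floating-point model output quoted above
quantized_output = (0, 13, 7)    # quantized model output quoted above

# Both representations select the same class index.
print(argmax(float_output), argmax(quantized_output))
```

Both calls return index 1, which is why the integer outputs are sufficient when only the predicted class is needed.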

Quantized/Dequantized Output

Users may notice differences during inference between Native PyTorch quantization and the TI-provided quantization wrappers. When using GenericTinyMLQATFxModule or GenericTinyMLPTQFxModule, which rely on Native PyTorch quantization, the output is dequantized by default. In contrast, quantizing the model with TINPUTinyMLQATFxModule or TINPUTinyMLPTQFxModule produces quantized outputs by default.

Our solution offers flexibility to obtain either quantized or dequantized outputs when using the TINYML wrappers (TINPUTinyMLQATFxModule or TINPUTinyMLPTQFxModule). This can be controlled through the training section of the YAML configuration by setting the output_int parameter to either True or False depending on the desired output format.

True (default): No multiply block is added to the last layer, and the model outputs remain in quantized integer format.
False: A multiply block is added to the last layer of the ONNX model to perform dequantization, resulting in dequantized floating-point outputs.

training:
    output_int: False # True (default)

Furthermore, you can inspect the outputs produced by the models in test_vector.c:

PATH=data/projects/gas_sensor/run/-/CLS_1k_NPU/training/quantization/golden_vectors/test_vector.c

The test_vector.c file contains many test cases that verify the outputs of the converted model against the QDQ-quantized model. Let's see the effect of output_int on these test vectors. For output_int set to:

True:
- Floating Model Inference: int8_t golden_output[3] = { 3, -2, -1, } ;
- Quantized Model Inference: int8_t golden_output[3] = { 28, -14, -13, } ;
False:
- Floating Model Inference: int8_t golden_output[3] = { 3, -2, -1, } ;
- Quantized Model Inference: int8_t golden_output[3] = { 0, 0, 0, } ; # { 0.875, -0.4375, -0.40625}

Since the dequantized output values here lie in the range (-1, 1), their fractional parts are lost when they are cast to int, so every value shows up as 0. When output_int is False, the scaling factor found from the trained model is 0.03125. Therefore:

    scaling_factor = 0.03125
    dequantized_output = { 0.875, -0.4375, -0.40625 }
    # quantized_output == dequantized_output / scaling_factor
    quantized_output = { 28, -14, -13, }

With this scaling applied, the outputs from Generic quantization and TINPU quantization match.
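
The arithmetic above can be reproduced in a few lines of Python. This is only a sketch of the scaling relationship, not the code that generates test_vector.c. Python's int() truncates toward zero, which is used here to stand in for the float-to-int cast.

```python
SCALING_FACTOR = 0.03125  # scale reported above for the trained model

quantized_output = [28, -14, -13]  # golden output with output_int set to True

# Dequantize: multiply each integer output by the scale.
dequantized_output = [q * SCALING_FACTOR for q in quantized_output]

# Casting to an integer truncates toward zero, so values in (-1, 1) become 0.
cast_to_int = [int(x) for x in dequantized_output]

# Dividing by the scale recovers the original integer outputs.
recovered = [round(x / SCALING_FACTOR) for x in dequantized_output]

print(dequantized_output)  # [0.875, -0.4375, -0.40625]
print(cast_to_int)         # [0, 0, 0]
print(recovered)           # [28, -14, -13]
```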

Performance on device

We benchmarked the performance of the CLS_1k_NPU model in both cases. The device used is the F28P55x, which comes with a hardware accelerator (TINPU) that provides low-latency inference for ML models. Numbers are provided for running the model on the CPU and on the NPU.

Configuration	AI Model Cycles	Inference Time (us)	Flash Usage (B)	SRAM Usage (B)
CPU (output_int = true)	665234	4434.89	4143	3880
CPU (output_int = false)	665801	4438.67	4169	3880
NPU (output_int = true)	80339	535.59	3529	1680
NPU (output_int = false)	80354	535.69	3560	1682
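
To put these numbers in perspective, the short script below derives the cost of the extra dequantization step and the NPU speedup directly from the table. The clock frequency it prints is inferred from cycles divided by time; it is a derived figure, not one stated in this document.

```python
# Rows copied from the table above:
# (AI model cycles, inference time in us, flash bytes, SRAM bytes)
results = {
    ("CPU", True): (665234, 4434.89, 4143, 3880),
    ("CPU", False): (665801, 4438.67, 4169, 3880),
    ("NPU", True): (80339, 535.59, 3529, 1680),
    ("NPU", False): (80354, 535.69, 3560, 1682),
}

for target in ("CPU", "NPU"):
    int_cycles, int_time_us, int_flash, _ = results[(target, True)]
    float_cycles, _, float_flash, _ = results[(target, False)]
    extra_cycles = float_cycles - int_cycles
    extra_flash = float_flash - int_flash
    implied_mhz = int_cycles / int_time_us  # cycles per microsecond equals MHz
    print(
        f"{target}: output_int=False adds {extra_cycles} cycles and "
        f"{extra_flash} flash bytes (implied clock {implied_mhz:.1f} MHz)"
    )

# Ratio of CPU cycles to NPU cycles with integer outputs.
speedup = results[("CPU", True)][0] / results[("NPU", True)][0]
print(f"NPU speedup over CPU: {speedup:.2f}x")
```

The dequantization step costs well under 0.1% of the cycle count in both configurations, while the NPU runs the model in roughly one eighth of the CPU cycles.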

Update history:
[29th Dec 2025]: Compatible with v1.2 of Tiny ML Modelmaker
[30th May 2025]: Compatible with v1.0 of Tiny ML Modelmaker
