Name	Name	Last commit message	Last commit date
parent directory ..
val_tiny_imagenet	val_tiny_imagenet
README.md	README.md
config_gpu_fp32.json	config_gpu_fp32.json
imagenet.py	imagenet.py
info.yml	info.yml
requirements.txt	requirements.txt
vit-base-patch16-224.py	vit-base-patch16-224.py
vit_qnn_fp32_ctx.json	vit_qnn_fp32_ctx.json

Name

Last commit message

Last commit date

vit-base-patch16-224.py

vit_qnn_fp32_ctx.json

Vision Transformer (ViT) Optimization with PTQ on Qualcomm NPU using QNN EP

This example performs ViT optimization on Qualcomm NPU with ONNX Runtime PTQ. It performs the optimization pipeline:

PyTorch Model -> Onnx Model -> Quantized Onnx Model

It requires x86 python environment on a Windows ARM machine with onnxruntime-qnn installed.

NOTE: The model quantization part of the workflow can also be done on a Linux/Windows machine with a different onnxruntime package installed. Remove the "evaluators" and "evaluator" sections from the configuration file to skip the evaluation step.

Test with Tiny-ImageNet-200

Tiny-ImageNet-200 is a smaller subset of the ImageNet dataset containing 200 classes, commonly used for benchmarking deep learning models.

You can test output model with provided scripts. It is also a example you can refer about inference with model.

Download dataset from http://cs231n.stanford.edu/tiny-imagenet-200.zip and extract.
Go to subfolder val_tiny_imagenet. In val_tiny_imagenet.py, update path_to_tiny_imagenet with Tiny-ImageNet-200 root path and path_to_model. Modify limit as how many number you want in your test.
Run

python .\val_tiny_imagenet.py

QNN-GPU:

Please install Olive directly using:

pip install olive-ai

To run the config:

olive run --config config_gpu_fp32.json

✅ Optimized model saved in: output/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Vision Transformer (ViT) Optimization with PTQ on Qualcomm NPU using QNN EP

Test with Tiny-ImageNet-200

QNN-GPU:

FilesExpand file tree

QNN

Directory actions

More options

Directory actions

More options

Latest commit

History

QNN

Folders and files

parent directory

README.md

Vision Transformer (ViT) Optimization with PTQ on Qualcomm NPU using QNN EP

Test with Tiny-ImageNet-200

QNN-GPU: