Commit 331c835

authored and

committed

Integrate Automated QDQ placement tool - part 2.2 (#845)

## What does this PR do? This PR implements RegionSearch class. RegionSearch could help partition big ONNX model into small region. QDQ autouning will be performed on the regions. **Overview:** ? ## Usage  ```python # Add a code snippet demonstrating how to use this ``` ## Testing  ## Before your PR is "*Ready for review*"  - **Make sure you read and follow [Contributor guidelines](https://github.com/NVIDIA/Model-Optimizer/blob/main/CONTRIBUTING.md)** and your commits are signed. - **Is this change backward compatible?**: Yes - **Did you write any new necessary tests?**: Yes - **Did you add or update any necessary documentation?**: No, document updates is in Part 4. - **Did you update [Changelog](https://github.com/NVIDIA/Model-Optimizer/blob/main/CHANGELOG.rst)?**: CHANGELOG will be updated when all changes are ready. ## Additional Information   ## Summary by CodeRabbit ## Release Notes **Refactor** * Improved ONNX quantization backend with new optimization framework and extensive test coverage to enhance internal graph processing capabilities.  --------- Signed-off-by: Will Guo <willg@nvidia.com> Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

1 parent 278c70b commit 331c835Copy full SHA for 331c835

2 files changed

+1428

-0

lines changed

modelopt/onnx/quantization/autotune
- region_search.py
tests/unit/onnx/quantization/autotune
- test_region_search.py

2 files changed

+1428

-0

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit 331c835

2 files changed

2 files changed

File tree

2 files changed

2 files changed

0 commit comments