Introduce Preprocessing for Optimized Quantization in quantize-ort.py
#238
Issues
Resolves #239
Running the quantization script quantize-ort.py does not reproduce the quantized model in the repo. The current script produces an int8-quantized resnet50 of over 120 MB, which differs significantly from the existing quantized models in the repo (~26 MB). After some investigation, I believe the cause is a missing preprocessing step: the ONNX Runtime documentation strongly encourages preprocessing a model before quantization.

Left: Computation graph of the already-quantized models in the repo, or of models quantized by the updated script.

Right: Computation graph of the model quantized by the original script.
We can see that the current script produces a model with an unoptimized computation graph and redundant computation nodes.
Key Changes
Added a preprocessing step to quantize-ort.py. Optimization is carried out automatically by the quant_pre_process method.
Expected Benefits