ONNX Quantization Framework built on top of ONNXScript and ONNX IR.
⚠️ This project is under active development.
Install directly from PyPI:
```bash
pip install onnx-quantize
```

Here's a minimal example to quantize an ONNX model:
```python
import onnx

from onnx_quantize import QConfig, QuantType, quantize

# Load your model
model = onnx.load("your_model.onnx")

# Define the quantization configuration
qconfig = QConfig(
    is_static=False,
    activations_dtype=QuantType.QInt8,
    activations_symmetric=False,
    weights_dtype=QuantType.QInt8,
    weights_symmetric=True,
    weights_per_channel=False,
)

# Quantize the model
qmodel = quantize(model, qconfig)

# Save the quantized model
onnx.save(qmodel, "qmodel.onnx")
```

## 🧩 Features (planned)
The goal is to provide everything Neural Compressor offers, but built on ONNXScript and ONNX IR.
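For intuition about what the symmetric int8 weight scheme in the quickstart config computes, here is a minimal NumPy sketch of per-tensor symmetric quantization. This is an illustrative sketch only, not this project's implementation; the function name is made up for the example.

```python
import numpy as np

def quantize_symmetric_int8(w: np.ndarray) -> tuple[np.ndarray, float]:
    """Illustrative per-tensor symmetric int8 quantization: q = round(w / scale)."""
    # The scale maps the largest absolute weight onto the int8 range [-127, 127].
    max_abs = float(np.abs(w).max())
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

w = np.array([0.5, -1.27, 0.0, 1.0], dtype=np.float32)
q, scale = quantize_symmetric_int8(w)
# Dequantization recovers an approximation of the originals: w ≈ q * scale
w_hat = q.astype(np.float32) * scale
```

Symmetric quantization keeps the zero point at 0, which is why only a scale is returned; asymmetric schemes (as configured for activations above) additionally track a zero point.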