Fix CPU QLinearConv: support per-channel weight zero points with distinct values #28456

Draft
Copilot wants to merge 3 commits into main from copilot/fix-qlinearconv-per-channel-zero-points

Conversation

Contributor

Copilot AI commented May 11, 2026

Description

The CPU QLinearConv kernel incorrectly rejected per-channel weight zero point tensors whose values were not all identical, even though the ONNX spec allows this for asymmetric per-channel quantization.

Kernel (qlinearconv.cc):

  • Removed the ORT_ENFORCE in ComputeOffset that required all per-channel W zero points to be equal
  • Moved W zero-point reading out of ComputeOffset and into Compute(), exposing the full per-channel array to the dispatch logic
  • Added W_zero_point_is_per_channel / W_zero_point_is_uniform flags
  • GEMM path: sets PerColumnZeroPoints = true and passes W_zero_point_data + group_id * group_output_channels when ZPs differ — MLAS already supported this
  • Depthwise path: requires uniform W zero points (since MlasConvDepthwise takes a scalar FilterZeroPoint); non-uniform per-channel ZPs automatically fall back to the group-GEMM path instead
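The GEMM-path behavior above can be sketched in NumPy terms (illustrative shapes and values; the real kernel drives MLAS with PerColumnZeroPoints = true rather than doing this in NumPy):

```python
import numpy as np

# A 1x1 conv over one pixel reduces to a GEMM: rows of w_q are output channels.
x_q = np.array([[10, 20, 30]], dtype=np.uint8)          # 1 pixel, Cin = 3
w_q = np.array([[1, 2, 3], [4, 5, 6]], dtype=np.uint8)  # Cout = 2 rows
x_zp = 12
w_zp = np.array([5, 90])                                # distinct per-channel ZPs

# Per-column zero points: each output channel c subtracts its own w_zp[c]
# from every weight in that channel before accumulating in int32.
acc = (x_q.astype(np.int32) - x_zp) @ (w_q.astype(np.int32) - w_zp[:, None]).T
# acc has shape [pixels, Cout]; here acc == [[-52, -2020]]
```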

Tests (qlinearconv_op_test.cc):

  • Added zero_points_ vector field to QuantizedTensor and SetWeightZeroPoints() method to QLinearConvOpTester
  • Updated ComputeExpectedOutput and Run() to emit a per-channel ZP tensor when set
  • Added three new test cases covering uint8 activations, int8 activations, and grouped convolution with per-channel W zero points
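A reference computation of the expected output for such tests can be sketched as follows (a hypothetical NumPy helper for a single-pixel 1x1 case, mirroring the dequantize → conv → requantize idea behind ComputeExpectedOutput; all names and values here are illustrative):

```python
import numpy as np

def qlinearconv_1x1_ref(x_q, x_scale, x_zp, w_q, w_scale, w_zp, y_scale, y_zp):
    """Reference 1x1 QLinearConv over one pixel with per-channel W scale/ZP."""
    # Dequantize the activation and, per output channel, the weights.
    x = (x_q.astype(np.float64) - x_zp) * x_scale
    w = (w_q.astype(np.float64) - w_zp[:, None]) * w_scale[:, None]
    y = w @ x                                    # float conv result, shape [Cout]
    # Requantize to uint8 with the output scale and zero point.
    q = np.rint(y / y_scale) + y_zp
    return np.clip(q, 0, 255).astype(np.uint8)

y = qlinearconv_1x1_ref(
    x_q=np.array([10, 20], dtype=np.uint8), x_scale=0.1, x_zp=12,
    w_q=np.array([[100, 100], [100, 100]], dtype=np.uint8),
    w_scale=np.array([0.02, 0.05]), w_zp=np.array([5, 90]),
    y_scale=0.5, y_zp=128)
# y == [130, 129]
```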

Motivation and Context

CPUExecutionProvider threw `QLinearConv : zero point of per-channel filter must be same` at runtime for any model using asymmetric per-channel weight quantization (distinct zero points per output channel), even though `w_scale` and `w_zp` were both valid 1-D [Cout] tensors per the ONNX spec. This made a common quantization pattern unusable on CPU.

```python
w_zp = np.array([5, 90], dtype=np.uint8)  # different per-channel ZPs → was rejected
```
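Distinct zero points arise naturally from asymmetric per-channel quantization, since each output channel derives its own (scale, zero_point) from its own min/max range. A minimal sketch (illustrative values; not onnxruntime's quantization tooling):

```python
import numpy as np

w = np.array([[0.1, 2.55], [-1.0, 1.55]])       # float weights, [Cout=2, Cin=2]
lo, hi = w.min(axis=1), w.max(axis=1)
lo, hi = np.minimum(lo, 0), np.maximum(hi, 0)   # quantized range must include 0
scale = (hi - lo) / 255.0                       # per-channel scales
zp = np.rint(-lo / scale).astype(np.uint8)      # per-channel zero points
# zp == [0, 100]: channels with different ranges get different zero points
```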

Copilot AI changed the title [WIP] Fix CPU QLinearConv for per-channel weight zero points Fix CPU QLinearConv: support per-channel weight zero points with distinct values May 11, 2026
Copilot AI requested a review from tianleiwu May 11, 2026 18:19
Contributor

@github-actions bot left a comment


You can commit the suggested changes from lintrunner.

Comment thread: onnxruntime/core/providers/cpu/quantization/qlinearconv.cc (Outdated)
Contributor

@tianleiwu tianleiwu left a comment


Kernel-side routing looks correct overall; the remaining gap is regression coverage around the new depthwise fallback.

Comment thread: onnxruntime/test/providers/cpu/nn/qlinearconv_op_test.cc

Development

Successfully merging this pull request may close these issues.

CPU QLinearConv rejects per-channel weight zero points with different values

2 participants