Arm backend: Add function to return quant params for lowered graph #12390

wwwind · 2025-07-11T07:38:27Z

Summary:
Add function to return quant params for lowered graph and remove these Q/DQ from the graph. If they are needed, then the EdgeProgramManager should be copied before use of this function.

Change-Id: I09de39c603d68d5ac5de4614a35eb7e3fc9ba518

Signed-off-by: Elena Zhelezina [email protected]

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218

Signed-off-by: Elena Zhelezina <[email protected]> Change-Id: I09de39c603d68d5ac5de4614a35eb7e3fc9ba518

pytorch-bot · 2025-07-11T07:38:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12390

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 6 Unrelated Failures

As of commit a50b14b with merge base a0618c8 ():

NEW FAILURE - The following job has failed:

trunk / test-llama-torchao-lowbit / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 134

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / linux / linux-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
pull / unittest / macos / macos-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
pull / unittest-editable / linux / linux-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
trunk / unittest-release / linux / linux-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
trunk / unittest-release / macos / macos-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success

This comment was automatically generated by Dr. CI and updates every 15 minutes.

digantdesai

Perhaps I am missing something, can you help me understand the motivation please?

digantdesai · 2025-07-11T12:05:07Z

exir/backend/io_quant_params.py

+from executorch.exir.passes.quantize_io_pass import QuantizeInputs, QuantizeOutputs
+
+
+def extract_io_quant_params(


Perhaps move this to the quantize_io_pass.py?

digantdesai · 2025-07-11T12:05:34Z

exir/backend/io_quant_params.py

+    output_idxs: Sequence[int] = (0,),
+) -> Dict[str, Dict[str, Dict[str, Any]]]:
+    """
+    Returns quantization parameters such as scale/zero_point:


can't we get these after quantize_io_pass and then the config methods it adds?

wwwind · 2025-07-11T12:52:48Z

Thank you for the review! @digantdesai

We need this function in our workflow: we have a graphic use case when we need to move out Q/DQ nodes and get scale/zero points so we prepare input data in our plugin. Then we pass it to the subgraph. Currently, we need to call these two passes and then to extract these data from config, which is not very friendly for our users. Here there is just one function call that does this job.

Arm backend: Add function to return quant params for lowered graph

62db20b

Signed-off-by: Elena Zhelezina <[email protected]> Change-Id: I09de39c603d68d5ac5de4614a35eb7e3fc9ba518

wwwind requested review from JacobSzwejbka and larryliu0820 as code owners July 11, 2025 07:38

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 11, 2025

wwwind requested a review from oscarandersson8218 July 11, 2025 07:38

wwwind added partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm ciflow/trunk release notes: arm Changes to the ARM backend delegate labels Jul 11, 2025

wwwind added 2 commits July 11, 2025 08:49

Merge branch 'main' into io_params

b2e3bd9

Merge branch 'main' into io_params

a50b14b

digantdesai reviewed Jul 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Arm backend: Add function to return quant params for lowered graph #12390

Arm backend: Add function to return quant params for lowered graph #12390

Uh oh!

wwwind commented Jul 11, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Jul 11, 2025 •

edited

Loading

Uh oh!

digantdesai left a comment

Uh oh!

digantdesai Jul 11, 2025

Uh oh!

digantdesai Jul 11, 2025

Uh oh!

wwwind commented Jul 11, 2025

Uh oh!

Uh oh!

		from executorch.exir.passes.quantize_io_pass import QuantizeInputs, QuantizeOutputs


		def extract_io_quant_params(

Arm backend: Add function to return quant params for lowered graph #12390

Are you sure you want to change the base?

Arm backend: Add function to return quant params for lowered graph #12390

Uh oh!

Conversation

wwwind commented Jul 11, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12390

❌ 1 New Failure, 6 Unrelated Failures

Uh oh!

digantdesai left a comment

Choose a reason for hiding this comment

Uh oh!

digantdesai Jul 11, 2025

Choose a reason for hiding this comment

Uh oh!

digantdesai Jul 11, 2025

Choose a reason for hiding this comment

Uh oh!

wwwind commented Jul 11, 2025

Uh oh!

Uh oh!

wwwind commented Jul 11, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Jul 11, 2025 •

edited

Loading