[ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388

trivedivivek · 2025-07-11T05:17:39Z

Stack from ghstack (oldest at bottom):

[ET-VK] Adding push constant and ubo verison of select and slice ops to improve memory and performance. #12358
[ET-VK] Adding get or create int function to read int value. #12357
-> [ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388
[ET-VK] Minor performance improvements for buffer to int8 quantized packing. #12383
[ET-VK] Using push constants for unary op. #12308

This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance.

Differential Revision: D78143041

…th packed performance. This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance. Differential Revision: [D78143041](https://our.internmc.facebook.com/intern/diff/D78143041/) [ghstack-poisoned]

pytorch-bot · 2025-07-11T05:17:44Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12388

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (5 Unrelated Failures)

As of commit 2ac3962 with merge base 31ba959 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-eval_llama-mmlu-linux / linux-job (gh) (matched linux rule in flaky-rules.json)
The process '/usr/bin/git' failed with exit code 128

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / linux / linux-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
pull / unittest / macos / macos-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
pull / unittest-editable / linux / linux-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
devtools/inspector/tests/inspector_utils_test.py::TestInspectorUtils::test_equip_debug_handle_to_export_program_success

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…th packed performance. This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance. Differential Revision: [D78143041](https://our.internmc.facebook.com/intern/diff/D78143041/) ghstack-source-id: 295570980 Pull Request resolved: #12388

facebook-github-bot · 2025-07-11T05:17:51Z

This pull request was exported from Phabricator. Differential Revision: D78143041

…improve width packed performance." This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance. Differential Revision: [D78143041](https://our.internmc.facebook.com/intern/diff/D78143041/) [ghstack-poisoned]

facebook-github-bot · 2025-07-11T15:39:08Z

This pull request was exported from Phabricator. Differential Revision: D78143041

trivedivivek requested a review from SS-JIA as a code owner July 11, 2025 05:17

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 11, 2025

facebook-github-bot added the fb-exported label Jul 11, 2025

trivedivivek added the release notes: vulkan Changes to the Vulkan backend delegate label Jul 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388

[ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388

Uh oh!

trivedivivek commented Jul 11, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 11, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jul 11, 2025

Uh oh!

facebook-github-bot commented Jul 11, 2025

Uh oh!

Uh oh!

[ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388

Are you sure you want to change the base?

[ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388

Uh oh!

Conversation

trivedivivek commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12388

✅ You can merge normally! (5 Unrelated Failures)

Uh oh!

facebook-github-bot commented Jul 11, 2025

Uh oh!

facebook-github-bot commented Jul 11, 2025

Uh oh!

Uh oh!

trivedivivek commented Jul 11, 2025 •

edited

Loading

pytorch-bot bot commented Jul 11, 2025 •

edited

Loading