Skip to content

[ET-VK] Optimizing buffer to int8 quantized packing op to improve width packed performance. #12388

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: gh/trivedivivek/120/base
Choose a base branch
from

Conversation

trivedivivek
Copy link
Contributor

@trivedivivek trivedivivek commented Jul 11, 2025

…th packed performance.

This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance.

Differential Revision: [D78143041](https://our.internmc.facebook.com/intern/diff/D78143041/)

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Jul 11, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12388

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (5 Unrelated Failures)

As of commit 2ac3962 with merge base 31ba959 (image):

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 11, 2025
trivedivivek added a commit that referenced this pull request Jul 11, 2025
…th packed performance.

This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance.

Differential Revision: [D78143041](https://our.internmc.facebook.com/intern/diff/D78143041/)

ghstack-source-id: 295570980
Pull Request resolved: #12388
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78143041

…improve width packed performance."

This diff simplifies looping in int8 quantized packing operation for width pack tensor, to improve performance.

Differential Revision: [D78143041](https://our.internmc.facebook.com/intern/diff/D78143041/)

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78143041

@trivedivivek trivedivivek added the release notes: vulkan Changes to the Vulkan backend delegate label Jul 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported release notes: vulkan Changes to the Vulkan backend delegate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants