Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Shape inference: GatherBlockQuantized dispatcher #23748

Merged
merged 4 commits into from
Feb 21, 2025

Conversation

jambayk
Copy link
Contributor

@jambayk jambayk commented Feb 19, 2025

Description

Add shape infer dispatcher for GatherBlockQuantized contrib op. It reuses the dispatcher for Gather op since the first two inputs have the same specs. The output elem type comes from input 2 (scales) for GatherBlockQuantized.

Motivation and Context

Support shape inference for models with GatherBlockQuantized op.

@jambayk jambayk requested a review from tianleiwu February 19, 2025 04:52
tianleiwu
tianleiwu previously approved these changes Feb 20, 2025
@microsoft microsoft deleted a comment from azure-pipelines bot Feb 20, 2025
@jambayk
Copy link
Contributor Author

jambayk commented Feb 20, 2025

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline

Copy link

Azure Pipelines successfully started running 8 pipeline(s).

@jambayk
Copy link
Contributor Author

jambayk commented Feb 20, 2025

/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, Windows x64 QNN CI Pipeline, Big Models

Copy link

Azure Pipelines successfully started running 5 pipeline(s).

@jambayk
Copy link
Contributor Author

jambayk commented Feb 20, 2025

/azp run Linux Android Emulator QNN CI Pipeline, Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI

Copy link

Azure Pipelines successfully started running 5 pipeline(s).

@jambayk
Copy link
Contributor Author

jambayk commented Feb 20, 2025

/azp run Linux OpenVINO CI Pipeline

Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@jambayk jambayk merged commit 6715d4c into main Feb 21, 2025
78 checks passed
@jambayk jambayk deleted the jambayk/gatherquantized-infer branch February 21, 2025 00:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants