Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs #23419

Merged
merged 5 commits into from
Jan 17, 2025

Conversation

adrianlizarraga
Copy link
Contributor

Description

  • Fixes regression for MatMul with two quantized/dynamic uint16 inputs. We need to convert input[1] to uint8 to pass QNN validation.
  • Separates translation of ONNX MatMul -> QNN MatMul and ONNX MatMul -> QNN FullyConnected to separate functions to make the code more readable.

Motivation and Context

The following PR updated the handling of MatMul. The logic to handle MatMul with two non-const uint16 inputs was not ported from simple_op_builder.cc to the new matmul_op_builder.cc.

#22639

@adrianlizarraga adrianlizarraga requested review from HectorSVC, centwang and jywu-msft and removed request for HectorSVC January 17, 2025 21:45
@adrianlizarraga adrianlizarraga marked this pull request as ready for review January 17, 2025 21:46
@adrianlizarraga adrianlizarraga added the ep:QNN issues related to QNN exeution provider label Jan 17, 2025
@adrianlizarraga adrianlizarraga merged commit a9bf0be into main Jan 17, 2025
97 of 98 checks passed
@adrianlizarraga adrianlizarraga deleted the adrianl/matmul-dynamic-uint16-inputs-regression branch January 17, 2025 23:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ep:QNN issues related to QNN exeution provider
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants