[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs #23419

adrianlizarraga · 2025-01-17T19:35:40Z

Description

Fixes regression for MatMul with two quantized/dynamic uint16 inputs. We need to convert input[1] to uint8 to pass QNN validation.
Separates translation of ONNX MatMul -> QNN MatMul and ONNX MatMul -> QNN FullyConnected to separate functions to make the code more readable.

Motivation and Context

The following PR updated the handling of MatMul. The logic to handle MatMul with two non-const uint16 inputs was not ported from simple_op_builder.cc to the new matmul_op_builder.cc.

#22639

…inputs

onnxruntime/core/providers/qnn/builder/opbuilder/matmul_op_builder.cc

adrianlizarraga added 3 commits January 17, 2025 11:28

[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 …

98e7ca7

…inputs

Merge branch 'main' into adrianl/matmul-dynamic-uint16-inputs-regression

b0d434f

Clean up

6cb605f

adrianlizarraga commented Jan 17, 2025

View reviewed changes

onnxruntime/core/providers/qnn/builder/opbuilder/matmul_op_builder.cc Show resolved Hide resolved

adrianlizarraga added 2 commits January 17, 2025 12:25

Remove unnecessary std::move()

b7d1da8

Handle case where input[1] needs to be reshaped and converted to uint8

b9f5db5

adrianlizarraga requested review from HectorSVC, centwang and jywu-msft and removed request for HectorSVC January 17, 2025 21:45

adrianlizarraga marked this pull request as ready for review January 17, 2025 21:46

adrianlizarraga added the ep:QNN issues related to QNN exeution provider label Jan 17, 2025

HectorSVC approved these changes Jan 17, 2025

View reviewed changes

adrianlizarraga merged commit a9bf0be into main Jan 17, 2025
97 of 98 checks passed

adrianlizarraga deleted the adrianl/matmul-dynamic-uint16-inputs-regression branch January 17, 2025 23:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs #23419

[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs #23419

adrianlizarraga commented Jan 17, 2025

[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs #23419

[QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs #23419

Conversation

adrianlizarraga commented Jan 17, 2025

Description

Motivation and Context