Qualcomm AI Engine Direct - Support 2-bits quantization 16a2w by jethroqti · Pull Request #19632 · pytorch/executorch

jethroqti · 2026-05-18T11:26:36Z

Qualcomm AI Engine Direct - Support 2-bits quantization 16a2w

Summary:
1. Add a 16a2w quantizer (16-bit activations, 2-bit weights) that uses standard symmetric quantization.
2. Support per-channel quantization and linear layers.
3. Currently supported SoC model: SM8850.
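
For readers unfamiliar with 2-bit weights, here is a minimal pure-Python sketch of symmetric per-channel quantization to the signed 2-bit range [-2, 1]. It is an illustration only, not the code added by this PR: the function names are hypothetical, and the scale formula `max_abs / ((qmax - qmin) / 2)` is an assumed convention that the QNN quantizer may not share.

```python
def quantize_per_channel_2bit(weight):
    """Quantize each output channel (row) symmetrically to signed 2-bit.

    Hypothetical helper for illustration; not part of the ExecuTorch API.
    """
    qmin, qmax = -2, 1  # signed 2-bit integer range
    quantized, scales = [], []
    for row in weight:
        max_abs = max(abs(v) for v in row)
        # Assumed convention: zero point is 0 and the scale maps max_abs
        # onto half of the integer range. All-zero rows get scale 1.0.
        scale = max_abs / ((qmax - qmin) / 2) if max_abs > 0 else 1.0
        q_row = [min(qmax, max(qmin, round(v / scale))) for v in row]
        quantized.append(q_row)
        scales.append(scale)
    return quantized, scales


def dequantize_per_channel(quantized, scales):
    """Map the 2-bit integers back to floats, using one scale per channel."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]


# Two output channels with different dynamic ranges get different scales.
weight = [[1.2, -1.4, 0.2, 1.5], [3.0, -2.6, 0.4, -3.0]]
q, scales = quantize_per_channel_2bit(weight)
approx = dequantize_per_channel(q, scales)
```

With only four representable levels per channel, values near the positive end of the range are clipped to the top level, which is one reason a per-channel scale matters at this bit width.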

Test plan:
python backends/qualcomm/tests/test_qnn_delegate.py TestQNNQuantizedOperator.test_qnn_backend_16a2w_conv2d -b build-android -H ${HOST} -s ${SN} -m SM8850
python backends/qualcomm/tests/test_qnn_delegate.py TestQNNQuantizedOperator.test_qnn_backend_16a2w_linear -b build-android -H ${HOST} -s ${SN} -m SM8850

pytorch-bot · 2026-05-18T11:26:41Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19632

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 2 Active SEVs

There are 2 currently active SEVs. If your PR is affected, please view them below:

⚠️ 11 Awaiting Approval

As of commit 0223c46 with merge base 824cbff ():

AWAITING APPROVAL - The following workflows need approval before CI can run:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jethroqti · 2026-05-18T11:27:22Z

@pytorchbot label "release notes: qualcomm"

jethroqti · 2026-05-18T11:31:33Z

This PR adds support for 2-bit weight quantization via the 16a2w quantizer. Please take a look. Thanks.
@psiddh @haowhsu-quic @shewu-quic @winskuo-quic @DannyYuyang-quic

jethroqti requested review from abhinaykukkadapu and psiddh as code owners May 18, 2026 11:26

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 18, 2026

pytorch-bot Bot added the release notes: qualcomm Changes to the Qualcomm backend delegate label May 18, 2026

jethroqti wants to merge 1 commit into pytorch:main from CodeLinaro:dev1/quant/pcq2bit
