We are deploying AI models on the Qualcomm QCS6490 platform (Radxa Dragon Q6A) using Qualcomm AI Runtime (QAIRT/QNN), targeting HTP/NPU execution. During deployment and validation, we identified an SDK version mismatch that prevents artifacts generated by Qualcomm AI Hub from running on the target hardware.
Target Platform Information
Hardware Platform: Qualcomm QCS6490 / Radxa Dragon Q6A
Target Runtime: Qualcomm AI Runtime (QAIRT/QNN) on HTP/NPU
Officially Supported SDK Version: QAIRT 2.42.0.251225
Host Environment: Ubuntu 24.04 x86_64, Python 3.10
Issue Summary
We successfully used Qualcomm AI Hub to export and compile AI models (for example, YOLOv8 and Real-ESRGAN) targeting:
qnn_context_binary target runtime
w8a8 quantization
qualcomm-qcs6490 chipset
The export, quantization, linking, profiling, and hosted inference validation on Qualcomm AI Hub completed successfully.
AI Hub profiling confirmed NPU execution on a hosted QCS6490 device:
Real-ESRGAN x4plus: npu (1096 ops), cpu (0), gpu (0)
YOLOv8 detection: npu (254 ops), cpu (0), gpu (0)
Generated artifacts were successfully downloaded, including .bin context binaries.
Primary Issue – QAIRT SDK Runtime Compatibility
The generated model metadata indicates that Qualcomm AI Hub currently compiles artifacts using:
QAIRT/QNN SDK version: 2.45.x
However, the target Radxa Dragon Q6A environment officially supports:
QAIRT SDK version: 2.42.0.251225
When we attempt local deployment with qnn-net-run, context creation fails with the following runtime errors:
Initializing HtpProvider
Creating context from binary file: yolov8_det.bin
[ERROR] Using newer context binary on old SDK
[ERROR] Fail to get context blob with err 5000
[ERROR] Failed to create context from binary with err 0x1388
[ERROR] Could not create context from binary
Create From Binary failure
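The two numeric codes in this log are the same value written in different bases: 5000 in decimal equals 0x1388 in hexadecimal, so both lines appear to report a single failure rather than two distinct ones. Below is a minimal Python sketch that extracts the codes from the [ERROR] lines and normalizes them. The function name extract_error_codes and the regular expression are illustrative only and are not part of any QAIRT tool.

```python
import re

LOG = """\
Initializing HtpProvider
Creating context from binary file: yolov8_det.bin
[ERROR] Using newer context binary on old SDK
[ERROR] Fail to get context blob with err 5000
[ERROR] Failed to create context from binary with err 0x1388
[ERROR] Could not create context from binary
Create From Binary failure
"""

# Matches "err 5000" or "err 0x1388"; the hex alternative is tried first.
ERR_CODE = re.compile(r"\berr (0x[0-9a-fA-F]+|\d+)\b")


def extract_error_codes(log_text):
    """Return the set of integer error codes found on [ERROR] lines."""
    codes = set()
    for line in log_text.splitlines():
        if not line.startswith("[ERROR]"):
            continue
        match = ERR_CODE.search(line)
        if match:
            # Base 0 lets int() infer the base from the prefix (0x -> hex).
            codes.add(int(match.group(1), 0))
    return codes


print(extract_error_codes(LOG))  # {5000}
```

Collapsing the codes into a set makes it clear that only one distinct error value is present in the log.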
Technical Observation
This behavior suggests a runtime/compiler compatibility issue:
QNN/QAIRT serialized HTP context binaries appear to be version-sensitive.
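Artifacts generated with QAIRT 2.45.x fail to load on the QAIRT 2.42.0.251225 runtime; the failure occurs at context creation, before any inference runs.
The "Using newer context binary on old SDK" message suggests the runtime rejects binaries built with a newer SDK, so the compiler version may need to match, or at least not exceed, the runtime version.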
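The last observation above can be sketched as a pre-deployment check that compares the SDK version recorded for a context binary against the runtime installed on the device. This is only an illustration. The rule that a binary must not come from a newer major.minor release than the runtime is an assumption inferred from the "Using newer context binary on old SDK" message, not documented QAIRT behavior, and parse_version and is_binary_loadable are hypothetical helper names.

```python
def parse_version(text):
    """Parse '2.45.x' or '2.42.0.251225' into a (major, minor) tuple."""
    major, minor = text.split(".")[:2]
    return int(major), int(minor)


def is_binary_loadable(binary_sdk, runtime_sdk):
    """Assumed rule: the binary's SDK must not be newer than the runtime's."""
    return parse_version(binary_sdk) <= parse_version(runtime_sdk)


# Versions taken from this report.
print(is_binary_loadable("2.45.x", "2.42.0.251225"))         # False
print(is_binary_loadable("2.42.0.251225", "2.42.0.251225"))  # True
```

Whether a binary compiled with an older SDK loads on a newer runtime is not established by this report; the check above only encodes the failure direction we observed.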