Profile Job Results
Jobs
jgl69968g
Results Ready
Name
controlnet_quantized_UNet_Quantized
Target Device
- Samsung Galaxy S23
- Android 13
- Snapdragon® 8 Gen 2 | SM8550
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
input_1
: uint16[1, 64, 64, 4]input_2
: uint16[1, 1280]input_3
: uint16[1, 77, 768]controlnet_downblock1
: uint16[1, 64, 64, 320]controlnet_downblock2
: uint16[1, 64, 64, 320]controlnet_downblock3
: uint16[1, 64, 64, 320]controlnet_downblock4
: uint16[1, 32, 32, 320]controlnet_downblock5
: uint16[1, 32, 32, 640]controlnet_downblock6
: uint16[1, 32, 32, 640]controlnet_downblock7
: uint16[1, 16, 16, 640]controlnet_downblock8
: uint16[1, 16, 16, 1280]controlnet_downblock9
: uint16[1, 16, 16, 1280]controlnet_downblock10
: uint16[1, 8, 8, 1280]controlnet_downblock11
: uint16[1, 8, 8, 1280]controlnet_downblock12
: uint16[1, 8, 8, 1280]controlnet_midblock
: uint16[1, 8, 8, 1280]Completion Time
6/4/2025, 6:07:38 PM
Versions
- QAIRT: v2.32.6.250402152434_116405
- QNN Backend API: 5.32.0
- QNN Core API: 2.24.0
- Android: 13 (TP1A.220624.014)
- AI Hub: aihub-2025.05.30.0
Estimated Inference Time
258 ms
Estimated Peak Memory Usage
13 ‑ 15 MB
Compute Units
NPU
5433
Stage | Time | Memory |
---|---|---|
First App Load | 2.68 s | 2‑3 MB |
Subsequent App Load | 1.50 s | 1‑3 MB |
Inference | 258 ms | 13‑15 MB |
QNN | Value |
---|---|
context_options.htp_options.performance_mode | BURST |
default_graph_options.htp_options.optimizations[0].type | FINALIZE_OPTIMIZATION_FLAG |
default_graph_options.htp_options.optimizations[0].value | 3.0 |
default_graph_options.htp_options.precision | FLOAT16 |
default_graph_options.htp_options.vtcm_size | 0 |
Sign up to run this model on a hosted Qualcomm® device!
Run on device