Profile Job Results

Jobs
jp34k8zxg
Results Ready
Name
whisper_large_v3_turbo_encoder
Target Device
  • QCS8450 (Proxy)
  • Android 13
  • Qualcomm® QCS8450
Creator
ai-hub-support@qti.qualcomm.com
Input Specs
input_features: float16[1, 128, 3000]
Completion Time
4/25/2026, 10:06:35 AM
Options
--max_profiler_iterations 10
Versions
  • QAIRT: v2.45.0.260326154327
  • QNN Backend API: 5.45.0
  • QNN Core API: 2.34.0
  • Android: 13 (TP1A.220624.014)
  • AI Hub: aihub-2026.04.13.0
Estimated Inference Time
1.31 s
Estimated Peak Memory Usage
1 ‑ 11 MB
Compute Units
NPU
5026
StageTimeMemory
First App Load
3.00 s31‑41 MB
Subsequent App Load
1.12 s1‑11 MB
Inference
1.31 s1‑11 MB
QNNValue
context_options.htp_options.performance_modeBURST
default_graph_options.htp_options.optimizations[0].typeFINALIZE_OPTIMIZATION_FLAG
default_graph_options.htp_options.optimizations[0].value3.0
default_graph_options.htp_options.precisionFLOAT16
default_graph_options.htp_options.vtcm_size0

Sign up to run this model on a hosted Qualcomm® device!

Run on device