Profile Job Results
Jobs
jp8mv0vq5
Results Ready
Name
whisper_small_quantized_w8a16_WhisperSmallEncoderQuantizable
Target Device
- QCS6490 (Proxy)
- Android 12
- Qualcomm® QCS6490
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
input_features
: uint16[1, 80, 3000]Completion Time
10/4/2025, 3:36:11 PM
Options
--max_profiler_iterations 10
Versions
- QAIRT: v2.38.0.250901140452_125126
- QNN Backend API: 5.38.0
- QNN Core API: 2.28.1
- Android: 12 (SP1A.210812.016)
- AI Hub: aihub-2025.09.26.0
Estimated Inference Time
613 ms
Estimated Peak Memory Usage
1 ‑ 13 MB
Compute Units
NPU
1918
Stage | Time | Memory |
---|---|---|
First App Load | 857 ms | 1‑12 MB |
Subsequent App Load | 664 ms | 0‑11 MB |
Inference | 613 ms | 1‑13 MB |
QNN | Value |
---|---|
context_options.htp_options.performance_mode | BURST |
default_graph_options.htp_options.optimizations[0].type | FINALIZE_OPTIMIZATION_FLAG |
default_graph_options.htp_options.optimizations[0].value | 3.0 |
default_graph_options.htp_options.precision | FLOAT16 |
default_graph_options.htp_options.vtcm_size | 0 |
Sign up to run this model on a hosted Qualcomm® device!
Run on device