Qualcomm® AI HubAI Hub

Profile Job Results

Jobs
jgzj3zw4p
Results Ready
Name
whisper_small_v2_HfWhisperDecoder
Target Device
  • Snapdragon X Elite CRD
  • Windows 11
  • Snapdragon® X Elite | SC8380XP
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
input_ids: int32[1, 1]
attention_mask: float16[1, 1, 1, 200]
k_cache_self_0_in: float16[12, 1, 64, 199]
v_cache_self_0_in: float16[12, 1, 199, 64]
k_cache_self_1_in: float16[12, 1, 64, 199]
v_cache_self_1_in: float16[12, 1, 199, 64]
k_cache_self_2_in: float16[12, 1, 64, 199]
v_cache_self_2_in: float16[12, 1, 199, 64]
k_cache_self_3_in: float16[12, 1, 64, 199]
v_cache_self_3_in: float16[12, 1, 199, 64]
k_cache_self_4_in: float16[12, 1, 64, 199]
v_cache_self_4_in: float16[12, 1, 199, 64]
k_cache_self_5_in: float16[12, 1, 64, 199]
v_cache_self_5_in: float16[12, 1, 199, 64]
k_cache_self_6_in: float16[12, 1, 64, 199]
v_cache_self_6_in: float16[12, 1, 199, 64]
k_cache_self_7_in: float16[12, 1, 64, 199]
v_cache_self_7_in: float16[12, 1, 199, 64]
k_cache_self_8_in: float16[12, 1, 64, 199]
v_cache_self_8_in: float16[12, 1, 199, 64]
k_cache_self_9_in: float16[12, 1, 64, 199]
v_cache_self_9_in: float16[12, 1, 199, 64]
k_cache_self_10_in: float16[12, 1, 64, 199]
v_cache_self_10_in: float16[12, 1, 199, 64]
k_cache_self_11_in: float16[12, 1, 64, 199]
v_cache_self_11_in: float16[12, 1, 199, 64]
k_cache_cross_0: float16[12, 1, 64, 1500]
v_cache_cross_0: float16[12, 1, 1500, 64]
k_cache_cross_1: float16[12, 1, 64, 1500]
v_cache_cross_1: float16[12, 1, 1500, 64]
k_cache_cross_2: float16[12, 1, 64, 1500]
v_cache_cross_2: float16[12, 1, 1500, 64]
k_cache_cross_3: float16[12, 1, 64, 1500]
v_cache_cross_3: float16[12, 1, 1500, 64]
k_cache_cross_4: float16[12, 1, 64, 1500]
v_cache_cross_4: float16[12, 1, 1500, 64]
k_cache_cross_5: float16[12, 1, 64, 1500]
v_cache_cross_5: float16[12, 1, 1500, 64]
k_cache_cross_6: float16[12, 1, 64, 1500]
v_cache_cross_6: float16[12, 1, 1500, 64]
k_cache_cross_7: float16[12, 1, 64, 1500]
v_cache_cross_7: float16[12, 1, 1500, 64]
k_cache_cross_8: float16[12, 1, 64, 1500]
v_cache_cross_8: float16[12, 1, 1500, 64]
k_cache_cross_9: float16[12, 1, 64, 1500]
v_cache_cross_9: float16[12, 1, 1500, 64]
k_cache_cross_10: float16[12, 1, 64, 1500]
v_cache_cross_10: float16[12, 1, 1500, 64]
k_cache_cross_11: float16[12, 1, 64, 1500]
v_cache_cross_11: float16[12, 1, 1500, 64]
position_ids: int32[1]
Completion Time
6/28/2025, 8:04:25 AM
Options
--qairt_version 2.33
Versions
  • ONNX Runtime: 1.21.1
  • QAIRT: v2.33.2.250410134701_117956
  • Windows: Windows 11 (26100)
  • Build ID: APSS.WP_HA.1.0.c90-07350-SC8380XPSRSFNWZA-2
  • AI Hub: aihub-2025.06.12.0
Estimated Inference Time
10.2 ms
Estimated Peak Memory Usage
286 MB
Compute Units
NPU
2277
StageTimeMemory
First App Load
981 ms366 MB
Subsequent App Load
610 ms366 MB
Inference
10.2 ms286 MB
ONNX RuntimeValue
execution_modeSEQUENTIAL
intra_op_num_threads0
inter_op_num_threads0
enable_memory_patternfalse
enable_cpu_memory_arenafalse
graph_optimization_levelENABLE_ALL
QNN Execution ProviderValue
htp_performance_mode"burst"
htp_graph_finalization_optimization_mode"3"
enable_htp_fp16_precision"1"
capture_network_visualizationsfalse

Sign up to run this model on a hosted Qualcomm® device!

Run on device