Qualcomm® AI HubAI Hub

Profile Job Results

Jobs
jgzjmj3op
Results Ready
Name
whisper_small_en_HfWhisperDecoder
Target Device
  • Snapdragon X Elite CRD
  • Windows 11
  • Snapdragon® X Elite | SC8380XP
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
input_ids: int32[1, 1]
position_ids: int32[1]
k_cache_self_0_in: float16[12, 1, 64, 199]
v_cache_self_0_in: float16[12, 1, 199, 64]
attention_mask: float16[1, 1, 1, 200]
k_cache_cross_0: float16[12, 1, 64, 1500]
v_cache_cross_0: float16[12, 1, 1500, 64]
k_cache_self_1_in: float16[12, 1, 64, 199]
v_cache_self_1_in: float16[12, 1, 199, 64]
k_cache_cross_1: float16[12, 1, 64, 1500]
v_cache_cross_1: float16[12, 1, 1500, 64]
k_cache_self_2_in: float16[12, 1, 64, 199]
v_cache_self_2_in: float16[12, 1, 199, 64]
k_cache_cross_2: float16[12, 1, 64, 1500]
v_cache_cross_2: float16[12, 1, 1500, 64]
k_cache_self_3_in: float16[12, 1, 64, 199]
v_cache_self_3_in: float16[12, 1, 199, 64]
k_cache_cross_3: float16[12, 1, 64, 1500]
v_cache_cross_3: float16[12, 1, 1500, 64]
k_cache_self_4_in: float16[12, 1, 64, 199]
v_cache_self_4_in: float16[12, 1, 199, 64]
k_cache_cross_4: float16[12, 1, 64, 1500]
v_cache_cross_4: float16[12, 1, 1500, 64]
k_cache_self_5_in: float16[12, 1, 64, 199]
v_cache_self_5_in: float16[12, 1, 199, 64]
k_cache_cross_5: float16[12, 1, 64, 1500]
v_cache_cross_5: float16[12, 1, 1500, 64]
k_cache_self_6_in: float16[12, 1, 64, 199]
v_cache_self_6_in: float16[12, 1, 199, 64]
k_cache_cross_6: float16[12, 1, 64, 1500]
v_cache_cross_6: float16[12, 1, 1500, 64]
k_cache_self_7_in: float16[12, 1, 64, 199]
v_cache_self_7_in: float16[12, 1, 199, 64]
k_cache_cross_7: float16[12, 1, 64, 1500]
v_cache_cross_7: float16[12, 1, 1500, 64]
k_cache_self_8_in: float16[12, 1, 64, 199]
v_cache_self_8_in: float16[12, 1, 199, 64]
k_cache_cross_8: float16[12, 1, 64, 1500]
v_cache_cross_8: float16[12, 1, 1500, 64]
k_cache_self_9_in: float16[12, 1, 64, 199]
v_cache_self_9_in: float16[12, 1, 199, 64]
k_cache_cross_9: float16[12, 1, 64, 1500]
v_cache_cross_9: float16[12, 1, 1500, 64]
k_cache_self_10_in: float16[12, 1, 64, 199]
v_cache_self_10_in: float16[12, 1, 199, 64]
k_cache_cross_10: float16[12, 1, 64, 1500]
v_cache_cross_10: float16[12, 1, 1500, 64]
k_cache_self_11_in: float16[12, 1, 64, 199]
v_cache_self_11_in: float16[12, 1, 199, 64]
k_cache_cross_11: float16[12, 1, 64, 1500]
v_cache_cross_11: float16[12, 1, 1500, 64]
Completion Time
8/23/2025, 7:29:06 PM
Options
--qairt_version latest
Versions
  • QAIRT: v2.37.0.250724175447_124859
  • QNN Backend API: 5.37.0
  • QNN Core API: 2.27.0
  • Windows: Windows 11 (26100)
  • Build ID: APSS.WP_HA.1.0-08200-SC8380XRELSFNWZA-3
  • AI Hub: aihub-2025.08.21.0
Estimated Inference Time
10.7 ms
Estimated Peak Memory Usage
392 MB
Compute Units
NPU
2277
StageTimeMemory
First App Load
28.4 s1 GB
Subsequent App Load
27.0 s1 GB
Inference
10.7 ms392 MB
QNNValue
context_options.htp_options.performance_modeBURST
default_graph_options.htp_options.optimizations[0].typeFINALIZE_OPTIMIZATION_FLAG
default_graph_options.htp_options.optimizations[0].value3.0
default_graph_options.htp_options.precisionFLOAT16
default_graph_options.htp_options.vtcm_size0

Sign up to run this model on a hosted Qualcomm® device!

Run on device