Profile Job Results

Jobs
jmg9ozwwg
Results Ready
Name
whisper_base_en_WhisperDecoder
Target Device
  • Snapdragon X Elite CRD
  • Windows 11
  • Snapdragon® X Elite | SC8380XP
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
x: int32[1, 1]
index: int32[1, 1]
k_cache_cross: float32[6, 8, 64, 1500]
v_cache_cross: float32[6, 8, 1500, 64]
k_cache_self: float32[6, 8, 64, 224]
v_cache_self: float32[6, 8, 224, 64]
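As a rough check on what the decoder is fed per call, the host-side size of each input can be worked out from the shapes and dtypes listed above. This is only a sketch: the dictionary and helper names are illustrative, and it counts raw tensor bytes only, not any runtime or NPU-side copies.

```python
from math import prod

# Shapes and dtypes copied from the Input Specs above.
INPUT_SPECS = {
    "x": ("int32", (1, 1)),
    "index": ("int32", (1, 1)),
    "k_cache_cross": ("float32", (6, 8, 64, 1500)),
    "v_cache_cross": ("float32", (6, 8, 1500, 64)),
    "k_cache_self": ("float32", (6, 8, 64, 224)),
    "v_cache_self": ("float32", (6, 8, 224, 64)),
}

# Both dtypes used by this model are 4 bytes wide.
BYTES_PER_ELEMENT = {"int32": 4, "float32": 4}


def input_nbytes(dtype, shape):
    """Raw size in bytes of one dense tensor with the given dtype and shape."""
    return prod(shape) * BYTES_PER_ELEMENT[dtype]


sizes = {name: input_nbytes(dtype, shape)
         for name, (dtype, shape) in INPUT_SPECS.items()}
total_bytes = sum(sizes.values())

for name, nbytes in sizes.items():
    print(f"{name:>14}: {nbytes:>12,} bytes")
print(f"{'total':>14}: {total_bytes:>12,} bytes ({total_bytes / 2**20:.1f} MiB)")
```

The two cross-attention caches dominate at 18,432,000 bytes each, and the six inputs together come to about 40.4 MiB.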
Completion Time
8/11/2024, 5:45:56 AM
Versions
  • ONNX Runtime: 1.18.1
  • QNN: v2.24.0.240626131148_96320
  • Windows: Windows 11 (26100)
  • AI Hub: aihub-2024.08.01.0
Estimated Inference Time
14.3 ms
Estimated Peak Memory Usage
108 MB
Compute Units
  • NPU: 844
Stage                  Time      Memory
First App Load         28.3 s    2 GB
Subsequent App Load    2.18 s    166 MB
Inference              14.3 ms   108 MB
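The stage figures can be turned into a couple of back-of-the-envelope numbers. The sketch below assumes each inference call is one decoder step (the x input is int32[1, 1]) and ignores the encoder, token sampling, and any host overhead, so treat the results as optimistic bounds rather than measured throughput. The 224-step run is a hypothetical workload chosen to match the self-attention cache length in the Input Specs.

```python
# Figures copied from the stage table above.
FIRST_LOAD_S = 28.3
SUBSEQUENT_LOAD_S = 2.18
INFERENCE_MS = 14.3

# Decoder steps per second if calls run back to back.
steps_per_second = 1000.0 / INFERENCE_MS

# How much faster a warm load is than the first load.
load_speedup = FIRST_LOAD_S / SUBSEQUENT_LOAD_S


def total_seconds(num_steps, load_s):
    """Load time plus num_steps sequential decoder calls, in seconds."""
    return load_s + num_steps * INFERENCE_MS / 1000.0


print(f"decoder steps per second: {steps_per_second:.1f}")
print(f"first load vs. subsequent load: {load_speedup:.1f}x")
print(f"224 steps after a warm load: {total_seconds(224, SUBSEQUENT_LOAD_S):.2f} s")
```

Under these assumptions the decoder sustains about 70 steps per second, and a subsequent app load is roughly 13 times faster than the first one.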
ONNX Runtime                Value
execution_mode              SEQUENTIAL
intra_op_num_threads        0
inter_op_num_threads        0
enable_memory_pattern       false
enable_cpu_memory_arena     false
graph_optimization_level    ENABLE_ALL
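For anyone reproducing these settings locally, the table above maps fairly directly onto onnxruntime's Python SessionOptions. The sketch below is an assumption about how one might do that, not the configuration code AI Hub itself runs. The dictionary and function names are illustrative, and the Python attribute names (enable_mem_pattern, enable_cpu_mem_arena) differ slightly from the labels shown in the report.

```python
# Values copied from the ONNX Runtime table above.
REPORTED_SESSION_OPTIONS = {
    "execution_mode": "SEQUENTIAL",
    "intra_op_num_threads": 0,   # 0 lets ONNX Runtime pick its default
    "inter_op_num_threads": 0,
    "enable_memory_pattern": False,
    "enable_cpu_memory_arena": False,
    "graph_optimization_level": "ENABLE_ALL",
}


def make_session_options(reported=REPORTED_SESSION_OPTIONS):
    """Build an onnxruntime.SessionOptions mirroring the reported values."""
    # Imported lazily so the dictionary above can be inspected without
    # onnxruntime installed.
    import onnxruntime as ort

    so = ort.SessionOptions()
    so.execution_mode = getattr(
        ort.ExecutionMode, "ORT_" + reported["execution_mode"])
    so.intra_op_num_threads = reported["intra_op_num_threads"]
    so.inter_op_num_threads = reported["inter_op_num_threads"]
    # The Python attributes use the shortened "mem" spelling.
    so.enable_mem_pattern = reported["enable_memory_pattern"]
    so.enable_cpu_mem_arena = reported["enable_cpu_memory_arena"]
    so.graph_optimization_level = getattr(
        ort.GraphOptimizationLevel, "ORT_" + reported["graph_optimization_level"])
    return so
```

The returned object would be passed as sess_options when creating an onnxruntime.InferenceSession.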
QNN Execution Provider                      Value
htp_performance_mode                        "burst"
htp_graph_finalization_optimization_mode    "3"
enable_htp_fp16_precision                   "1"
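The QNN Execution Provider values above are the kind of string options onnxruntime accepts as provider options. A minimal sketch of passing them follows, assuming an onnxruntime build with QNN support on Windows on Snapdragon. The backend_path value "QnnHtp.dll" is an assumption that does not appear in this report, and the function names are illustrative.

```python
# Values copied from the QNN Execution Provider table above (all strings).
REPORTED_QNN_OPTIONS = {
    "htp_performance_mode": "burst",
    "htp_graph_finalization_optimization_mode": "3",
    "enable_htp_fp16_precision": "1",
}


def qnn_providers(backend_path="QnnHtp.dll"):
    """Return a providers list for onnxruntime.InferenceSession.

    backend_path selects the HTP (NPU) backend library. It is an assumed
    value and is not part of the profile report.
    """
    options = dict(REPORTED_QNN_OPTIONS)
    options["backend_path"] = backend_path
    return [("QNNExecutionProvider", options)]


def create_session(model_path, backend_path="QnnHtp.dll"):
    """Create a session on the QNN EP; requires onnxruntime with QNN support."""
    import onnxruntime as ort  # imported lazily; not needed to build the options

    return ort.InferenceSession(model_path, providers=qnn_providers(backend_path))
```

Calling create_session with a path to the exported decoder model would attempt to run it on the NPU with the same performance mode, graph finalization level, and fp16 setting listed in the report.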
