Profile Job Results

Job ID
jopr7yl9g
Status
Results Ready
Name
whisper_small_en_WhisperDecoder
Target Device
  • Snapdragon X Elite CRD
  • Windows 11
  • Snapdragon® X Elite | SC8380XP
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
x: int32[1, 1]
index: int32[1, 1]
k_cache_cross: float32[12, 12, 64, 1500]
v_cache_cross: float32[12, 12, 1500, 64]
k_cache_self: float32[12, 12, 64, 224]
v_cache_self: float32[12, 12, 224, 64]
Completion Time
8/27/2024, 8:32:59 AM
Versions
  • QNN: v2.25.0.240728104910_97711
  • QNN Backend API: 5.25.0
  • QNN Core API: 2.18.0
  • Windows: Windows 11 (26100)
  • AI Hub: aihub-2024.08.23.0
Estimated Inference Time
10.3 ms
Estimated Peak Memory Usage
61 MB
Compute Units
  • NPU: 2255
Stage                  Time       Memory
First App Load         1.13 s     18 MB
Subsequent App Load    1.11 s     18 MB
Inference              10.3 ms    61 MB
QNN                                                 Value
context_options.htp_options.performance_mode        BURST
default_graph_options.htp_options.precision         FLOAT16
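
The Input Specs listed earlier fix the size of every tensor passed to the decoder, so the per-input footprint at the declared dtypes can be worked out by hand. The following is a minimal sketch of that arithmetic using only the Python standard library. The names INPUT_SPECS, DTYPE_BYTES, and input_nbytes are illustrative and not part of AI Hub or QNN. The sizes are for the tensors as declared (float32 and int32); since the job runs with FLOAT16 precision, on-device buffers may differ, and these figures should not be read as a breakdown of the reported peak memory.

```python
from math import prod

# Input specs copied from the job page: name -> (dtype, shape).
INPUT_SPECS = {
    "x": ("int32", (1, 1)),
    "index": ("int32", (1, 1)),
    "k_cache_cross": ("float32", (12, 12, 64, 1500)),
    "v_cache_cross": ("float32", (12, 12, 1500, 64)),
    "k_cache_self": ("float32", (12, 12, 64, 224)),
    "v_cache_self": ("float32", (12, 12, 224, 64)),
}

# Bytes per element for the dtypes that appear above.
DTYPE_BYTES = {"int32": 4, "float32": 4}


def input_nbytes(specs):
    """Return {name: size in bytes} for each declared input tensor."""
    return {
        name: prod(shape) * DTYPE_BYTES[dtype]
        for name, (dtype, shape) in specs.items()
    }


sizes = input_nbytes(INPUT_SPECS)
total_bytes = sum(sizes.values())

# Print a small per-input report in MiB (1 MiB = 2**20 bytes).
for name, nbytes in sizes.items():
    print(f"{name:>14}: {nbytes / 2**20:8.2f} MiB")
print(f"{'total':>14}: {total_bytes / 2**20:8.2f} MiB")
```

The cross-attention caches dominate: each holds 12 × 12 × 64 × 1500 elements, against 12 × 12 × 64 × 224 for each self-attention cache.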
