Qualcomm® AI HubAI Hub

Profile Job Results

Jobs
jz576wkrg
Results Ready
Name
whisper_small_en_WhisperDecoder
Target Device
Samsung Galaxy S24+ (14)
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
x: int32[1, 1]
index: int32[1, 1]
k_cache_cross: float32[12, 12, 64, 1500]
v_cache_cross: float32[12, 12, 1500, 64]
k_cache_self: float32[12, 12, 64, 224]
v_cache_self: float32[12, 12, 224, 64]
Completion Time
6/23/2024, 8:59:36 AM
Estimated Inference Time
53.3 ms
Estimated Peak Memory Usage
83 - 294 MB
Compute Units
NPU
2302
StageTimeMemory
Compilation
0 ms0 MB
First App Load
58.3 s3 GB
Subsequent App Load
1.04 min2,823-74 MB
Inference
53.3 ms83-294 MB

Sign up to run this model on a hosted Qualcomm® device!

Run on device