Profile Job Results

    j1p80dnkg
    Results Ready
    Name
    whisper_small_en_WhisperDecoder
    Target Device
    QCS8550 (Proxy) (12)
    Creator
    ai-hub-support@qti.qualcomm.com
    Target Model
    Input Specs
    x: int32[1, 1]
    index: int32[1, 1]
    mask: int32[1, 224, 768]
    b0_cross_attn_k: float32[1, 1500, 768]
    b0_cross_attn_v: float32[1, 1500, 768]
    b1_cross_attn_k: float32[1, 1500, 768]
    b1_cross_attn_v: float32[1, 1500, 768]
    b2_cross_attn_k: float32[1, 1500, 768]
    b2_cross_attn_v: float32[1, 1500, 768]
    b3_cross_attn_k: float32[1, 1500, 768]
    b3_cross_attn_v: float32[1, 1500, 768]
    b4_cross_attn_k: float32[1, 1500, 768]
    b4_cross_attn_v: float32[1, 1500, 768]
    b5_cross_attn_k: float32[1, 1500, 768]
    b5_cross_attn_v: float32[1, 1500, 768]
    b6_cross_attn_k: float32[1, 1500, 768]
    b6_cross_attn_v: float32[1, 1500, 768]
    b7_cross_attn_k: float32[1, 1500, 768]
    b7_cross_attn_v: float32[1, 1500, 768]
    b8_cross_attn_k: float32[1, 1500, 768]
    b8_cross_attn_v: float32[1, 1500, 768]
    b9_cross_attn_k: float32[1, 1500, 768]
    b9_cross_attn_v: float32[1, 1500, 768]
    b10_cross_attn_k: float32[1, 1500, 768]
    b10_cross_attn_v: float32[1, 1500, 768]
    b11_cross_attn_k: float32[1, 1500, 768]
    b11_cross_attn_v: float32[1, 1500, 768]
    b0_self_attn_k: float32[1, 224, 768]
    b0_self_attn_v: float32[1, 224, 768]
    b1_self_attn_k: float32[1, 224, 768]
    b1_self_attn_v: float32[1, 224, 768]
    b2_self_attn_k: float32[1, 224, 768]
    b2_self_attn_v: float32[1, 224, 768]
    b3_self_attn_k: float32[1, 224, 768]
    b3_self_attn_v: float32[1, 224, 768]
    b4_self_attn_k: float32[1, 224, 768]
    b4_self_attn_v: float32[1, 224, 768]
    b5_self_attn_k: float32[1, 224, 768]
    b5_self_attn_v: float32[1, 224, 768]
    b6_self_attn_k: float32[1, 224, 768]
    b6_self_attn_v: float32[1, 224, 768]
    b7_self_attn_k: float32[1, 224, 768]
    b7_self_attn_v: float32[1, 224, 768]
    b8_self_attn_k: float32[1, 224, 768]
    b8_self_attn_v: float32[1, 224, 768]
    b9_self_attn_k: float32[1, 224, 768]
    b9_self_attn_v: float32[1, 224, 768]
    b10_self_attn_k: float32[1, 224, 768]
    b10_self_attn_v: float32[1, 224, 768]
    b11_self_attn_k: float32[1, 224, 768]
    b11_self_attn_v: float32[1, 224, 768]
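
The input specs above follow a regular pattern: three small integer inputs, then a key/value pair of cross-attention and self-attention caches for each of the 12 decoder blocks (b0 to b11). As a minimal sketch, the listing could be rebuilt programmatically as shown below. The helper names `build_input_specs` and `total_input_bytes` are hypothetical, and the `name -> (shape, dtype)` dictionary layout is an assumption chosen for illustration, not something this page specifies.

```python
from math import prod

NUM_BLOCKS = 12  # blocks b0..b11, as listed in the input specs
DTYPE_BYTES = {"int32": 4, "float32": 4}  # element sizes in bytes


def build_input_specs():
    """Rebuild the listed inputs as name -> (shape, dtype). Hypothetical helper."""
    specs = {
        "x": ((1, 1), "int32"),
        "index": ((1, 1), "int32"),
        "mask": ((1, 224, 768), "int32"),
    }
    # Cross-attention key/value caches, one pair per decoder block.
    for i in range(NUM_BLOCKS):
        specs[f"b{i}_cross_attn_k"] = ((1, 1500, 768), "float32")
        specs[f"b{i}_cross_attn_v"] = ((1, 1500, 768), "float32")
    # Self-attention key/value caches, one pair per decoder block.
    for i in range(NUM_BLOCKS):
        specs[f"b{i}_self_attn_k"] = ((1, 224, 768), "float32")
        specs[f"b{i}_self_attn_v"] = ((1, 224, 768), "float32")
    return specs


def total_input_bytes(specs):
    """Raw size of all input tensors if each were stored densely."""
    return sum(prod(shape) * DTYPE_BYTES[dtype] for shape, dtype in specs.values())


input_specs = build_input_specs()
print(len(input_specs))                 # 51
print(total_input_bytes(input_specs))   # 127795208
```

The byte count is only the dense size of the listed tensors. It is not a memory measurement and should not be compared directly with the peak memory figures reported on this page.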
    Completion Time
    4/16/2024, 7:47:38 PM
    Estimated Inference Time
    46.0 ms
    Estimated Peak Memory Usage
    16 - 19 MB
    Compute Units
    NPU: 879
    CPU: 2
    Stage                  Time       Memory
    First App Load         41.1 s     4 GB
    Subsequent App Load    1.18 s     350-352 MB
    Inference              46.0 ms    16-19 MB
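
The reported timings can be turned into rough throughput figures with simple arithmetic. The sketch below assumes that one inference call decodes one token (inferred from the `x: int32[1, 1]` input) and that 224, the self-attention cache length, is the largest number of decode steps. Both are assumptions, and the estimate ignores the encoder and any pre- or post-processing. The function names are hypothetical.

```python
# Values copied from the profile results on this page.
INFERENCE_MS = 46.0        # estimated inference time per call
FIRST_LOAD_S = 41.1        # first app load
SUBSEQUENT_LOAD_S = 1.18   # subsequent app load


def steps_per_second(latency_ms):
    """Decoder calls per second at the given per-call latency."""
    return 1000.0 / latency_ms


def decode_time_s(num_steps, latency_ms=INFERENCE_MS):
    """Total decoder time for num_steps calls (assumes one token per call)."""
    return num_steps * latency_ms / 1000.0


print(round(steps_per_second(INFERENCE_MS), 1))      # 21.7
print(round(decode_time_s(224), 2))                  # 10.3
print(round(FIRST_LOAD_S / SUBSEQUENT_LOAD_S, 1))    # 34.8
```

The last line shows that the first load takes roughly 35 times as long as later loads, according to the table.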
