Qualcomm® AI Hub

Profile Job Results

Job ID
j5w3n9djp
Status
Results Ready
Name
trocr_float_TrOCRDecoder
Target Device
  • XR2 Gen 2 (Proxy)
  • Android 13
  • Qualcomm® QCS8450
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
index: int32[1]
input_ids: int32[1, 1]
kv_0_attn_key: float32[1, 8, 19, 32]
kv_0_attn_val: float32[1, 8, 19, 32]
kv_0_cross_attn_key: float32[1, 8, 578, 32]
kv_0_cross_attn_val: float32[1, 8, 578, 32]
kv_1_attn_key: float32[1, 8, 19, 32]
kv_1_attn_val: float32[1, 8, 19, 32]
kv_1_cross_attn_key: float32[1, 8, 578, 32]
kv_1_cross_attn_val: float32[1, 8, 578, 32]
kv_2_attn_key: float32[1, 8, 19, 32]
kv_2_attn_val: float32[1, 8, 19, 32]
kv_2_cross_attn_key: float32[1, 8, 578, 32]
kv_2_cross_attn_val: float32[1, 8, 578, 32]
kv_3_attn_key: float32[1, 8, 19, 32]
kv_3_attn_val: float32[1, 8, 19, 32]
kv_3_cross_attn_key: float32[1, 8, 578, 32]
kv_3_cross_attn_val: float32[1, 8, 578, 32]
kv_4_attn_key: float32[1, 8, 19, 32]
kv_4_attn_val: float32[1, 8, 19, 32]
kv_4_cross_attn_key: float32[1, 8, 578, 32]
kv_4_cross_attn_val: float32[1, 8, 578, 32]
kv_5_attn_key: float32[1, 8, 19, 32]
kv_5_attn_val: float32[1, 8, 19, 32]
kv_5_cross_attn_key: float32[1, 8, 578, 32]
kv_5_cross_attn_val: float32[1, 8, 578, 32]
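
To get a feel for how much data each decoder call takes as input, the input specs above can be summed by hand. The sketch below rebuilds the same 26 entries and adds up their sizes. The helper names (`input_specs`, `total_bytes`) are illustrative, not part of any AI Hub API, and the result is the host-side float32 size only; it says nothing about how the runtime stores these tensors on the device.

```python
from math import prod

# Bytes per element for the two dtypes that appear in the input specs.
DTYPE_BYTES = {"int32": 4, "float32": 4}


def input_specs(num_layers=6, self_len=19, cross_len=578, heads=8, head_dim=32):
    """Rebuild the listed input specs as {name: (dtype, shape)}."""
    specs = {"index": ("int32", (1,)), "input_ids": ("int32", (1, 1))}
    for i in range(num_layers):
        for kind, length in (("attn", self_len), ("cross_attn", cross_len)):
            for part in ("key", "val"):
                name = f"kv_{i}_{kind}_{part}"
                specs[name] = ("float32", (1, heads, length, head_dim))
    return specs


def total_bytes(specs):
    """Sum element_count * bytes_per_element over every input."""
    return sum(DTYPE_BYTES[dtype] * prod(shape) for dtype, shape in specs.values())


specs = input_specs()
size = total_bytes(specs)
print(f"{len(specs)} inputs, {size} bytes ({size / 2**20:.2f} MiB)")
```

Almost all of the total comes from the twelve cross-attention tensors (591,872 bytes each); the twelve self-attention tensors contribute 19,456 bytes each.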
Completion Time
10/4/2025, 9:55:49 PM
Versions
  • QAIRT: v2.38.0.250901140452_125126
  • QNN Backend API: 5.38.0
  • QNN Core API: 2.28.1
  • Android: 13 (TP1A.220624.014)
  • AI Hub: aihub-2025.09.26.0
Estimated Inference Time
2.86 ms
Estimated Peak Memory Usage
4 ‑ 130 MB
Compute Units
  • NPU: 382
Stage                  Time       Memory
First App Load         4.43 s     729‑738 MB
Subsequent App Load    4.34 s     748‑1,014 MB
Inference              2.86 ms    4‑130 MB
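
The stage figures above mix seconds and milliseconds, so it can help to put them on one scale. The following sketch converts them and derives two rough numbers: how many inference calls fit in one second, and how many calls take as long as the first app load. It assumes calls run back to back with no host-side overhead, which a real application will not achieve; the variable names are illustrative.

```python
# Unit conversion factors to milliseconds.
UNIT_TO_MS = {"s": 1000.0, "ms": 1.0}


def to_ms(value, unit):
    """Convert a reported time to milliseconds."""
    return value * UNIT_TO_MS[unit]


first_load_ms = to_ms(4.43, "s")   # First App Load
inference_ms = to_ms(2.86, "ms")   # Inference

# Upper bound on calls per second, ignoring any overhead between calls.
calls_per_second = 1000.0 / inference_ms

# Number of inference calls that take as long as one first load.
calls_per_load = first_load_ms / inference_ms

print(f"~{calls_per_second:.0f} calls/s; first load costs ~{calls_per_load:.0f} calls")
```

Under these assumptions the load time dominates short sessions: roughly 1,500 inference calls are needed before cumulative inference time matches the first load.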
QNN                                                         Value
context_options.htp_options.performance_mode                BURST
default_graph_options.htp_options.optimizations[0].type     FINALIZE_OPTIMIZATION_FLAG
default_graph_options.htp_options.optimizations[0].value    3.0
default_graph_options.htp_options.precision                 FLOAT16
default_graph_options.htp_options.vtcm_size                 0
