Profile Job Results

Job ID
jpx11n4lg
Status
Results Ready
Name
trocr_float_TrOCRDecoder
Target Device
  • Dragonwing IQ-9075 EVK
  • Qualcomm Linux 1.5
  • Qualcomm® Dragonwing™ IQ-9075 | QCS9075
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
input_ids: int32[1, 1]
index: int32[1]
kv_0_attn_key: float32[1, 8, 19, 32]
kv_0_attn_val: float32[1, 8, 19, 32]
kv_0_cross_attn_key: float32[1, 8, 578, 32]
kv_0_cross_attn_val: float32[1, 8, 578, 32]
kv_1_attn_key: float32[1, 8, 19, 32]
kv_1_attn_val: float32[1, 8, 19, 32]
kv_1_cross_attn_key: float32[1, 8, 578, 32]
kv_1_cross_attn_val: float32[1, 8, 578, 32]
kv_2_attn_key: float32[1, 8, 19, 32]
kv_2_attn_val: float32[1, 8, 19, 32]
kv_2_cross_attn_key: float32[1, 8, 578, 32]
kv_2_cross_attn_val: float32[1, 8, 578, 32]
kv_3_attn_key: float32[1, 8, 19, 32]
kv_3_attn_val: float32[1, 8, 19, 32]
kv_3_cross_attn_key: float32[1, 8, 578, 32]
kv_3_cross_attn_val: float32[1, 8, 578, 32]
kv_4_attn_key: float32[1, 8, 19, 32]
kv_4_attn_val: float32[1, 8, 19, 32]
kv_4_cross_attn_key: float32[1, 8, 578, 32]
kv_4_cross_attn_val: float32[1, 8, 578, 32]
kv_5_attn_key: float32[1, 8, 19, 32]
kv_5_attn_val: float32[1, 8, 19, 32]
kv_5_cross_attn_key: float32[1, 8, 578, 32]
kv_5_cross_attn_val: float32[1, 8, 578, 32]
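
The input specs above follow a regular pattern: two int32 inputs, plus a self-attention key/value pair and a cross-attention key/value pair for each of six decoder layers. The sketch below rebuilds that list programmatically and totals the input tensor size. It assumes 4 bytes per element for both int32 and float32; the helper names `build_input_specs` and `total_input_bytes` are hypothetical and not part of any AI Hub API.

```python
from math import prod

NUM_LAYERS = 6  # kv_0 .. kv_5 in the listing above
DTYPE_BYTES = {"int32": 4, "float32": 4}  # assumed element sizes


def build_input_specs(num_layers=NUM_LAYERS):
    """Return {input_name: (shape, dtype)} mirroring the Input Specs list."""
    specs = {
        "input_ids": ((1, 1), "int32"),
        "index": ((1,), "int32"),
    }
    for i in range(num_layers):
        # Self-attention cache: 19 past positions per head.
        specs[f"kv_{i}_attn_key"] = ((1, 8, 19, 32), "float32")
        specs[f"kv_{i}_attn_val"] = ((1, 8, 19, 32), "float32")
        # Cross-attention cache: 578 encoder positions per head.
        specs[f"kv_{i}_cross_attn_key"] = ((1, 8, 578, 32), "float32")
        specs[f"kv_{i}_cross_attn_val"] = ((1, 8, 578, 32), "float32")
    return specs


def total_input_bytes(specs):
    """Sum the byte size of every input tensor."""
    return sum(prod(shape) * DTYPE_BYTES[dtype] for shape, dtype in specs.values())


if __name__ == "__main__":
    specs = build_input_specs()
    print(len(specs), "inputs,", total_input_bytes(specs), "bytes")
```

Under these assumptions the 26 inputs total 7,335,944 bytes (about 7.0 MiB), almost all of it in the cross-attention caches.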
Completion Time
1/31/2026, 7:34:26 PM
Versions
  • TensorFlow Lite: 2.17.0
  • QAIRT: v2.42.0.251225135753_193295
  • QNN TfLite Delegate: v2.42.0.251225135753_193295
  • Qualcomm Linux: 1.5-ver.1.1
  • AI Hub: aihub-2026.01.22.0
Estimated Inference Time
2.55 ms
Estimated Peak Memory Usage
0 ‑ 83 MB
Compute Units
  • NPU: 399
Stage                 Time      Memory
First App Load        5.76 s    703‑704 MB
Subsequent App Load   209 ms    76 MB
Inference             2.55 ms   0‑83 MB
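
The `input_ids` shape of `[1, 1]` suggests that each inference call handles a single decoder step, so the 2.55 ms estimate can be read as a per-step latency. The sketch below converts that figure into a step rate and a total time for a run of steps. It assumes steps run back to back with no host overhead and it excludes the encoder; the 20-step count is an illustrative value, not a number from this report, and the function names are hypothetical.

```python
INFERENCE_MS = 2.55  # estimated inference time reported for this job


def steps_per_second(latency_ms: float) -> float:
    """Decoder steps per second if each step takes latency_ms."""
    return 1000.0 / latency_ms


def decode_time_ms(num_steps: int, latency_ms: float) -> float:
    """Total decoder time for num_steps sequential steps."""
    return num_steps * latency_ms


if __name__ == "__main__":
    print(f"{steps_per_second(INFERENCE_MS):.0f} decoder steps/s")
    # 20 steps is an assumed sequence length for illustration only.
    print(f"{decode_time_ms(20, INFERENCE_MS):.1f} ms for 20 steps")
```

On these assumptions the decoder alone sustains roughly 392 steps per second, and a 20-step sequence takes about 51 ms of decoder time.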
TensorFlow Lite                     Value
number_of_threads                   4

QNN Delegate                        Value
backend_type                        kHtpBackend
log_level                           kLogLevelWarn
htp_options.performance_mode        kHtpBurst
htp_options.precision               kHtpFp16
htp_options.optimization_strategy   kHtpOptimizeForInference
htp_options.useConvHmx              true
