Profile Job Results

Jobs
jpx16nvjg
Results Ready
Name
opus_mt_en_es_float_OpusMTDecoder
Target Device
  • Snapdragon 8 Elite Gen 5 QRD
  • Android 16
  • Snapdragon® 8 Elite Gen 5 | SM8850
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
input_ids: int32[1, 1]
position: int32[1]
encoder_attention_mask: int32[1, 256]
block_0_past_self_key_states: float16[1, 8, 255, 64]
block_0_past_self_value_states: float16[1, 8, 255, 64]
block_0_cross_key_states: float16[1, 8, 256, 64]
block_0_cross_value_states: float16[1, 8, 256, 64]
block_1_past_self_key_states: float16[1, 8, 255, 64]
block_1_past_self_value_states: float16[1, 8, 255, 64]
block_1_cross_key_states: float16[1, 8, 256, 64]
block_1_cross_value_states: float16[1, 8, 256, 64]
block_2_past_self_key_states: float16[1, 8, 255, 64]
block_2_past_self_value_states: float16[1, 8, 255, 64]
block_2_cross_key_states: float16[1, 8, 256, 64]
block_2_cross_value_states: float16[1, 8, 256, 64]
block_3_past_self_key_states: float16[1, 8, 255, 64]
block_3_past_self_value_states: float16[1, 8, 255, 64]
block_3_cross_key_states: float16[1, 8, 256, 64]
block_3_cross_value_states: float16[1, 8, 256, 64]
block_4_past_self_key_states: float16[1, 8, 255, 64]
block_4_past_self_value_states: float16[1, 8, 255, 64]
block_4_cross_key_states: float16[1, 8, 256, 64]
block_4_cross_value_states: float16[1, 8, 256, 64]
block_5_past_self_key_states: float16[1, 8, 255, 64]
block_5_past_self_value_states: float16[1, 8, 255, 64]
block_5_cross_key_states: float16[1, 8, 256, 64]
block_5_cross_value_states: float16[1, 8, 256, 64]
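
As a rough cross-check of the input specs above, the sketch below rebuilds the listing as a Python dictionary and sums the bytes passed in per decoder call. The helper names (`build_input_specs`, `tensor_bytes`) and the dictionary layout are illustrative assumptions, not part of the job output. Only the tensor names, shapes, and dtypes come from the listing.

```python
from math import prod

# Bytes per element for the two dtypes that appear in the input specs.
DTYPE_BYTES = {"int32": 4, "float16": 2}

NUM_BLOCKS = 6  # block_0 through block_5 in the listing


def build_input_specs():
    """Recreate the input listing as {name: (shape, dtype)} (hypothetical layout)."""
    specs = {
        "input_ids": ((1, 1), "int32"),
        "position": ((1,), "int32"),
        "encoder_attention_mask": ((1, 256), "int32"),
    }
    for i in range(NUM_BLOCKS):
        # Self-attention cache holds 255 past positions; cross-attention holds 256.
        specs[f"block_{i}_past_self_key_states"] = ((1, 8, 255, 64), "float16")
        specs[f"block_{i}_past_self_value_states"] = ((1, 8, 255, 64), "float16")
        specs[f"block_{i}_cross_key_states"] = ((1, 8, 256, 64), "float16")
        specs[f"block_{i}_cross_value_states"] = ((1, 8, 256, 64), "float16")
    return specs


def tensor_bytes(shape, dtype):
    """Size of one tensor in bytes: element count times element width."""
    return prod(shape) * DTYPE_BYTES[dtype]


specs = build_input_specs()
total = sum(tensor_bytes(shape, dtype) for shape, dtype in specs.values())
print(f"{len(specs)} inputs, {total} bytes ({total / 2**20:.2f} MiB)")
```

Under these specs the 27 inputs total 6,280,200 bytes, almost all of it key/value cache, which is small relative to the reported peak memory range.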
Completion Time
1/27/2026, 5:27:09 AM
Options
--qairt_version 2.41
Versions
  • QAIRT: v2.41.0.251128145156_191518
  • QNN Backend API: 5.41.0
  • QNN Core API: 2.31.0
  • Android: 16 (BQ2A.251016.001-BP2A.250605.031.A3)
  • Build ID: Kaanapali.LA.1.0.r1-01040-STD.PROD-1
  • AI Hub: aihub-2026.01.22.0
Estimated Inference Time
1.87 ms
Estimated Peak Memory Usage
0 ‑ 243 MB
Compute Units
  • NPU: 350
Stage                  Time      Memory
First App Load         3.83 s    690‑697 MB
Subsequent App Load    113 ms    0‑244 MB
Inference              1.87 ms   0‑243 MB
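
The stage timings above can be turned into a back-of-the-envelope latency estimate. The sketch below assumes each decoder call emits one token (consistent with `input_ids: int32[1, 1]`) and that a full generation runs at most 255 steps, matching the past self-attention cache length. Both are assumptions. The estimate covers the decoder only and ignores the encoder pass, sampling, and host-side overhead. The function names are hypothetical.

```python
# Values copied from the stage timings, converted to milliseconds.
INFERENCE_MS = 1.87         # one decoder call
FIRST_LOAD_MS = 3830.0      # First App Load (3.83 s)
SUBSEQUENT_LOAD_MS = 113.0  # Subsequent App Load

# Assumption: at most one step per slot in the 255-entry self-attention cache.
MAX_STEPS = 255


def decode_steps_per_second(step_ms):
    """Decoder calls per second if calls run back to back."""
    return 1000.0 / step_ms


def total_ms(load_ms, steps, step_ms=INFERENCE_MS):
    """Load time plus `steps` sequential decoder calls, in milliseconds."""
    return load_ms + steps * step_ms


rate = decode_steps_per_second(INFERENCE_MS)
warm = total_ms(SUBSEQUENT_LOAD_MS, MAX_STEPS)
cold = total_ms(FIRST_LOAD_MS, MAX_STEPS)
print(f"{rate:.0f} steps/s, warm {warm:.2f} ms, cold {cold:.2f} ms")
```

On these numbers the decoder sustains roughly 535 steps per second, so model load time dominates a cold start, while a warm start with a full 255-step generation stays under 0.6 s.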
QNN                                                         Value
context_options.htp_options.performance_mode                BURST
default_graph_options.htp_options.optimizations[0].type     FINALIZE_OPTIMIZATION_FLAG
default_graph_options.htp_options.optimizations[0].value    3.0
default_graph_options.htp_options.precision                 FLOAT16
default_graph_options.htp_options.vtcm_size                 0
