Profile Job Results

Jobs
jgkd7j1yp
Results Ready
Name
pi05_action_expert
Target Device
  • Snapdragon X Elite CRD
  • Windows 11
  • Snapdragon® X Elite | SC8380XP
Creator
ai-hub-support@qti.qualcomm.com
Input Specs
full_att_4d: float32[1, 1, 50, 1018]
rope_emb_sin: float32[1, 50, 1, 128]
rope_emb_cos: float32[1, 50, 1, 128]
x_t: float32[1, 50, 32]
time_step: float32[1]
key_cache_l0: float32[1, 968, 1, 256]
key_cache_l1: float32[1, 968, 1, 256]
key_cache_l2: float32[1, 968, 1, 256]
key_cache_l3: float32[1, 968, 1, 256]
key_cache_l4: float32[1, 968, 1, 256]
key_cache_l5: float32[1, 968, 1, 256]
key_cache_l6: float32[1, 968, 1, 256]
key_cache_l7: float32[1, 968, 1, 256]
key_cache_l8: float32[1, 968, 1, 256]
key_cache_l9: float32[1, 968, 1, 256]
key_cache_l10: float32[1, 968, 1, 256]
key_cache_l11: float32[1, 968, 1, 256]
key_cache_l12: float32[1, 968, 1, 256]
key_cache_l13: float32[1, 968, 1, 256]
key_cache_l14: float32[1, 968, 1, 256]
key_cache_l15: float32[1, 968, 1, 256]
key_cache_l16: float32[1, 968, 1, 256]
key_cache_l17: float32[1, 968, 1, 256]
value_cache_l0: float32[1, 968, 1, 256]
value_cache_l1: float32[1, 968, 1, 256]
value_cache_l2: float32[1, 968, 1, 256]
value_cache_l3: float32[1, 968, 1, 256]
value_cache_l4: float32[1, 968, 1, 256]
value_cache_l5: float32[1, 968, 1, 256]
value_cache_l6: float32[1, 968, 1, 256]
value_cache_l7: float32[1, 968, 1, 256]
value_cache_l8: float32[1, 968, 1, 256]
value_cache_l9: float32[1, 968, 1, 256]
value_cache_l10: float32[1, 968, 1, 256]
value_cache_l11: float32[1, 968, 1, 256]
value_cache_l12: float32[1, 968, 1, 256]
value_cache_l13: float32[1, 968, 1, 256]
value_cache_l14: float32[1, 968, 1, 256]
value_cache_l15: float32[1, 968, 1, 256]
value_cache_l16: float32[1, 968, 1, 256]
value_cache_l17: float32[1, 968, 1, 256]
Completion Time
5/30/2026, 8:06:58 AM
Options
--qairt_version 2.42
Versions
  • ONNX Runtime: 1.25.0
  • QAIRT: v2.42.0.251225135753_193295
  • Windows: Windows 11 (26200)
  • Build ID: APSS.WP_HA.1.0-09300-SC8380XRELSFNWZA-12
  • AI Hub: aihub-2026.05.27.0
Estimated Inference Time
27.9 ms
Estimated Peak Memory Usage
385 MB
Compute Units
NPU
3836
StageTimeMemory
First App Load
1.00 s444 MB
Subsequent App Load
789 ms444 MB
Inference
27.9 ms385 MB
ONNX RuntimeValue
execution_modeSEQUENTIAL
intra_op_num_threads0
inter_op_num_threads0
enable_memory_patternfalse
enable_cpu_memory_arenafalse
graph_optimization_levelENABLE_ALL
QNN Execution ProviderValue
htp_performance_mode"burst"
htp_graph_finalization_optimization_mode"3"
enable_htp_fp16_precision"1"
capture_network_visualizationsfalse
context_priority"normal"
offload_graph_io_quantization"1"

Sign up to run this model on a hosted Qualcomm® device!

Run on device