Profile Job Results

Job ID
jp836wzxg
Status
Results Ready
Name
zipformer_encoder
Target Device
  • Snapdragon X2 Elite CRD
  • Windows 11
  • Snapdragon® X2 Elite | SC8480XP
Creator
ai-hub-support@qti.qualcomm.com
Target Model
Input Specs
cached_len_0: int32[2, 1]
cached_len_1: int32[4, 1]
cached_len_2: int32[3, 1]
cached_len_3: int32[2, 1]
cached_len_4: int32[4, 1]
x: float32[1, 71, 80]
cached_avg_0: float32[2, 1, 384]
cached_avg_1: float32[4, 1, 384]
cached_avg_2: float32[3, 1, 384]
cached_avg_3: float32[2, 1, 384]
cached_avg_4: float32[4, 1, 384]
cached_key_0: float32[2, 128, 1, 192]
cached_key_1: float32[4, 64, 1, 192]
cached_key_2: float32[3, 32, 1, 192]
cached_key_3: float32[2, 16, 1, 192]
cached_key_4: float32[4, 64, 1, 192]
cached_val_0: float32[2, 128, 1, 96]
cached_val_1: float32[4, 64, 1, 96]
cached_val_2: float32[3, 32, 1, 96]
cached_val_3: float32[2, 16, 1, 96]
cached_val_4: float32[4, 64, 1, 96]
cached_val2_0: float32[2, 128, 1, 96]
cached_val2_1: float32[4, 64, 1, 96]
cached_val2_2: float32[3, 32, 1, 96]
cached_val2_3: float32[2, 16, 1, 96]
cached_val2_4: float32[4, 64, 1, 96]
cached_conv1_0: float32[2, 1, 384, 30]
cached_conv1_1: float32[4, 1, 384, 30]
cached_conv1_2: float32[3, 1, 384, 30]
cached_conv1_3: float32[2, 1, 384, 30]
cached_conv1_4: float32[4, 1, 384, 30]
cached_conv2_0: float32[2, 1, 384, 30]
cached_conv2_1: float32[4, 1, 384, 30]
cached_conv2_2: float32[3, 1, 384, 30]
cached_conv2_3: float32[2, 1, 384, 30]
cached_conv2_4: float32[4, 1, 384, 30]
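
The input list above follows a regular pattern, so it can be rebuilt programmatically and used to estimate how much data is passed in per call. The following is a minimal sketch, not part of the job output. The names `STACK_DIM`, `CACHE_LEN`, `build_input_specs`, and `total_input_bytes` are hypothetical, and reading the leading dimension as a per-stack size is an assumption based only on the shapes shown.

```python
from math import prod

# Leading dimension of each cached tensor for stacks 0..4, and the second
# dimension of the key/value caches. Both are read off the Input Specs list;
# what they mean inside the model is an assumption.
STACK_DIM = [2, 4, 3, 2, 4]
CACHE_LEN = [128, 64, 32, 16, 64]
BYTES_PER_ELEMENT = {"int32": 4, "float32": 4}


def build_input_specs():
    """Return {input_name: (shape, dtype)} in the order listed above."""
    specs = {}
    for i, n in enumerate(STACK_DIM):
        specs[f"cached_len_{i}"] = ((n, 1), "int32")
    specs["x"] = ((1, 71, 80), "float32")
    for i, n in enumerate(STACK_DIM):
        specs[f"cached_avg_{i}"] = ((n, 1, 384), "float32")
    for i, (n, t) in enumerate(zip(STACK_DIM, CACHE_LEN)):
        specs[f"cached_key_{i}"] = ((n, t, 1, 192), "float32")
    for name in ("cached_val", "cached_val2"):
        for i, (n, t) in enumerate(zip(STACK_DIM, CACHE_LEN)):
            specs[f"{name}_{i}"] = ((n, t, 1, 96), "float32")
    for name in ("cached_conv1", "cached_conv2"):
        for i, n in enumerate(STACK_DIM):
            specs[f"{name}_{i}"] = ((n, 1, 384, 30), "float32")
    return specs


def total_input_bytes(specs):
    """Sum the raw tensor sizes of all inputs, in bytes."""
    return sum(
        prod(shape) * BYTES_PER_ELEMENT[dtype] for shape, dtype in specs.values()
    )


specs = build_input_specs()
print(len(specs), "inputs,", total_input_bytes(specs), "bytes")
```

Under these assumptions the 36 inputs add up to roughly 2.8 MB of raw tensor data per call, most of it in the key and convolution caches.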
Completion Time
4/5/2026, 3:15:55 AM
Options
--qairt_version 2.42 --max_profiler_iterations 10
Versions
  • ONNX Runtime: 1.24.3
  • QAIRT: v2.42.0.251225135753_193295
  • Windows: Windows 11 (28000)
  • Build ID: APSS.WP_GL.1.0.c4-04500-SC8480XRELCSP4ZA-5
  • AI Hub: aihub-2026.03.29.0
Estimated Inference Time
8.35 ms
Estimated Peak Memory Usage
75 MB
Compute Units
  • NPU: 2649
Stage                  Time       Memory
First App Load         413 ms     103 MB
Subsequent App Load    337 ms     103 MB
Inference              8.35 ms    75 MB
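
To put the load and inference times in proportion, the figures above can be combined with simple arithmetic. This is a rough sketch only: it assumes every call costs the estimated inference time and ignores input/output copies and scheduling overhead, and the function names are hypothetical.

```python
# Timings reported in the stage table above, in milliseconds.
FIRST_LOAD_MS = 413.0
SUBSEQUENT_LOAD_MS = 337.0
INFERENCE_MS = 8.35


def inferences_per_second(inference_ms=INFERENCE_MS):
    """Upper bound on back-to-back calls per second at the estimated latency."""
    return 1000.0 / inference_ms


def session_time_ms(num_inferences, first_load=True):
    """Load time plus num_inferences calls at the estimated latency."""
    load_ms = FIRST_LOAD_MS if first_load else SUBSEQUENT_LOAD_MS
    return load_ms + num_inferences * INFERENCE_MS


def load_cost_in_inferences(first_load=True):
    """How many inference calls take as long as one app load."""
    load_ms = FIRST_LOAD_MS if first_load else SUBSEQUENT_LOAD_MS
    return load_ms / INFERENCE_MS


print(round(inferences_per_second(), 1), "calls/s")
print(round(load_cost_in_inferences(), 1), "calls per first load")
```

On these numbers a first load costs about as much as 50 inference calls, so the load time matters mainly for short-lived sessions.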
ONNX Runtime                Value
execution_mode              SEQUENTIAL
intra_op_num_threads        0
inter_op_num_threads        0
enable_memory_pattern       false
enable_cpu_memory_arena     false
graph_optimization_level    ENABLE_ALL
QNN Execution Provider                      Value
htp_performance_mode                        "burst"
htp_graph_finalization_optimization_mode    "3"
enable_htp_fp16_precision                   "1"
capture_network_visualizations              false
context_priority                            "normal"
offload_graph_io_quantization               "1"
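
For reference, settings like those in the two tables above could be applied to a local ONNX Runtime session roughly as follows. This is a sketch under assumptions, not the code AI Hub ran: the `backend_path` value and the `make_session` helper are hypothetical, and `context_priority` and `capture_network_visualizations` are left out because this page does not confirm the option names ONNX Runtime expects for them.

```python
# QNN Execution Provider options taken from the table above.
# Provider option values are passed to ONNX Runtime as strings.
QNN_PROVIDER_OPTIONS = {
    "backend_path": "QnnHtp.dll",  # assumption: HTP backend library on Windows
    "htp_performance_mode": "burst",
    "htp_graph_finalization_optimization_mode": "3",
    "enable_htp_fp16_precision": "1",
    "offload_graph_io_quantization": "1",
}


def make_session(model_path):
    """Create an InferenceSession mirroring the settings listed above.

    Requires an onnxruntime build that includes the QNN Execution Provider;
    the import is deferred so the options can be inspected without it.
    """
    import onnxruntime as ort

    so = ort.SessionOptions()
    so.execution_mode = ort.ExecutionMode.ORT_SEQUENTIAL
    so.intra_op_num_threads = 0  # 0 lets ONNX Runtime choose
    so.inter_op_num_threads = 0
    so.enable_mem_pattern = False
    so.enable_cpu_mem_arena = False
    so.graph_optimization_level = ort.GraphOptimizationLevel.ORT_ENABLE_ALL

    return ort.InferenceSession(
        model_path,
        sess_options=so,
        providers=[("QNNExecutionProvider", QNN_PROVIDER_OPTIONS)],
    )
```

A call such as `make_session("zipformer_encoder.onnx")` would then build the session; the file name here is only a placeholder for wherever the exported model lives.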
