CVT

Construct a map view from sensors mounted on a vehicle.

Cross‑View Transformer generates real‑time bird's‑eye view maps from multiple vehicle cameras for autonomous driving.

Technical Details

Model checkpoint:vehicles_50k.pt
Inference latency:RealTime
Input resolution:1x6x3x224x480
Number of parameters:1.33M
Model size (float):5.18 MB

Applicable Scenarios

  • Object Detection

Supported Form Factors

  • Phone
  • Tablet
  • IoT

License

Model:MIT

Supported Devices

  • QCS8275 (Proxy)
  • QCS8450 (Proxy)
  • QCS8550 (Proxy)
  • QCS9075 (Proxy)
  • SA7255P ADP
  • SA8255 (Proxy)
  • SA8295P ADP
  • SA8650 (Proxy)
  • SA8775P ADP
  • Samsung Galaxy S21
  • Samsung Galaxy S21 Ultra
  • Samsung Galaxy S21+
  • Samsung Galaxy S22 5G
  • Samsung Galaxy S22 Ultra 5G
  • Samsung Galaxy S22+ 5G
  • Samsung Galaxy S23
  • Samsung Galaxy S23 Ultra
  • Samsung Galaxy S23+
  • Samsung Galaxy S24
  • Samsung Galaxy S24 Ultra
  • Samsung Galaxy S24+
  • Samsung Galaxy S25
  • Samsung Galaxy S25 Ultra
  • Samsung Galaxy S25+
  • Samsung Galaxy Tab S8
  • Snapdragon 8 Elite Gen 5 QRD
  • Snapdragon X Elite CRD
  • Snapdragon X Plus 8-Core CRD
  • Xiaomi 12
  • Xiaomi 12 Pro
  • XR2 Gen 2 (Proxy)

Supported Chipsets

  • Qualcomm® QCS8275 (Proxy)
  • Qualcomm® QCS8550 (Proxy)
  • Qualcomm® QCS9075 (Proxy)
  • Qualcomm® SA7255P
  • Qualcomm® SA8255P (Proxy)
  • Qualcomm® SA8295P
  • Qualcomm® SA8650P (Proxy)
  • Qualcomm® SA8775P
  • Snapdragon® 8 Elite Mobile
  • Snapdragon® 8 Elite Gen5 Mobile
  • Snapdragon® 8 Gen 1 Mobile
  • Snapdragon® 8 Gen 2 Mobile
  • Snapdragon® 8 Gen 3 Mobile
  • Snapdragon® 888 Mobile
  • Snapdragon® X Elite
  • Snapdragon® X Plus 8-Core

Looking for more? See models created by industry leaders.

Discover Model Makers