InternImage

InternImage: Large-Scale Vision Foundation Model.

InternImage uses DCNv3 (deformable convolution v3) as its core operator. DCNv3 gives the model the dynamic, effective receptive fields required for downstream tasks such as object detection and segmentation, and it enables adaptive spatial aggregation.
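To make "dynamic receptive field" and "adaptive spatial aggregation" concrete, here is a minimal sketch of deformable sampling at a single output location. It is a simplified illustration, not the DCNv3 operator itself: it handles one channel and one group, omits the learned input and output projections, and hard-codes the offsets and scores that a real network would predict from its input. The function names `bilinear`, `softmax`, and `deformable_sample` are hypothetical.

```python
import math


def bilinear(feature, y, x):
    """Bilinearly sample a 2D list at fractional (y, x); out-of-bounds reads as 0."""
    h, w = len(feature), len(feature[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yi, xi in ((y0, x0), (y0, x0 + 1), (y0 + 1, x0), (y0 + 1, x0 + 1)):
        if 0 <= yi < h and 0 <= xi < w:
            weight = (1 - abs(y - yi)) * (1 - abs(x - xi))
            value += weight * feature[yi][xi]
    return value


def softmax(scores):
    """Normalize raw scores so the aggregation weights sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def deformable_sample(feature, center, offsets, scores):
    """Aggregate a 3x3 neighborhood around `center`.

    Each of the 9 taps is shifted by its own (dy, dx) offset, which moves the
    receptive field, and weighted by a softmax-normalized score, which is the
    adaptive aggregation. In a real network both would be predicted per location.
    """
    grid = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]
    weights = softmax(scores)
    cy, cx = center
    return sum(
        w * bilinear(feature, cy + dy + oy, cx + dx + ox)
        for (dy, dx), (oy, ox), w in zip(grid, offsets, weights)
    )


# Toy 5x5 feature map whose value grows linearly with position.
feature = [[float(5 * y + x) for x in range(5)] for y in range(5)]
uniform_scores = [0.0] * 9

# Zero offsets reduce to an ordinary 3x3 average around the center.
regular = deformable_sample(feature, (2, 2), [(0.0, 0.0)] * 9, uniform_scores)

# Shifting every tap half a pixel to the right moves the sampled region.
shifted = deformable_sample(feature, (2, 2), [(0.0, 0.5)] * 9, uniform_scores)

print(regular, shifted)
```

With zero offsets and uniform scores the sketch behaves like a plain averaging kernel; non-zero offsets move where the kernel looks, and non-uniform scores change how much each sampled point contributes.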

Technical Details

Model checkpoint: internimage_t_1k_224
Input resolution: 1x3x224x224
Number of parameters: 30.6M
Model size (float): 117 MB
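The figures above can be cross-checked with a little arithmetic. The sketch below assumes that "float" means 32-bit floating point (4 bytes per value), that the input layout is batch x channels x height x width, and that the listed size is counted in units of 2^20 bytes; none of these assumptions is stated on the page.

```python
from math import prod

# Values copied from the Technical Details list.
INPUT_SHAPE = (1, 3, 224, 224)   # assumed layout: batch, channels, height, width
NUM_PARAMETERS = 30.6e6

# Assumption: "float" refers to float32, i.e. 4 bytes per value.
BYTES_PER_FLOAT32 = 4

input_elements = prod(INPUT_SHAPE)
input_bytes = input_elements * BYTES_PER_FLOAT32

weight_bytes = NUM_PARAMETERS * BYTES_PER_FLOAT32
weight_mib = weight_bytes / 2**20   # size in units of 2^20 bytes

print(f"input tensor: {input_elements} values, {input_bytes / 1024:.1f} KiB")
print(f"weights: about {weight_mib:.1f} MiB")
```

Under these assumptions the weights alone come to roughly 116.7 MiB, which is consistent with the listed 117 MB. The input tensor is comparatively tiny, so the parameter count dominates the model's footprint.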

Applicable Scenarios

  • Self-driving cars

License

Model: MIT

Supported Devices

  • Dragonwing Q-6690 MTP
  • Dragonwing RB3 Gen 2 Vision Kit
  • QCS8275 (Proxy)
  • QCS8450 (Proxy)
  • QCS8550 (Proxy)
  • QCS9075 (Proxy)
  • Samsung Galaxy S21
  • Samsung Galaxy S21 Ultra
  • Samsung Galaxy S21+
  • Samsung Galaxy S22 5G
  • Samsung Galaxy S22 Ultra 5G
  • Samsung Galaxy S22+ 5G
  • Samsung Galaxy S23
  • Samsung Galaxy S23 Ultra
  • Samsung Galaxy S23+
  • Samsung Galaxy S24
  • Samsung Galaxy S24 Ultra
  • Samsung Galaxy S24+
  • Samsung Galaxy S25
  • Samsung Galaxy S25 Ultra
  • Samsung Galaxy S25+
  • Samsung Galaxy Tab S8
  • Snapdragon 7 Gen 4 QRD
  • Snapdragon 8 Elite Gen 5 QRD
  • Snapdragon X Elite CRD
  • Snapdragon X Plus 8-Core CRD
  • Xiaomi 12
  • Xiaomi 12 Pro
  • XR2 Gen 2 (Proxy)

Supported Chipsets

  • Qualcomm® QCM6690
  • Qualcomm® QCS6490
  • Qualcomm® QCS8275 (Proxy)
  • Qualcomm® QCS8550 (Proxy)
  • Qualcomm® QCS9075 (Proxy)
  • Snapdragon® 7 Gen 4 Mobile
  • Snapdragon® 8 Elite Mobile
  • Snapdragon® 8 Elite Gen 5 Mobile
  • Snapdragon® 8 Gen 1 Mobile
  • Snapdragon® 8 Gen 2 Mobile
  • Snapdragon® 8 Gen 3 Mobile
  • Snapdragon® 888 Mobile
  • Snapdragon® X Elite
  • Snapdragon® X Plus 8-Core
