InternImage

InternImage Large‑Scale Vision Foundation Model.

InternImage employs DCNv3 as its core operator to equips the model with dynamic and effective receptive fields required for downstream tasks like object detection and segmentation, while enabling adaptive spatial aggregation.

Technical Details

Model checkpoint:internimage_t_1k_224
Input resolution:1x3x224x224
Number of parameters:30.6M
Model size (float):117 MB

Applicable Scenarios

  • Self driving cars

License

Model:MIT

Supported IoT Devices

  • Dragonwing IQ-9075 EVK
  • QCS8275 (Proxy)
  • QCS8550 (Proxy)

Supported IoT Chipsets

  • Qualcomm® QCS8275 (Proxy)
  • Qualcomm® QCS8550 (Proxy)
  • Qualcomm® QCS9075

Looking for more? See models created by industry leaders.

Discover Model Makers