InternImage
InternImage is a large-scale vision foundation model.
InternImage uses DCNv3 as its core operator. DCNv3 gives the model the dynamic, effective receptive fields needed for downstream tasks such as object detection and segmentation, and it enables adaptive spatial aggregation.
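To make "dynamic receptive field" and "adaptive spatial aggregation" concrete, here is a minimal single-channel sketch of deformable sampling in pure Python. It is an illustration only, not the InternImage or DCNv3 implementation: the function names, the 3x3 sampling grid, and the softmax normalization of the per-point weights are assumptions chosen for this example, and the grouping and learned projections of a real operator are omitted.

```python
import math

# Regular 3x3 sampling grid around each output location (assumed for illustration).
GRID = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]

def bilinear_sample(feat, y, x):
    """Sample a 2-D map at fractional (y, x); positions outside the map read as 0."""
    h, w = len(feat), len(feat[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yi in (y0, y0 + 1):
        for xi in (x0, x0 + 1):
            if 0 <= yi < h and 0 <= xi < w:
                value += (1 - abs(y - yi)) * (1 - abs(x - xi)) * feat[yi][xi]
    return value

def softmax(logits):
    """Normalize a list of scores so the resulting weights sum to 1."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def deformable_aggregate(feat, offsets, logits):
    """Aggregate each location from 9 sampling points.

    feat:    H x W nested list of floats
    offsets: H x W x 9 list of (dy, dx) shifts added to the regular grid
             (in a real model these are predicted from the input)
    logits:  H x W x 9 scores, softmax-normalized into aggregation weights
    """
    h, w = len(feat), len(feat[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            weights = softmax(logits[i][j])
            for k, (dy, dx) in enumerate(GRID):
                off_y, off_x = offsets[i][j][k]
                out[i][j] += weights[k] * bilinear_sample(
                    feat, i + dy + off_y, j + dx + off_x
                )
    return out

# Usage: with zero offsets and equal scores the operator reduces to a
# zero-padded 3x3 mean filter; non-zero offsets move the sampling points.
feat = [[1.0] * 5 for _ in range(5)]
zero_offsets = [[[(0.0, 0.0)] * 9 for _ in range(5)] for _ in range(5)]
zero_logits = [[[0.0] * 9 for _ in range(5)] for _ in range(5)]
out = deformable_aggregate(feat, zero_offsets, zero_logits)
```

The point of the sketch is that both where the operator looks (the offsets) and how much each sample counts (the weights) depend on the input, rather than being fixed as in a standard convolution.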
Technical Details
Model checkpoint: internimage_t_1k_224
Input resolution: 1x3x224x224
Number of parameters: 30.6M
Model size (float): 117 MB
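The listed size is consistent with the parameter count if each parameter is stored as a 32-bit float and "MB" is read as 2^20 bytes. Both of those are assumptions, not statements from the listing. A quick check, which also counts the values in one input tensor (helper and variable names are illustrative):

```python
def float_size_mib(num_params, bytes_per_param=4):
    """Approximate weight storage in MiB, assuming 4-byte (float32) parameters."""
    return num_params * bytes_per_param / (1024 ** 2)

NUM_PARAMS = 30.6e6  # "Number of parameters: 30.6M"
size_mib = float_size_mib(NUM_PARAMS)

# Input resolution 1x3x224x224, read as batch x channels x height x width.
batch, channels, height, width = 1, 3, 224, 224
input_values = batch * channels * height * width

print(f"weights: about {size_mib:.1f} MiB")
print(f"input tensor: {input_values} values")
```

Under these assumptions the weights come to roughly 116.7 MiB, which rounds to the listed 117 MB.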
Applicable Scenarios
- Self-driving cars
License
Model: MIT
Supported Compute Devices
- Snapdragon X Elite CRD
- Snapdragon X Plus 8-Core CRD
Supported Compute Chipsets
- Snapdragon® X Elite
- Snapdragon® X Plus 8-Core