HomeMobile ModelsSegment-Anything-Model

    Segment-Anything-Model

    High-quality segmentation mask generation around any object in an image with simple input prompt.

    Transformer based encoder-decoder where prompts specify what to segment in an image thereby allowing segmentation without the need for additional training. The image encoder generates embeddings and the lightweight decoder operates on the embeddings for point and mask based image segmentation.

    TorchScriptTFLite
    33.6ms
    Inference Time
    0-235MB
    Memory Usage
    340NPU
    Layers

    Technical Details

    Model checkpoint:vit_l
    Input resolution:720p (720x1280)
    Number of parameters (SAMDecoder):5.11M
    Model size (SAMDecoder):19.6 MB

    Applicable Scenarios

    • Factory Automation
    • Robotic Navigation
    • Camera

    Supported Mobile Form Factors

    • Phone
    • Tablet

    Licenses

    Source Model:APACHE-2.0
    Deployable Model:AI Model Hub License

    Tags

    • foundation
      A “foundation” model is versatile and designed for multi-task capabilities, without the need for fine-tuning.

    Supported Mobile Devices

    • Google Pixel 3
    • Google Pixel 3a
    • Google Pixel 3a XL
    • Google Pixel 4
    • Google Pixel 4a
    • Google Pixel 5a 5G
    • Samsung Galaxy S21
    • Samsung Galaxy S21 Ultra
    • Samsung Galaxy S21+
    • Samsung Galaxy S22 5G
    • Samsung Galaxy S22 Ultra 5G
    • Samsung Galaxy S22+ 5G
    • Samsung Galaxy S23
    • Samsung Galaxy S23 Ultra
    • Samsung Galaxy S23+
    • Samsung Galaxy S24
    • Samsung Galaxy S24 Ultra
    • Samsung Galaxy S24+
    • Samsung Galaxy Tab S8
    • Xiaomi 12
    • Xiaomi 12 Pro

    Supported Mobile Chipsets

    • Snapdragon® 8 Gen 1 Mobile
    • Snapdragon® 8 Gen 2 Mobile
    • Snapdragon® 8 Gen 3 Mobile
    • Snapdragon® 888 Mobile