HomeAll ModelsSegment-Anything-Model


High-quality segmentation mask generation around any object in an image with simple input prompt.

Transformer based encoder-decoder where prompts specify what to segment in an image thereby allowing segmentation without the need for additional training. The image encoder generates embeddings and the lightweight decoder operates on the embeddings for point and mask based image segmentation.

Inference Time
Memory Usage

Technical Details

Model checkpoint:vit_l
Input resolution:720p (720x1280)
Number of parameters (SAMDecoder):5.11M
Model size (SAMDecoder):19.6 MB

Applicable Scenarios

  • Factory Automation
  • Robotic Navigation
  • Camera

Supported Form Factors

  • Phone
  • Tablet


Source Model:APACHE-2.0
Deployable Model:AI Model Hub License


  • foundation
    A “foundation” model is versatile and designed for multi-task capabilities, without the need for fine-tuning.

Supported Devices

  • Google Pixel 3
  • Google Pixel 3a
  • Google Pixel 3a XL
  • Google Pixel 4
  • Google Pixel 4a
  • Google Pixel 5a 5G
  • QCS8550 (Proxy)
  • Samsung Galaxy S21
  • Samsung Galaxy S21 Ultra
  • Samsung Galaxy S21+
  • Samsung Galaxy S22 5G
  • Samsung Galaxy S22 Ultra 5G
  • Samsung Galaxy S22+ 5G
  • Samsung Galaxy S23
  • Samsung Galaxy S23 Ultra
  • Samsung Galaxy S23+
  • Samsung Galaxy S24
  • Samsung Galaxy S24 Ultra
  • Samsung Galaxy S24+
  • Samsung Galaxy Tab S8
  • Xiaomi 12
  • Xiaomi 12 Pro

Supported Chipsets

  • Qualcomm® QCS8550
  • Snapdragon® 8 Gen 1 Mobile
  • Snapdragon® 8 Gen 2 Mobile
  • Snapdragon® 8 Gen 3 Mobile
  • Snapdragon® 888 Mobile
  • Snapdragon® X Elite