Segment-Anything-Model

Generates high-quality segmentation masks around any object in an image from a simple input prompt.

A transformer-based encoder-decoder in which prompts specify what to segment in an image, allowing segmentation without additional training. The image encoder generates embeddings, and the lightweight decoder operates on those embeddings for point- and mask-based image segmentation.
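
The split between a heavy image encoder and a lightweight decoder suggests a usage pattern: encode an image once, then decode as many prompts as needed against the cached embeddings. The following is a minimal sketch of that pattern only. `toy_encode`, `toy_decode`, and `PromptedSegmenter` are hypothetical stand-ins invented for illustration; they are not the model's API and perform no real segmentation.

```python
from typing import Callable, List, Optional, Tuple

Image = List[List[int]]   # toy image: a 2D grid of integer region labels
Point = Tuple[int, int]   # point prompt as (x, y)


def toy_encode(image: Image) -> Image:
    """Hypothetical stand-in for the image encoder: copies the grid."""
    return [row[:] for row in image]


def toy_decode(embedding: Image, point: Point) -> Image:
    """Hypothetical stand-in for the lightweight decoder.

    Returns a binary mask selecting every cell that shares the value
    found at the prompted point.
    """
    x, y = point
    target = embedding[y][x]
    return [[1 if value == target else 0 for value in row] for row in embedding]


class PromptedSegmenter:
    """Runs the encoder once per image and the decoder once per prompt."""

    def __init__(self, encode: Callable[[Image], Image],
                 decode: Callable[[Image, Point], Image]) -> None:
        self._encode = encode
        self._decode = decode
        self._embedding: Optional[Image] = None
        self.encoder_calls = 0

    def set_image(self, image: Image) -> None:
        # The expensive step: compute and cache the embeddings.
        self._embedding = self._encode(image)
        self.encoder_calls += 1

    def segment(self, point: Point) -> Image:
        # The cheap step: reuse the cached embeddings for each new prompt.
        if self._embedding is None:
            raise RuntimeError("call set_image() before segment()")
        return self._decode(self._embedding, point)


image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 2, 2],
]
segmenter = PromptedSegmenter(toy_encode, toy_decode)
segmenter.set_image(image)
mask_a = segmenter.segment((2, 0))  # prompt inside the region labeled 1
mask_b = segmenter.segment((0, 2))  # prompt inside the region labeled 2
```

Both prompts are answered from a single encoder pass, which is the reason a small decoder makes interactive prompting practical.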

Inference Time: 34.8 ms
Memory Usage: 2-239 MB
Layers: 342 NPU

Technical Details

Model checkpoint: vit_l
Input resolution: 720p (720x1280)
Number of parameters (SAMDecoder): 5.11M
Model size (SAMDecoder): 19.6 MB
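
As a rough sanity check, the decoder's parameter count and model size can be related with simple arithmetic. The sketch below assumes 32-bit floating-point weights (4 bytes per parameter) and binary megabytes; neither assumption is stated on this page, and `weights_size_mib` is a hypothetical helper.

```python
def weights_size_mib(num_params: float, bytes_per_param: int = 4) -> float:
    """Estimate raw weight storage in MiB (assumes float32 by default)."""
    return num_params * bytes_per_param / (1024 ** 2)


decoder_params = 5.11e6  # SAMDecoder parameter count listed above
size_fp32 = weights_size_mib(decoder_params)
print(f"{size_fp32:.1f} MiB")  # prints "19.5 MiB"
```

Under these assumptions the estimate lands close to the listed 19.6 MB; the small gap may come from the rounded parameter count and from non-weight data stored in the model file.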

Applicable Scenarios

  • Factory Automation
  • Robotic Navigation
  • Camera

Supported Form Factors

  • Phone
  • Tablet

Licenses

Source Model: APACHE-2.0
Deployable Model: AI Model Hub License

Tags

  • foundation
    A “foundation” model is versatile and designed for multi-task capabilities, without the need for fine-tuning.

Supported Devices

  • Google Pixel 3
  • Google Pixel 3a
  • Google Pixel 3a XL
  • Google Pixel 4
  • Google Pixel 4a
  • Google Pixel 5a 5G
  • QCS8550 (Proxy)
  • Samsung Galaxy S21
  • Samsung Galaxy S21 Ultra
  • Samsung Galaxy S21+
  • Samsung Galaxy S22 5G
  • Samsung Galaxy S22 Ultra 5G
  • Samsung Galaxy S22+ 5G
  • Samsung Galaxy S23
  • Samsung Galaxy S23 Ultra
  • Samsung Galaxy S23+
  • Samsung Galaxy S24
  • Samsung Galaxy S24 Ultra
  • Samsung Galaxy S24+
  • Samsung Galaxy Tab S8
  • Xiaomi 12
  • Xiaomi 12 Pro

Supported Chipsets

  • Qualcomm® QCS8550
  • Snapdragon® 8 Gen 1 Mobile
  • Snapdragon® 8 Gen 2 Mobile
  • Snapdragon® 8 Gen 3 Mobile
  • Snapdragon® 888 Mobile
  • Snapdragon® X Elite