HomeMobile ModelsMediaPipe-Pose-Estimation


Detect and track human body poses in real-time images and video streams.

The MediaPipe Pose Landmark Detector is a machine learning pipeline that predicts bounding boxes and pose skeletons of poses in an image.

Inference Time
Memory Usage

Technical Details

Input resolution:256x256
Number of parameters (MediaPipePoseDetector):815K
Model size (MediaPipePoseDetector):3.14 MB
Number of parameters (MediaPipePoseLandmarkDetector):3.37M
Model size (MediaPipePoseLandmarkDetector):12.9 MB

Applicable Scenarios

  • Accessibility
  • Augmented Reality
  • ARVR

Supported Mobile Form Factors

  • Phone
  • Tablet


Source Model:APACHE-2.0
Deployable Model:AI Model Hub License


  • real-time
    A “real-time” model can typically achieve 5-60 predictions per second. This translates to latency ranging up to 200 ms per prediction.

Supported Mobile Devices

  • Google Pixel 3
  • Google Pixel 3a
  • Google Pixel 3a XL
  • Google Pixel 4
  • Google Pixel 4a
  • Google Pixel 5a 5G
  • Samsung Galaxy S21
  • Samsung Galaxy S21 Ultra
  • Samsung Galaxy S21+
  • Samsung Galaxy S22 5G
  • Samsung Galaxy S22 Ultra 5G
  • Samsung Galaxy S22+ 5G
  • Samsung Galaxy S23
  • Samsung Galaxy S23 Ultra
  • Samsung Galaxy S23+
  • Samsung Galaxy S24
  • Samsung Galaxy S24 Ultra
  • Samsung Galaxy S24+
  • Samsung Galaxy Tab S8
  • Xiaomi 12
  • Xiaomi 12 Pro

Supported Mobile Chipsets

  • Snapdragon® 8 Gen 1 Mobile
  • Snapdragon® 8 Gen 2 Mobile
  • Snapdragon® 8 Gen 3 Mobile
  • Snapdragon® 888 Mobile