MediaPipe-Pose-Estimation

Detect and track human face, hand, and torso in real‑time images and video streams.

The MediaPipe Pose Landmark Detector is a machine learning pipeline that predicts bounding boxes and pose skeletons of the face, hands, and torso in an image.

Model Repository Hugging Face Research Paper

Technical Details

Input resolution:256x256

Number of parameters (PoseDetector):815K

Model size (PoseDetector) (float):3.14 MB

Number of parameters (PoseLandmarkDetector):3.36M

Model size (PoseLandmarkDetector) (float):12.9 MB

Applicable Scenarios

Accessibility
Augmented Reality
ARVR

Supported Form Factors

Phone
Tablet
IoT

License

Model:APACHE-2.0

Supported Devices

Dragonwing IQ-9075 EVK
Dragonwing Q-6690 MTP
Dragonwing RB3 Gen 2 Vision Kit
QCS8275 (Proxy)
QCS8450 (Proxy)
QCS8550 (Proxy)
SA7255P ADP
SA8295P ADP
SA8775P ADP
Samsung Galaxy S21
Samsung Galaxy S21 Ultra
Samsung Galaxy S21+
Samsung Galaxy S22 5G
Samsung Galaxy S22 Ultra 5G
Samsung Galaxy S22+ 5G
Samsung Galaxy S23
Samsung Galaxy S23 Ultra
Samsung Galaxy S23+
Samsung Galaxy S24
Samsung Galaxy S24 Ultra
Samsung Galaxy S24+
Samsung Galaxy S25
Samsung Galaxy S25 Ultra
Samsung Galaxy S25+
Samsung Galaxy Tab S8
Snapdragon 7 Gen 4 QRD
Snapdragon 8 Elite Gen 5 QRD
Snapdragon X Elite CRD
Snapdragon X Plus 8-Core CRD
Snapdragon X2 Elite CRD
Xiaomi 12
Xiaomi 12 Pro
XR2 Gen 2 (Proxy)

Supported Chipsets

Qualcomm® QCM6690
Qualcomm® QCS6490
Qualcomm® QCS8275 (Proxy)
Qualcomm® QCS8550 (Proxy)
Qualcomm® QCS9075
Qualcomm® SA7255P
Qualcomm® SA8295P
Qualcomm® SA8775P
Snapdragon® 7 Gen 4 Mobile
Snapdragon® 8 Elite Mobile
Snapdragon® 8 Elite Gen 5 Mobile
Snapdragon® 8 Gen 1 Mobile
Snapdragon® 8 Gen 2 Mobile
Snapdragon® 8 Gen 3 Mobile
Snapdragon® 888 Mobile
Snapdragon® X Elite
Snapdragon® X Plus 8-Core
Snapdragon® X2 Elite

Related Models

See all models

MediaPipe-Hand-Detection

Real-time hand detection optimized for mobile and edge.

MediaPipe-Face-Detection

Detect faces and locate facial features in real-time video and image streams.

MediaPipe-Selfie-Segmentation

Segments the person from background in a selfie image and realtime background segmentation in video conferencing.

Looking for more? See models created by industry leaders.

Discover Model Makers

By Industry

By Model Maker

Models from Tech Mahindra now available for purchase on AI Hub

Models from G42 now available for purchase on AI Hub

Sample Apps By Use Cases

Walk through deploying an AI model on device

Read our getting started guide and learn how to use Qualcomm AI Hub

Model Makers

Collaborators

Build AI-powered vision models and integrate them seamlessly with AI Hub

Learn about the collaboration between Amazon SageMaker and AI Hub

Communication

Code

Get help, share stories, and hear announcements on our Slack channel

Visit Qualcomm's organization card on Hugging Face

Learn

Discover