ResNet-2Plus1D

Sports and human action recognition in videos.

ResNet (2+1)D is a video network that explicitly factorizes each 3D convolution into two separate, successive operations: a 2D spatial convolution followed by a 1D temporal convolution. It is used for video understanding applications.
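To make the factorization concrete, the following minimal sketch compares the weight count of a full t x d x d 3D convolution with its (2+1)D replacement: a 1 x d x d spatial convolution into M intermediate channels, then a t x 1 x 1 temporal convolution. The rule for choosing M is the parameter-matching formula from the original R(2+1)D paper; whether this particular checkpoint uses exactly that rule is an assumption, and the function names and the 64-channel example are illustrative only.

```python
def conv3d_params(n_in, n_out, t, d):
    """Weights in a full t x d x d 3D convolution (bias ignored)."""
    return n_in * n_out * t * d * d


def mid_channels(n_in, n_out, t, d):
    """Intermediate channels M chosen so the (2+1)D block has roughly
    the same number of weights as the 3D convolution it replaces.
    Assumption: parameter-matching rule from the R(2+1)D paper."""
    return (t * d * d * n_in * n_out) // (d * d * n_in + t * n_out)


def conv2plus1d_params(n_in, n_out, t, d):
    """Weights in the factorized block: spatial conv, then temporal conv."""
    m = mid_channels(n_in, n_out, t, d)
    spatial = n_in * m * d * d   # 1 x d x d kernel: n_in -> m channels
    temporal = m * n_out * t     # t x 1 x 1 kernel: m -> n_out channels
    return spatial + temporal


if __name__ == "__main__":
    # Hypothetical layer: 64 -> 64 channels, 3 x 3 x 3 kernel.
    n_in, n_out, t, d = 64, 64, 3, 3
    print("3D conv weights:    ", conv3d_params(n_in, n_out, t, d))
    print("intermediate M:     ", mid_channels(n_in, n_out, t, d))
    print("(2+1)D conv weights:", conv2plus1d_params(n_in, n_out, t, d))
```

Under this rule the factorized block keeps the parameter budget of the 3D convolution while placing an extra nonlinearity between the spatial and temporal steps.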

Not supported

This model is currently not supported on any Mobile chipset.

Technical Details

Model checkpoint: Kinetics-400
Input resolution: 112x112
Number of parameters: 31.5M
Model size (float): 120 MB
Model size (w8a8): 30.8 MB
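As a rough sanity check on the figures above, the sketch below estimates the weight storage from the parameter count alone. It assumes 4 bytes per parameter for the float model, 1 byte per parameter for w8a8 weights, and that "MB" on this page means 2^20 bytes; none of these are stated on the page, and the estimate ignores file-format overhead and quantization metadata, so small differences from the listed sizes are expected.

```python
NUM_PARAMS = 31.5e6      # "Number of parameters: 31.5M" from the list above
BYTES_PER_MIB = 2 ** 20  # assumption: the page's "MB" means 2^20 bytes


def weight_size_mib(num_params, bytes_per_param):
    """Approximate storage for the weights alone, in MiB."""
    return num_params * bytes_per_param / BYTES_PER_MIB


float_mib = weight_size_mib(NUM_PARAMS, 4)  # assumption: 32-bit float weights
w8_mib = weight_size_mib(NUM_PARAMS, 1)     # assumption: 8-bit integer weights

if __name__ == "__main__":
    print(f"float estimate: {float_mib:.1f} MiB")
    print(f"w8a8 estimate:  {w8_mib:.1f} MiB")
```

The float estimate lands close to the listed 120 MB; the 8-bit estimate comes out somewhat below the listed 30.8 MB, which would be consistent with extra data stored alongside the quantized weights, though the page does not say what accounts for the gap.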

Applicable Scenarios

  • Camera
  • Action Recognition

Supported Mobile Form Factors

  • Phone
  • Tablet

Tags

  • backbone

Supported Mobile Devices

  • Samsung Galaxy S21
  • Samsung Galaxy S21 Ultra
  • Samsung Galaxy S22 5G
  • Samsung Galaxy S22 Ultra 5G
  • Samsung Galaxy S22+ 5G
  • Samsung Galaxy S23
  • Samsung Galaxy S23 Ultra
  • Samsung Galaxy S23+
  • Samsung Galaxy S24
  • Samsung Galaxy S24 Ultra
  • Samsung Galaxy S24+
  • Samsung Galaxy S25
  • Samsung Galaxy S25 Ultra
  • Samsung Galaxy S25+
  • Samsung Galaxy Tab S8
  • Snapdragon 7 Gen 4 QRD
  • Snapdragon 8 Elite Gen 5 QRD
  • Xiaomi 12
  • Xiaomi 12 Pro

Supported Mobile Chipsets

  • Snapdragon® 7 Gen 4 Mobile
  • Snapdragon® 8 Elite Mobile
  • Snapdragon® 8 Elite Gen 5 Mobile
  • Snapdragon® 8 Gen 1 Mobile
  • Snapdragon® 8 Gen 2 Mobile
  • Snapdragon® 8 Gen 3 Mobile
  • Snapdragon® 888 Mobile
