Video-MAE
Sports and human action recognition in videos.
Video MAE (Masked Auto Encoder) is a network for doing video classification that uses the ViT (Vision Transformer) backbone.
Not supported
This model is currently not supported on any Automotive chipset.
To see performance metrics for this model on other chipsets, click the button below.
View for other chipsetsTechnical Details
Model checkpoint:Kinectics-400
Input resolution:224x224
Number of parameters:87.7M
Model size (float):335 MB
Applicable Scenarios
- Camera
- Action Recognition
License
Model:CC-BY-4.0
Tags
- backbone
Supported Automotive Devices
- SA7255P ADP
- SA8255P ADP
- SA8295P ADP
- SA8650P ADP
- SA8775P ADP
Supported Automotive Chipsets
- Qualcomm® SA7255P
- Qualcomm® SA8295P
- Qualcomm® SA8775P
Related Models
See all modelsLooking for more? See models created by industry leaders.
Discover Model Makers









