HuggingFace-WavLM-Base-Plus
Real-time Speech processing.
HuggingFaceWavLMBasePlus is a real time speech processing backbone based on Microsoft's WavLM model.
Technical Details
Model checkpoint:wavlm-libri-clean-100h-base-plus
Input resolution:1x320000
Number of parameters:95.1M
Model size:363 MB
Applicable Scenarios
- Smart Home
- Accessibility
Licenses
Source Model:MIT
Deployable Model:AI Model Hub License
Tags
- backboneA “backbone” model is designed to extract task-agnostic representations from specific data modalities (e.g., images, text, speech). This representation can then be fine-tuned for specialized tasks.
Supported Automotive Devices
- SA8255 (Proxy)
- SA8650 (Proxy)
- SA8775 (Proxy)
Supported Automotive Chipsets
- Qualcomm® SA8255P
- Qualcomm® SA8650P
- Qualcomm® SA8775P