HuggingFace-WavLM-Base-Plus
Real-time Speech processing.
HuggingFaceWavLMBasePlus is a real time speech processing backbone based on Microsoft's WavLM model.
Not supported
This model is currently not supported on any Compute chipset.
To see performance metrics for this model on other chipsets, click the button below.
View for other chipsetsTechnical Details
Model checkpoint:wavlm-libri-clean-100h-base-plus
Input resolution:1x320000
Number of parameters:95.1M
Model size:363 MB
Applicable Scenarios
- Smart Home
- Accessibility
Licenses
Source Model:MIT
Deployable Model:AI Model Hub License
Tags
- backboneA “backbone” model is designed to extract task-agnostic representations from specific data modalities (e.g., images, text, speech). This representation can then be fine-tuned for specialized tasks.
Supported Compute Chipsets
- Snapdragon® X Elite