Distil-Whisper

Distilled Whisper model for fast English automatic speech recognition (ASR).

Distil‑Whisper Small English is a distilled version of Whisper Small, optimized for fast and efficient automatic speech recognition.

Technical Details

Model checkpoint:distil-whisper/distil-small.en
Input resolution:80x3000 (30 seconds audio)
Max decoded sequence length:200 tokens
Number of parameters (encoder):166M
Model size (encoder) (float):332 MB
Number of parameters (decoder):211M
Model size (decoder) (float):450MB

Applicable Scenarios

  • Smart Home
  • Accessibility
  • Real-time Transcription

License

Model:MIT

Supported Compute Devices

  • Snapdragon X Elite CRD
  • Snapdragon X Plus 8-Core CRD
  • Snapdragon X2 Elite CRD

Supported Compute Chipsets

  • Snapdragon® X Elite
  • Snapdragon® X Plus 8-Core
  • Snapdragon® X2 Elite

Related Models

See all models

Looking for more? See models created by industry leaders.

Discover Model Makers