Model Card for Lite-Whisper large-v3-fast

Lite-Whisper is a compressed version of OpenAI Whisper with LiteASR. See our GitHub repository and paper for details. The paper is also available on Hugging Face: Link to Hugging Face Paper Page

Benchmark Results

Following is the average word error rate (WER) evaluated on the ESB datasets:\

Model Average WER (↓) Encoder Size Decoder Size
whisper-large-v3 10.1 635M 907M
lite-whisper-large-v3-acc 10.1 429M 907M
lite-whisper-large-v3 10.2 377M 907M
lite-whisper-large-v3-fast 11.3 308M 907M
       
whisper-large-v3-turbo 10.1 635M 172M
lite-whisper-large-v3-turbo-acc 10.2 421M 172M
lite-whisper-large-v3-turbo 12.6 374M 172M
lite-whisper-large-v3-turbo-fast 20.1 313M 172M
       
whisper-medium 14.8 306M 457M
Downloads last month
42
Safetensors
Model size
1.28B params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support model that require custom code execution.

Model tree for efficient-speech/lite-whisper-large-v3-fast

Finetuned
(417)
this model
Quantizations
1 model

Collection including efficient-speech/lite-whisper-large-v3-fast