Hugging Face

Distil-Whisper Large v3.5

A knowledge-distilled, English-only version of OpenAI Whisper Large v3 from Hugging Face. Trained on 98k hours with a 'patient' teacher and SpecAugment, it runs ~1.5× faster than Whisper Large v3 Turbo while matching accuracy.

0.8B paramsDense

View on Hugging Face Source Code Official Page

Our Take

Best for: Open-source asr workloads

A solid 0.8B-parameter dense audio model from Hugging Face. Treat the modality benchmarks above as the leading indicator of fit — composite scoring across modalities is still maturing.

Generated from this model’s benchmarks and ranking signals. Editor reviews refine it over time.

Model Specifications

Parameters0.8B

ArchitectureDense

ProviderHugging Face

Download Size6.1 GB

Community

Monthly Downloads57.3K

Likes89

Last Updated29 days ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

MITView Full License

Performance & Scoring

Benchmarks

WER

7.2%

Overall Score

68.9BB

Benchmark40%

85.6

Popularity25%

48.7

Efficiency25%

62.2

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

102 devices


ACEMAGIC M1A Pro (i9-13900HK + ARC A770)ACEMAGIC	SS	1.0 GB
Acer Veriton GN100 AI MiniAcer	SS	1.0 GB
AMD Instinct MI300XAMD	SS	1.0 GB
AMD Instinct MI325XAMD	SS	1.0 GB
AMD Instinct MI355XAMD	SS	1.0 GB
AMD Radeon RX 7600 8GBAMD	SS	1.0 GB
AMD Radeon RX 7700 XTAMD	SS	1.0 GB
AMD Radeon RX 7800 XTAMD	SS	1.0 GB
AMD Radeon RX 7900 XTAMD	SS	1.0 GB
AMD Radeon RX 7900 XTXAMD	SS	1.0 GB
AMD Radeon RX 9070AMD	SS	1.0 GB
AMD Radeon RX 9070 XTAMD	SS	1.0 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	1.0 GB
Apple M4Apple	SS	1.0 GB
Apple M4 Max (40-core GPU)Apple	SS	1.0 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	1.0 GB
Apple M5Apple	SS	1.0 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	1.0 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	1.0 GB
Apple Mac Mini (M1, 2020)Apple	SS	1.0 GB
Apple Mac Mini (M2, 2023)Apple	SS	1.0 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	1.0 GB
Apple Mac Mini (M4, 2024)Apple	SS	1.0 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	1.0 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	1.0 GB

Rows per page

Page 1 of 5

About This Model

Distil-Whisper Large v3.5

Distil-Whisper is Hugging Face's knowledge-distilled version of Whisper, introduced in Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling (Gandhi, von Platen & Rush, 2023). The v3.5 release is the latest English checkpoint in the family.

Architecture

Encoder–decoder Transformer, encoder copied & frozen from Whisper Large v3
Only 2 decoder layers (vs 32 in Whisper Large v3)
Trained on 98k hours of diverse public speech (4× more than prior Distil-Whisper), pseudo-labelled by Whisper Large v3
'Patient' teacher with extended schedule + aggressive SpecAugment

What makes it distinctive

Drop-in replacement for Whisper Large v3 on English audio
~1.5× faster than Whisper-Large-v3-Turbo; ~2× faster than Whisper-Large-v3
Supports both chunked and OpenAI sequential long-form transcription algorithms
Works as a perfect speculative decoding draft model for Whisper Large v3
Compatible with Whisper.cpp, Faster-Whisper, Transformers, Transformers.js, MLX

Use cases

Production English transcription, on-device / browser ASR via ONNX, speculative decoding to accelerate Whisper, long-form podcast/meeting transcription.

Related Models

Hugging Face

Parler-TTS Large v1

2.2BDense

2.2B

Hugging Face

Parler-TTS Mini v1

0.88BDense

0.88B

Find the Best Hardware for This Model

Use our hardware calculator to find the optimal device for running this model.

0.8B

Hugging Face

Distil-Whisper Large v3.5

0.8B paramsDense

View on Hugging Face Source Code Official Page

Our Take

Best for: Open-source asr workloads

A solid 0.8B-parameter dense audio model from Hugging Face. Treat the modality benchmarks above as the leading indicator of fit — composite scoring across modalities is still maturing.

Generated from this model’s benchmarks and ranking signals. Editor reviews refine it over time.

Model Specifications

Parameters0.8B

ArchitectureDense

ProviderHugging Face

Download Size6.1 GB

Community

Monthly Downloads57.3K

Likes89

Last Updated29 days ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

MITView Full License

Performance & Scoring

Benchmarks

WER

7.2%

Overall Score

68.9BB

Benchmark40%

85.6

Popularity25%

48.7

Efficiency25%

62.2

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

102 devices


ACEMAGIC M1A Pro (i9-13900HK + ARC A770)ACEMAGIC	SS	1.0 GB
Acer Veriton GN100 AI MiniAcer	SS	1.0 GB
AMD Instinct MI300XAMD	SS	1.0 GB
AMD Instinct MI325XAMD	SS	1.0 GB
AMD Instinct MI355XAMD	SS	1.0 GB
AMD Radeon RX 7600 8GBAMD	SS	1.0 GB
AMD Radeon RX 7700 XTAMD	SS	1.0 GB
AMD Radeon RX 7800 XTAMD	SS	1.0 GB
AMD Radeon RX 7900 XTAMD	SS	1.0 GB
AMD Radeon RX 7900 XTXAMD	SS	1.0 GB
AMD Radeon RX 9070AMD	SS	1.0 GB
AMD Radeon RX 9070 XTAMD	SS	1.0 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	1.0 GB
Apple M4Apple	SS	1.0 GB
Apple M4 Max (40-core GPU)Apple	SS	1.0 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	1.0 GB
Apple M5Apple	SS	1.0 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	1.0 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	1.0 GB
Apple Mac Mini (M1, 2020)Apple	SS	1.0 GB
Apple Mac Mini (M2, 2023)Apple	SS	1.0 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	1.0 GB
Apple Mac Mini (M4, 2024)Apple	SS	1.0 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	1.0 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	1.0 GB

Rows per page

Page 1 of 5

About This Model

Distil-Whisper Large v3.5

Architecture

Encoder–decoder Transformer, encoder copied & frozen from Whisper Large v3
Only 2 decoder layers (vs 32 in Whisper Large v3)
Trained on 98k hours of diverse public speech (4× more than prior Distil-Whisper), pseudo-labelled by Whisper Large v3
'Patient' teacher with extended schedule + aggressive SpecAugment

What makes it distinctive

Drop-in replacement for Whisper Large v3 on English audio
~1.5× faster than Whisper-Large-v3-Turbo; ~2× faster than Whisper-Large-v3
Supports both chunked and OpenAI sequential long-form transcription algorithms
Works as a perfect speculative decoding draft model for Whisper Large v3
Compatible with Whisper.cpp, Faster-Whisper, Transformers, Transformers.js, MLX

Use cases

Production English transcription, on-device / browser ASR via ONNX, speculative decoding to accelerate Whisper, long-form podcast/meeting transcription.

Related Models

Hugging Face

Parler-TTS Large v1

2.2BDense

2.2B

Hugging Face

Parler-TTS Mini v1

0.88BDense

0.88B

Find the Best Hardware for This Model

Use our hardware calculator to find the optimal device for running this model.