NVIDIA

NVIDIA Parakeet TDT 0.6B v3

NVIDIA Parakeet TDT 0.6B v3 is a 600M-parameter multilingual ASR model supporting 25 European languages with automatic language detection, offering the highest throughput among multilingual models on the Hugging Face Open ASR leaderboard.

0.6B paramsDense

View on Hugging Face Source Code Official Page

Model Specifications

Parameters0.6B

ArchitectureDense

ProviderNVIDIA

Download Size12.5 GB

Community

Monthly Downloads398.8K

Likes807

Last Updated8 days ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

CC-BY-4.0View Full License

Performance & Scoring

Benchmarks

WER

6.3%

Overall Score

85.6SS

Benchmark40%

87.4

Popularity25%

83.3

Efficiency25%

91.1

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

83 devices


Acer Veriton GN100 AI MiniAcer	SS	0.9 GB
AMD Instinct MI300XAMD	SS	0.9 GB
AMD Instinct MI325XAMD	SS	0.9 GB
AMD Instinct MI355XAMD	SS	0.9 GB
AMD Radeon RX 7600 8GBAMD	SS	0.9 GB
AMD Radeon RX 7700 XTAMD	SS	0.9 GB
AMD Radeon RX 7800 XTAMD	SS	0.9 GB
AMD Radeon RX 7900 XTAMD	SS	0.9 GB
AMD Radeon RX 7900 XTXAMD	SS	0.9 GB
AMD Radeon RX 9070AMD	SS	0.9 GB
AMD Radeon RX 9070 XTAMD	SS	0.9 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	0.9 GB
Apple M4Apple	SS	0.9 GB
Apple M4 Max (40-core GPU)Apple	SS	0.9 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	0.9 GB
Apple M5Apple	SS	0.9 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	0.9 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	0.9 GB
Apple Mac Mini (M1, 2020)Apple	SS	0.9 GB
Apple Mac Mini (M2, 2023)Apple	SS	0.9 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	0.9 GB
Apple Mac Mini (M4, 2024)Apple	SS	0.9 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	0.9 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	0.9 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	0.9 GB

Rows per page

Page 1 of 4

About This Model

NVIDIA Parakeet TDT 0.6B v3

Parakeet-tdt-0.6b-v3 is a 600-million-parameter multilingual ASR model designed for high-throughput speech-to-text transcription. It extends parakeet-tdt-0.6b-v2 by expanding language support from English to 25 European languages and automatically detects the audio language without prompting. It has the highest throughput among multilingual models on the Hugging Face leaderboard.

Architecture: FastConformer encoder with Token-and-Duration Transducer (TDT) decoder. Uses a unified SentencePiece tokenizer with a vocabulary of 8,192 tokens optimized across all 25 languages. Supports long audio transcription — up to 24 minutes with full attention (A100 80GB) or up to 3 hours with local attention.

Training: Initialized from a multilingual CTC checkpoint pretrained on the Granary dataset, then trained for 150,000 steps on 128 A100 GPUs with temperature sampling (0.5) for language balancing. Stage 2 fine-tuning for 5,000 steps on 4 A100 GPUs used ~7,500 hours of human-transcribed data from NeMo ASR Set 3.0.

Languages: English, Spanish, French, German, Italian, Portuguese, Dutch, Russian, Polish, Ukrainian, Slovak, Bulgarian, Finnish, Romanian, Croatian, Czech, Swedish, Estonian, Hungarian, Lithuanian, Danish, Maltese, Slovenian, Latvian, Greek (25 total).

Use cases: Multilingual transcription services, voice assistants, subtitle generation, conversational AI, voice analytics platforms, on-device multilingual ASR. Provides automatic punctuation, capitalization, and word-level timestamps.

Related Models

NVIDIA

Find the best hardware for this model

Use our hardware calculator to find the optimal device for running this model.

0.6B

NVIDIA

NVIDIA Parakeet TDT 0.6B v3

0.6B paramsDense

View on Hugging Face Source Code Official Page

Model Specifications

Parameters0.6B

ArchitectureDense

ProviderNVIDIA

Download Size12.5 GB

Community

Monthly Downloads398.8K

Likes807

Last Updated8 days ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

CC-BY-4.0View Full License

Performance & Scoring

Benchmarks

WER

6.3%

Overall Score

85.6SS

Benchmark40%

87.4

Popularity25%

83.3

Efficiency25%

91.1

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

83 devices


Acer Veriton GN100 AI MiniAcer	SS	0.9 GB
AMD Instinct MI300XAMD	SS	0.9 GB
AMD Instinct MI325XAMD	SS	0.9 GB
AMD Instinct MI355XAMD	SS	0.9 GB
AMD Radeon RX 7600 8GBAMD	SS	0.9 GB
AMD Radeon RX 7700 XTAMD	SS	0.9 GB
AMD Radeon RX 7800 XTAMD	SS	0.9 GB
AMD Radeon RX 7900 XTAMD	SS	0.9 GB
AMD Radeon RX 7900 XTXAMD	SS	0.9 GB
AMD Radeon RX 9070AMD	SS	0.9 GB
AMD Radeon RX 9070 XTAMD	SS	0.9 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	0.9 GB
Apple M4Apple	SS	0.9 GB
Apple M4 Max (40-core GPU)Apple	SS	0.9 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	0.9 GB
Apple M5Apple	SS	0.9 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	0.9 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	0.9 GB
Apple Mac Mini (M1, 2020)Apple	SS	0.9 GB
Apple Mac Mini (M2, 2023)Apple	SS	0.9 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	0.9 GB
Apple Mac Mini (M4, 2024)Apple	SS	0.9 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	0.9 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	0.9 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	0.9 GB