NVIDIA

NVIDIA Canary 1B v2

NVIDIA Canary 1B v2 is a multilingual speech recognition and speech-to-text translation model supporting 25 European languages. It offers quality comparable to models 3× larger while running up to 10× faster.

Model Specifications

Parameters: 0.978B
Architecture: Dense
Provider: NVIDIA
Download Size: 12.7 GB

Community

Monthly Downloads: 167.7K
Likes: 377
Last Updated: 4 months ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

License

CC-BY-4.0

Performance & Scoring

Benchmarks

WER: 7.2%
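
For readers unfamiliar with the metric, word error rate (WER) is the number of word-level substitutions, deletions, and insertions needed to turn the model's transcript into the reference, divided by the number of reference words. The sketch below shows that standard definition only. This page does not say which test sets or text normalization produced the 7.2% figure, so the function name and the example sentences are illustrative assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
example = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"{example:.1%}")
```

Real evaluations usually lowercase the text and strip punctuation before scoring; that step is left out here.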

Overall Score: 71.9 (AA)

Benchmark (40%): 85.7

Popularity (25%): 69.3

Efficiency (25%): 53.3

Versatility (10%): 70.0
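
The page does not state how the overall score is derived. One plausible reading is that the percentage next to each component is a weight and the overall score is the weighted sum. The sketch below checks that assumption against the numbers shown here; the dictionary layout and the function name are illustrative.

```python
# Component scores and weights as listed on this page.
# Treating the percentages as weights is an assumption.
components = {
    "Benchmark": (85.7, 0.40),
    "Popularity": (69.3, 0.25),
    "Efficiency": (53.3, 0.25),
    "Versatility": (70.0, 0.10),
}


def overall_score(parts: dict[str, tuple[float, float]]) -> float:
    """Weighted sum of (score, weight) pairs."""
    return sum(score * weight for score, weight in parts.values())


print(round(overall_score(components), 1))  # 71.9
```

Under this reading the result matches the listed overall score of 71.9, with 34.28 from Benchmark, 17.325 from Popularity, 13.325 from Efficiency, and 7.0 from Versatility.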

Hardware Compatibility

See which devices can run this model and at what quality level.

83 devices (first 25 listed)


Acer Veriton GN100 AI Mini	Acer	SS	1.1 GB
AMD Instinct MI300X	AMD	SS	1.1 GB
AMD Instinct MI325X	AMD	SS	1.1 GB
AMD Instinct MI355X	AMD	SS	1.1 GB
AMD Radeon RX 7600 8GB	AMD	SS	1.1 GB
AMD Radeon RX 7700 XT	AMD	SS	1.1 GB
AMD Radeon RX 7800 XT	AMD	SS	1.1 GB
AMD Radeon RX 7900 XT	AMD	SS	1.1 GB
AMD Radeon RX 7900 XTX	AMD	SS	1.1 GB
AMD Radeon RX 9070	AMD	SS	1.1 GB
AMD Radeon RX 9070 XT	AMD	SS	1.1 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)	Apple	SS	1.1 GB
Apple M4	Apple	SS	1.1 GB
Apple M4 Max (40-core GPU)	Apple	SS	1.1 GB
Apple M4 Pro (14-core CPU, 20-core GPU)	Apple	SS	1.1 GB
Apple M5	Apple	SS	1.1 GB
Apple M5 Max (18-core CPU, 40-core GPU)	Apple	SS	1.1 GB
Apple M5 Pro (18-core CPU, 20-core GPU)	Apple	SS	1.1 GB
Apple Mac Mini (M1, 2020)	Apple	SS	1.1 GB
Apple Mac Mini (M2, 2023)	Apple	SS	1.1 GB
Apple Mac Mini (M2 Pro, 2023)	Apple	SS	1.1 GB
Apple Mac Mini (M4, 2024)	Apple	SS	1.1 GB
Apple Mac Mini (M4 Pro, 2024)	Apple	SS	1.1 GB
Apple Mac Studio (M1 Max, 2022)	Apple	SS	1.1 GB
Apple Mac Studio (M1 Ultra, 2022)	Apple	SS	1.1 GB

About This Model

NVIDIA Canary 1B v2

Canary-1b-v2 is a scaled and enhanced version of the Canary family featuring 978 million parameters, supporting 25 European languages (expanded from 4 in canary-1b/canary-1b-flash). It is the first NeMo model to leverage the full NVIDIA Granary dataset plus NeMo ASR Set 3.0, demonstrating multitask (ASR + speech-to-text translation) and multilingual capabilities. It offers quality comparable to models 3× larger while running up to 10× faster.

Architecture: Encoder-decoder with FastConformer encoder (32 layers) and Transformer decoder (8 layers), 978M parameters. Uses a unified SentencePiece tokenizer with a vocabulary of 16,384 tokens optimized across all 25 supported languages.
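
As a rough guide to what 978M parameters means for memory, the weights alone occupy the parameter count times the bytes per parameter. The sketch below works through that arithmetic for a few common numeric precisions. It counts weights only (no activations, decoder cache, or framework overhead), and it is not an attempt to explain the download size or per-device figures listed elsewhere on this page. The names and the choice of precisions are illustrative assumptions.

```python
PARAMS = 978_000_000  # parameter count stated above

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_footprint_gb(num_params: int, dtype: str) -> float:
    """Weights-only size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_footprint_gb(PARAMS, dtype):.3f} GB")
```

At 16-bit precision this comes to about 1.96 GB of weights, and about 3.91 GB at 32-bit.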

Training: Trained on the Granary dataset (improved pseudo-labels and filtered corpora) combined with NeMo ASR Set 3.0 human-labeled data. All transcripts include punctuation and capitalization.

Languages: Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Russian, Ukrainian.

Features: Automatic punctuation and capitalization, word-level and segment-level timestamps, dynamic chunking for long-form transcription, and robustness to noisy audio. At release, it topped the Hugging Face multilingual open-ASR leaderboard.
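
To illustrate the general idea behind chunked long-form transcription, the sketch below splits a long recording into fixed-length windows that overlap slightly, so that words cut at a boundary appear whole in at least one window. This is a simplified stand-in, not NeMo's dynamic chunking: the function name, the 40-second window, and the 1-second overlap are all hypothetical values chosen for the example.

```python
def chunk_spans(
    duration_s: float, chunk_s: float = 40.0, overlap_s: float = 1.0
) -> list[tuple[float, float]]:
    """Return (start, end) times in seconds covering the whole recording."""
    if chunk_s <= overlap_s:
        raise ValueError("chunk_s must be larger than overlap_s")

    spans = []
    start = 0.0
    step = chunk_s - overlap_s  # each window starts before the previous one ends
    while start < duration_s:
        end = min(start + chunk_s, duration_s)
        spans.append((start, end))
        if end >= duration_s:
            break
        start += step
    return spans


# A 100-second recording becomes three overlapping windows.
print(chunk_spans(100.0))  # [(0.0, 40.0), (39.0, 79.0), (78.0, 100.0)]
```

Each window would then be transcribed separately and the overlapping text merged; a dynamic scheme would choose boundaries adaptively rather than on a fixed grid.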
