NVIDIA

NVIDIA Canary 180M Flash

NVIDIA Canary 180M Flash is a compact 182M-parameter multilingual encoder-decoder ASR and translation model supporting 4 languages with >1200 RTFx inference speed, designed for mobile and edge deployment.

0.182B paramsDense

View on Hugging Face Source Code Official Page

Model Specifications

Parameters0.182B

ArchitectureDense

ProviderNVIDIA

Download Size737 MB

Community

Monthly Downloads1.2K

Likes98

Last Updated1 years ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

CC-BY-4.0View Full License

Performance & Scoring

Benchmarks

WER

6.9%

Overall Score

74.6AA

Benchmark40%

86.1

Popularity25%

34.7

Efficiency25%

97.8

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

83 devices


Acer Veriton GN100 AI MiniAcer	SS	0.6 GB
AMD Instinct MI300XAMD	SS	0.6 GB
AMD Instinct MI325XAMD	SS	0.6 GB
AMD Instinct MI355XAMD	SS	0.6 GB
AMD Radeon RX 7600 8GBAMD	SS	0.6 GB
AMD Radeon RX 7700 XTAMD	SS	0.6 GB
AMD Radeon RX 7800 XTAMD	SS	0.6 GB
AMD Radeon RX 7900 XTAMD	SS	0.6 GB
AMD Radeon RX 7900 XTXAMD	SS	0.6 GB
AMD Radeon RX 9070AMD	SS	0.6 GB
AMD Radeon RX 9070 XTAMD	SS	0.6 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	0.6 GB
Apple M4Apple	SS	0.6 GB
Apple M4 Max (40-core GPU)Apple	SS	0.6 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	0.6 GB
Apple M5Apple	SS	0.6 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	0.6 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	0.6 GB
Apple Mac Mini (M1, 2020)Apple	SS	0.6 GB
Apple Mac Mini (M2, 2023)Apple	SS	0.6 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	0.6 GB
Apple Mac Mini (M4, 2024)Apple	SS	0.6 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	0.6 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	0.6 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	0.6 GB

Rows per page

Page 1 of 4

About This Model

NVIDIA Canary 180M Flash

Canary-180M-Flash is the smallest member of the NVIDIA NeMo Canary Flash family, with 182 million parameters and inference speed of more than 1200 RTFx on open-asr-leaderboard datasets. It supports ASR in 4 languages (English, German, French, Spanish) and bidirectional translation between English and the other three languages, with optional punctuation and capitalization (PnC). It also offers experimental word-level and segment-level timestamps.

Architecture: Encoder-decoder with FastConformer encoder and Transformer decoder, based on the Canary Flash architecture. Uses a concatenated SentencePiece tokenizer.

Training: Trained using the NVIDIA NeMo framework for 219K steps with 2D bucketing and OOMptimizer on 32 NVIDIA A100 80GB GPUs.

Use cases: On-device speech recognition and translation (e.g., smartphones), real-time translation earbuds, low-latency voice assistants, and applications where privacy or offline use is required. Released under CC-BY-4.0 for commercial use.

Related Models

NVIDIA

Find the best hardware for this model

Use our hardware calculator to find the optimal device for running this model.

0.182B

NVIDIA

NVIDIA Canary 180M Flash

0.182B paramsDense

View on Hugging Face Source Code Official Page

Model Specifications

Parameters0.182B

ArchitectureDense

ProviderNVIDIA

Download Size737 MB

Community

Monthly Downloads1.2K

Likes98

Last Updated1 years ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

CC-BY-4.0View Full License

Performance & Scoring

Benchmarks

WER

6.9%

Overall Score

74.6AA

Benchmark40%

86.1

Popularity25%

34.7

Efficiency25%

97.8

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

83 devices


Acer Veriton GN100 AI MiniAcer	SS	0.6 GB
AMD Instinct MI300XAMD	SS	0.6 GB
AMD Instinct MI325XAMD	SS	0.6 GB
AMD Instinct MI355XAMD	SS	0.6 GB
AMD Radeon RX 7600 8GBAMD	SS	0.6 GB
AMD Radeon RX 7700 XTAMD	SS	0.6 GB
AMD Radeon RX 7800 XTAMD	SS	0.6 GB
AMD Radeon RX 7900 XTAMD	SS	0.6 GB
AMD Radeon RX 7900 XTXAMD	SS	0.6 GB
AMD Radeon RX 9070AMD	SS	0.6 GB
AMD Radeon RX 9070 XTAMD	SS	0.6 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	0.6 GB
Apple M4Apple	SS	0.6 GB
Apple M4 Max (40-core GPU)Apple	SS	0.6 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	0.6 GB
Apple M5Apple	SS	0.6 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	0.6 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	0.6 GB
Apple Mac Mini (M1, 2020)Apple	SS	0.6 GB
Apple Mac Mini (M2, 2023)Apple	SS	0.6 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	0.6 GB
Apple Mac Mini (M4, 2024)Apple	SS	0.6 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	0.6 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	0.6 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	0.6 GB

Rows per page

Page 1 of 4

About This Model

NVIDIA Canary 180M Flash

Architecture: Encoder-decoder with FastConformer encoder and Transformer decoder, based on the Canary Flash architecture. Uses a concatenated SentencePiece tokenizer.

Training: Trained using the NVIDIA NeMo framework for 219K steps with 2D bucketing and OOMptimizer on 32 NVIDIA A100 80GB GPUs.