NVIDIA

NVIDIA Parakeet RNNT 1.1B

NVIDIA Parakeet RNNT 1.1B is an XXL FastConformer RNN-Transducer English ASR model jointly developed by NVIDIA NeMo and Suno.ai, offering strong accuracy and streaming-capable inference.

1.1B paramsDense

View on Hugging Face Source Code Official Page

Model Specifications

Parameters1.1B

ArchitectureDense

ProviderNVIDIA

Download Size4.3 GB

Community

Monthly Downloads818

Likes167

Last Updated4 months ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

CC-BY-4.0View Full License

Performance & Scoring

Benchmarks

WER

7.1%

Overall Score

60.7BB

Benchmark40%

85.8

Popularity25%

35.3

Efficiency25%

42.2

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

83 devices


Acer Veriton GN100 AI MiniAcer	SS	1.2 GB
AMD Instinct MI300XAMD	SS	1.2 GB
AMD Instinct MI325XAMD	SS	1.2 GB
AMD Instinct MI355XAMD	SS	1.2 GB
AMD Radeon RX 7600 8GBAMD	SS	1.2 GB
AMD Radeon RX 7700 XTAMD	SS	1.2 GB
AMD Radeon RX 7800 XTAMD	SS	1.2 GB
AMD Radeon RX 7900 XTAMD	SS	1.2 GB
AMD Radeon RX 7900 XTXAMD	SS	1.2 GB
AMD Radeon RX 9070AMD	SS	1.2 GB
AMD Radeon RX 9070 XTAMD	SS	1.2 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	1.2 GB
Apple M4Apple	SS	1.2 GB
Apple M4 Max (40-core GPU)Apple	SS	1.2 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	1.2 GB
Apple M5Apple	SS	1.2 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	1.2 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	1.2 GB
Apple Mac Mini (M1, 2020)Apple	SS	1.2 GB
Apple Mac Mini (M2, 2023)Apple	SS	1.2 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	1.2 GB
Apple Mac Mini (M4, 2024)Apple	SS	1.2 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	1.2 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	1.2 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	1.2 GB

Rows per page

Page 1 of 4

About This Model

NVIDIA Parakeet RNNT 1.1B

Parakeet-RNNT-1.1B is an ASR model that transcribes speech in lower-case English alphabet. Jointly developed by NVIDIA NeMo and Suno.ai, it is an XXL version of the FastConformer Transducer (~1.1B parameters). At release in early 2024, it (along with Parakeet CTC) topped the Hugging Face Open ASR Leaderboard, surpassing Whisper.

Architecture: FastConformer encoder (an optimized Conformer with 8x depthwise-separable convolutional downsampling) with an RNN-Transducer (RNNT) decoder trained with transducer loss in a multitask setup. Supports streaming inference.

Training: Trained using the NVIDIA NeMo toolkit for several hundred epochs on a large multi-domain English corpus (LibriSpeech, Fisher, Switchboard, WSJ-0/1, Common Voice 8.0, National Singapore Corpus 1 & 6, VCTK, VoxPopuli, Europarl, Multilingual LibriSpeech, People's Speech) plus proprietary data.

Use cases: Streaming English ASR, voice assistants, call-center transcription, captioning, and as a base for fine-tuning. Accepts 16 kHz mono-channel audio (WAV) as input.

Related Models

NVIDIA

Find the best hardware for this model

Use our hardware calculator to find the optimal device for running this model.

1.1B

NVIDIA

NVIDIA Parakeet RNNT 1.1B

NVIDIA Parakeet RNNT 1.1B is an XXL FastConformer RNN-Transducer English ASR model jointly developed by NVIDIA NeMo and Suno.ai, offering strong accuracy and streaming-capable inference.

1.1B paramsDense

View on Hugging Face Source Code Official Page

Model Specifications

Parameters1.1B

ArchitectureDense

ProviderNVIDIA

Download Size4.3 GB

Community

Monthly Downloads818

Likes167

Last Updated4 months ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

CC-BY-4.0View Full License

Performance & Scoring

Benchmarks

WER

7.1%

Overall Score

60.7BB

Benchmark40%

85.8

Popularity25%

35.3

Efficiency25%

42.2

Versatility10%

70.0

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

83 devices


Acer Veriton GN100 AI MiniAcer	SS	1.2 GB
AMD Instinct MI300XAMD	SS	1.2 GB
AMD Instinct MI325XAMD	SS	1.2 GB
AMD Instinct MI355XAMD	SS	1.2 GB
AMD Radeon RX 7600 8GBAMD	SS	1.2 GB
AMD Radeon RX 7700 XTAMD	SS	1.2 GB
AMD Radeon RX 7800 XTAMD	SS	1.2 GB
AMD Radeon RX 7900 XTAMD	SS	1.2 GB
AMD Radeon RX 7900 XTXAMD	SS	1.2 GB
AMD Radeon RX 9070AMD	SS	1.2 GB
AMD Radeon RX 9070 XTAMD	SS	1.2 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	1.2 GB
Apple M4Apple	SS	1.2 GB
Apple M4 Max (40-core GPU)Apple	SS	1.2 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	1.2 GB
Apple M5Apple	SS	1.2 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	1.2 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	1.2 GB
Apple Mac Mini (M1, 2020)Apple	SS	1.2 GB
Apple Mac Mini (M2, 2023)Apple	SS	1.2 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	1.2 GB
Apple Mac Mini (M4, 2024)Apple	SS	1.2 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	1.2 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	1.2 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	1.2 GB

Rows per page

Page 1 of 4

About This Model

NVIDIA Parakeet RNNT 1.1B

Use cases: Streaming English ASR, voice assistants, call-center transcription, captioning, and as a base for fine-tuning. Accepts 16 kHz mono-channel audio (WAV) as input.