NVIDIA Parakeet CTC 1.1B is an XXL FastConformer-CTC English ASR model jointly developed by NVIDIA NeMo and Suno.ai, offering strong speech recognition accuracy with efficient non-autoregressive inference.
Parakeet-CTC-1.1B is an ASR model that transcribes speech into lower-case English text. Jointly developed by NVIDIA NeMo and Suno.ai, it is an XXL version of the FastConformer CTC architecture (~1.1B parameters).
Architecture: FastConformer encoder (an optimized Conformer with 8x depthwise-separable convolutional downsampling) paired with a linear CTC decoder. Because CTC decoding is non-autoregressive, inference is efficient. The model can also be run natively via 🤗 Transformers (ParakeetForCTC).
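A minimal transcription sketch via the NeMo toolkit, the primary distribution path for this checkpoint (assumptions: `nemo_toolkit[asr]` is installed, and `sample.wav` is a placeholder path to a 16 kHz mono WAV file):

```python
# Minimal NeMo inference sketch for Parakeet-CTC-1.1B.
# Assumes: pip install "nemo_toolkit[asr]"; "sample.wav" is a placeholder
# 16 kHz mono WAV file.
import nemo.collections.asr as nemo_asr

# Download and load the pretrained checkpoint by name.
asr_model = nemo_asr.models.EncDecCTCModelBPE.from_pretrained(
    model_name="nvidia/parakeet-ctc-1.1b"
)

# Single-pass, non-autoregressive CTC transcription (greedy decoding by default).
transcripts = asr_model.transcribe(["sample.wav"])
print(transcripts[0])
```

Depending on the NeMo version, `transcribe` returns plain strings or hypothesis objects, but the call pattern above is the same either way.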
Training: Trained using the NVIDIA NeMo toolkit on a large multi-domain English corpus including LibriSpeech, Fisher, Switchboard, WSJ, Common Voice, VCTK, VoxPopuli, Europarl, Multilingual LibriSpeech, and People's Speech, along with a large private corpus.
Use cases: Low-latency English ASR, transcription of long audio, voice interfaces, and a fine-tuning base for domain-specific ASR. Accepts 16 kHz mono-channel WAV audio as input (see the preprocessing sketch below).
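Since the model expects 16 kHz mono WAV input, arbitrary recordings usually need a resample/downmix step first. A minimal sketch (assumptions: `librosa` and `soundfile` are installed; `input.mp3` and `speech_16k.wav` are placeholder file names):

```python
# Convert an arbitrary audio file to the 16 kHz mono WAV the model expects.
# Assumes: pip install librosa soundfile; file names are placeholders.
import librosa
import soundfile as sf

# librosa resamples to 16 kHz and downmixes to mono on load.
audio, sr = librosa.load("input.mp3", sr=16000, mono=True)

# Write a 16-bit PCM WAV ready for transcription.
sf.write("speech_16k.wav", audio, sr, subtype="PCM_16")
```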