XTTS is Coqui’s open-weight multilingual voice cloning text-to-speech family. XTTS v2 is widely self-hosted for high-quality speech synthesis in dozens of languages.
See all models from CoquiModels in family
1
Open weight
1
API only
0
Avg score
64.7
Top benchmark
4.2
MOS
Total HF downloads
7.8M
Primary modality
Audio
First release
Nov 2023
Latest release
Nov 2023
Every release in the XTTS family, ranked by composite score across benchmarks, popularity, efficiency, and versatility.
| # | Model | Modality | Score | Params | Released |
|---|---|---|---|---|---|
| 1 | audio | BB64.7 | — | Nov 2023 |
When each release shipped, newest first. Useful for tracking version cadence.
Nov 11
Composite grades across this family. Higher is better, blending benchmarks, popularity, and efficiency.
Models with downloadable weights, ranked by composite score.
| # | Model | Modality | Score | Params | Released |
|---|---|---|---|---|---|
| 1 | audio | BB64.7 | — | Nov 2023 |
Spin up an instance in the cloud, or pick local hardware that fits.
Full Directory
Open the full directory to filter by hardware, capability, license, and benchmark score.
Or Browse by Provider
See every model from a lab side by side, with aggregate stats. Useful when you want a cross-family view of one provider.