DeepSeek is a Chinese research lab known for cost-efficient Mixture-of-Experts models. Its open-weight V3 and R1 reasoning models match or exceed many closed frontier models on coding and math.
Visit DeepSeekModels tracked
6
Open weight
6
API only
0
Avg score
64.5
Top benchmark
94.2
AIME 2026
Total HF downloads
18.7M
Context window
128K – 1.0M
First release
Dec 2024
Latest release
Apr 2026
Ranked by composite score across benchmarks, popularity, efficiency, and versatility.
| # | Model | Modality | Score | Params | Released |
|---|---|---|---|---|---|
| 1 | text | AA73.6 | 671B | Jan 2025 | |
| 2 | text | BB69.6 | 671B | Dec 2024 | |
| 3 | text | BB66.1 | 685B | Nov 2025 | |
| 4 | text | BB64.9 | 671B | Jul 2025 | |
| 5 | text | BB58.5 | 1.6T | Apr 2026 | |
| 6 | text | CC54.1 | 284B | Apr 2026 |
The strongest model from this provider in each modality. Click a card to open the model page.
How this provider’s catalog splits across text, image, video, audio, and embedding.
When each model shipped, newest first.
Apr 24
Apr 24
Nov 30
Jul 31
Jan 19
Dec 25
Composite grades across this provider’s catalog. Higher is better, blending benchmarks, popularity, and efficiency.
Models with downloadable weights, ranked by composite score.
| # | Model | Modality | Score | Params | Released |
|---|---|---|---|---|---|
| 1 | text | AA73.6 | 671B | Jan 2025 | |
| 2 | text | BB69.6 | 671B | Dec 2024 | |
| 3 | text | BB66.1 | 685B | Nov 2025 | |
| 4 | text | BB64.9 | 671B | Jul 2025 | |
| 5 | text | BB58.5 | 1.6T | Apr 2026 | |
| 6 | text | CC54.1 | 284B | Apr 2026 |
Each family is a model series. Browse all releases in a family with their own leaderboard.
Full Directory
Open the full directory to filter by hardware, capability, license, and benchmark score.
Or Browse by Family
See every release in a family side by side, with a timeline and a leaderboard. Useful when you have already picked a series and want the right size.