MiniMax

MiniMax M3

MiniMax's open-weight flagship, a Mixture-of-Experts model with roughly 428B total parameters and about 23B active per token. It is natively multimodal, accepting text, image, and video input, and supports a 1M-token context window. The model is built on MiniMax Sparse Attention (MSA), which the team reports gives more than 9x faster prefill and more than 15x faster decoding at 1M context versus M2. On agentic and coding benchmarks it scores 59.0% on SWE-Bench Pro, 66.0% on Terminal-Bench 2.1, 74.2% on MCP Atlas, 34.8% on SWE-fficiency, and 28.8% on KernelBench Hard.

428B paramsMoE1000K ctxMultimodal

View on Hugging Face Source Code Official Page

Our Take

Best for: Strongest at graduate-level reasoning (GPQA) in its size class

A workable 428B-parameter MoE language model from MiniMax. Pulls ahead on graduate-level reasoning (GPQA) (93/100), so reach for it when that's the dimension that matters. Newly released, so production-readiness is still being shaken out.

Run this onAMD Instinct MI325XCheapest card in our directory with comfortable headroom (256 GB) for this model at Q4 (~197.8 GB).

Generated from this model’s benchmarks and ranking signals. Editor reviews refine it over time.

Capabilities

Chat

Code Generation

Vision

Reasoning

Function Calling

Multilingual

Instruction Following

Model Specifications

Parameters428B

Active Params23B

ArchitectureMoE

Context Length1M tokens

ModalityMultimodal

ProviderMiniMax

Download Size1.7 TB

Community

Monthly Downloads154.3K

Likes1.2K

Last Updated3 days ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

minimax-communityView Full License

Performance & Scoring

Benchmarks

92.9

80.5

37.1

59.0

AA Intelligence Index

44.4

45.4

42.4

82.9

74.0

MBA Open Score

52.6CC

Benchmark40%

62.1

Popularity25%

44.6

Efficiency20%

15.5

Versatility15%

90.0

Quantization Options

See how different quantization levels affect VRAM requirements and quality for this model.

Format	VRAM Required	Quality
Q2_K	193.0 GB	Low	Aggressive quantization — smallest size, noticeable quality loss
Q4_K_MRecommended	197.8 GB	Good	Best balance of size and quality for most use-cases
Q5_K_M	200.1 GB	Very Good	Slightly better quality than Q4 with moderate size increase
Q6_K	202.9 GB	Excellent	Near-lossless quality with manageable size
Q8_0	208.7 GB	Near Perfect	Virtually indistinguishable from full precision
FP16	230.5 GB	Full	Full 16-bit floating point — maximum quality, largest size

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

102 devices


AMD Instinct MI355XAMD	SS	32.6 tok/s	197.8 GB
AMD Instinct MI325XAMD	AA	24.4 tok/s	197.8 GB
ASUS ExpertCenter Pro ET900N G3ASUS	AA	28.9 tok/s	197.8 GB
Dell Pro Max with GB300Dell	AA	28.9 tok/s	197.8 GB
HP ZGX Fury AI StationHP	AA	28.9 tok/s	197.8 GB
MSI XpertStation WS300MSI	AA	28.9 tok/s	197.8 GB
SuperMicro Super AI StationSuperMicro	AA	28.9 tok/s	197.8 GB
Gigabyte W775-V10-L01Gigabyte	AA	28.9 tok/s	197.8 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	BB	3.3 tok/s	197.8 GB
Apple Mac Studio (M3 Ultra, 2025)Apple	BB	3.3 tok/s	197.8 GB
NVIDIA B200 GPUNVIDIA	BB	32.6 tok/s	197.8 GB
Google TPU v7 (Ironwood)Google	BB	30.0 tok/s	197.8 GB
AMD Instinct MI300XAMD	CC	21.6 tok/s	197.8 GB
Apple Mac Studio (M2 Ultra, 2023)Apple	DD	3.3 tok/s	197.8 GB
ACEMAGIC M1A Pro (i9-13900HK + ARC A770)ACEMAGIC	FF	2.1 tok/s	197.8 GB
Acer Veriton GN100 AI MiniAcer	FF	1.1 tok/s	197.8 GB
AMD Radeon RX 7600 8GBAMD	FF	1.2 tok/s	197.8 GB
AMD Radeon RX 7700 XTAMD	FF	1.8 tok/s	197.8 GB
AMD Radeon RX 7800 XTAMD	FF	2.5 tok/s	197.8 GB
AMD Radeon RX 7900 XTAMD	FF	3.3 tok/s	197.8 GB
AMD Radeon RX 7900 XTXAMD	FF	3.9 tok/s	197.8 GB
AMD Radeon RX 9070AMD	FF	2.6 tok/s	197.8 GB
AMD Radeon RX 9070 XTAMD	FF	2.6 tok/s	197.8 GB
Apple M4Apple	FF	0.5 tok/s	197.8 GB
Apple M4 Max (40-core GPU)Apple	FF	2.2 tok/s	197.8 GB

Rows per page

Page 1 of 5

Run Locally vs API

Energy cost on AMD Instinct MI300X (~22 tok/s, Q4_K_M) vs flagship API pricing.

Source	Cost per 1M tokens
Local (energy only)MiniMax M3 on AMD Instinct MI300X · ~22 tok/s · 750W	$1.16
GPT-5.5OpenAI · in $5.00 · out $30.00	$12.50
Claude Opus 4.7 ThinkingAnthropic · in $5.00 · out $25.00	$11.00
Gemini 3.5 FlashGoogle · in $1.50 · out $9.00	$3.75
Grok 4.3xAI · in $1.25 · out $2.50	$1.63

API prices blended at 70% input / 30% output.

Hardware amortisation not included. Run the full ROI calculator for payback math.

Run the full ROI calculator

Rent in the Cloud

Cheapest current cloud rentals with at least 198 GB VRAM, refreshed hourly.

Option	Cost / GPU-hour
NVIDIA B300Vast.ai · Spot · 288 GB VRAM	$3.50
NVIDIA B300Vast.ai · On-Demand · 288 GB VRAM	$3.75
NVIDIA B300RunPod · Community · 288 GB VRAM	$6.94
NVIDIA B300RunPod · Spot · 288 GB VRAM	$6.94
NVIDIA B300RunPod · Secure · 288 GB VRAM	$7.39

Per-GPU rate across RunPod and the Vast.ai marketplace.

Spot tier is interruptible. Plan for restarts when comparing against on-demand prices.

See the full price index

Related Models

MiniMax

minimax-m2.5

230BMoE

Explore the Provider

See all MiniMax models

Aggregate stats, leaderboard, release timeline, and benchmark coverage across every MiniMax model we track.

Open MiniMax

Explore the Family

See every MiniMax release

The full MiniMax family leaderboard with sizes, benchmark scores, and a release timeline.

Open MiniMax

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

428B

MiniMax

MiniMax M3

428B paramsMoE1000K ctxMultimodal

View on Hugging Face Source Code Official Page

Our Take

Best for: Strongest at graduate-level reasoning (GPQA) in its size class

Run this onAMD Instinct MI325XCheapest card in our directory with comfortable headroom (256 GB) for this model at Q4 (~197.8 GB).

Generated from this model’s benchmarks and ranking signals. Editor reviews refine it over time.

Capabilities

Chat

Code Generation

Vision

Reasoning

Function Calling

Multilingual

Instruction Following

Model Specifications

Parameters428B

Active Params23B

ArchitectureMoE

Context Length1M tokens

ModalityMultimodal

ProviderMiniMax

Download Size1.7 TB

Community

Monthly Downloads154.3K

Likes1.2K

Last Updated3 days ago

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

minimax-communityView Full License

Performance & Scoring

Benchmarks

92.9

80.5

37.1

59.0

AA Intelligence Index

44.4

45.4

42.4

82.9

74.0

MBA Open Score

52.6CC

Benchmark40%

62.1

Popularity25%

44.6

Efficiency20%

15.5

Versatility15%

90.0

Quantization Options

See how different quantization levels affect VRAM requirements and quality for this model.

Format	VRAM Required	Quality
Q2_K	193.0 GB	Low	Aggressive quantization — smallest size, noticeable quality loss
Q4_K_MRecommended	197.8 GB	Good	Best balance of size and quality for most use-cases
Q5_K_M	200.1 GB	Very Good	Slightly better quality than Q4 with moderate size increase
Q6_K	202.9 GB	Excellent	Near-lossless quality with manageable size
Q8_0	208.7 GB	Near Perfect	Virtually indistinguishable from full precision
FP16	230.5 GB	Full	Full 16-bit floating point — maximum quality, largest size

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

102 devices


AMD Instinct MI355XAMD	SS	32.6 tok/s	197.8 GB
AMD Instinct MI325XAMD	AA	24.4 tok/s	197.8 GB
ASUS ExpertCenter Pro ET900N G3ASUS	AA	28.9 tok/s	197.8 GB
Dell Pro Max with GB300Dell	AA	28.9 tok/s	197.8 GB
HP ZGX Fury AI StationHP	AA	28.9 tok/s	197.8 GB
MSI XpertStation WS300MSI	AA	28.9 tok/s	197.8 GB
SuperMicro Super AI StationSuperMicro	AA	28.9 tok/s	197.8 GB
Gigabyte W775-V10-L01Gigabyte	AA	28.9 tok/s	197.8 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	BB	3.3 tok/s	197.8 GB
Apple Mac Studio (M3 Ultra, 2025)Apple	BB	3.3 tok/s	197.8 GB
NVIDIA B200 GPUNVIDIA	BB	32.6 tok/s	197.8 GB
Google TPU v7 (Ironwood)Google	BB	30.0 tok/s	197.8 GB
AMD Instinct MI300XAMD	CC	21.6 tok/s	197.8 GB
Apple Mac Studio (M2 Ultra, 2023)Apple	DD	3.3 tok/s	197.8 GB
ACEMAGIC M1A Pro (i9-13900HK + ARC A770)ACEMAGIC	FF	2.1 tok/s	197.8 GB
Acer Veriton GN100 AI MiniAcer	FF	1.1 tok/s	197.8 GB
AMD Radeon RX 7600 8GBAMD	FF	1.2 tok/s	197.8 GB
AMD Radeon RX 7700 XTAMD	FF	1.8 tok/s	197.8 GB
AMD Radeon RX 7800 XTAMD	FF	2.5 tok/s	197.8 GB
AMD Radeon RX 7900 XTAMD	FF	3.3 tok/s	197.8 GB
AMD Radeon RX 7900 XTXAMD	FF	3.9 tok/s	197.8 GB
AMD Radeon RX 9070AMD	FF	2.6 tok/s	197.8 GB
AMD Radeon RX 9070 XTAMD	FF	2.6 tok/s	197.8 GB
Apple M4Apple	FF	0.5 tok/s	197.8 GB
Apple M4 Max (40-core GPU)Apple	FF	2.2 tok/s	197.8 GB

Rows per page

Page 1 of 5

Run Locally vs API

Energy cost on AMD Instinct MI300X (~22 tok/s, Q4_K_M) vs flagship API pricing.

Source	Cost per 1M tokens
Local (energy only)MiniMax M3 on AMD Instinct MI300X · ~22 tok/s · 750W	$1.16
GPT-5.5OpenAI · in $5.00 · out $30.00	$12.50
Claude Opus 4.7 ThinkingAnthropic · in $5.00 · out $25.00	$11.00
Gemini 3.5 FlashGoogle · in $1.50 · out $9.00	$3.75
Grok 4.3xAI · in $1.25 · out $2.50	$1.63

API prices blended at 70% input / 30% output.

Hardware amortisation not included. Run the full ROI calculator for payback math.

Run the full ROI calculator

Rent in the Cloud

Cheapest current cloud rentals with at least 198 GB VRAM, refreshed hourly.

Option	Cost / GPU-hour
NVIDIA B300Vast.ai · Spot · 288 GB VRAM	$3.50
NVIDIA B300Vast.ai · On-Demand · 288 GB VRAM	$3.75
NVIDIA B300RunPod · Community · 288 GB VRAM	$6.94
NVIDIA B300RunPod · Spot · 288 GB VRAM	$6.94
NVIDIA B300RunPod · Secure · 288 GB VRAM	$7.39

Per-GPU rate across RunPod and the Vast.ai marketplace.

Spot tier is interruptible. Plan for restarts when comparing against on-demand prices.

See the full price index

Related Models

MiniMax

minimax-m2.5

230BMoE

Explore the Provider

See all MiniMax models

Aggregate stats, leaderboard, release timeline, and benchmark coverage across every MiniMax model we track.

Open MiniMax

Explore the Family

See every MiniMax release

The full MiniMax family leaderboard with sizes, benchmark scores, and a release timeline.

Open MiniMax

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.