32B parameter rectified flow transformer for text-to-image generation and editing. Supports up to 4MP resolution and multi-reference combinations in a single checkpoint.
No benchmark data available for this model yet.
See which devices can run this model and at what quality level.
| Device | Vendor | Quality | Memory Required |
|---|---|---|---|
| GIGABYTE AI TOP ATOM | Gigabyte | SS | 20.1 GB |
| Gigabyte W775-V10-L01 | Gigabyte | SS | 20.1 GB |
| Google Cloud TPU v5p | Google | SS | 20.1 GB |
FLUX.2 [dev] is a state-of-the-art 32B parameter rectified flow transformer designed by Black Forest Labs for high-fidelity text-to-image generation and complex image editing. Positioned as the open-weight successor to the original FLUX.1 series, this model bridges the gap between research-grade weights and production-ready output. It competes directly with top-tier proprietary models like Midjourney v6 and DALL-E 3, but with the distinct advantage of being runnable on local consumer hardware.
Unlike standard diffusion models, FLUX.2 [dev] utilizes a rectified flow architecture that excels at spatial reasoning and prompt adherence. It is specifically engineered for practitioners who require precise control over composition, typography, and photorealism without relying on cloud-based APIs. The "dev" variant is a guidance-distilled version of the model, optimized to provide high-quality results in fewer steps than the "pro" versions while maintaining the full 32B parameter density.
The core of FLUX.2 [dev] is a 32B dense parameter rectified flow transformer. This architecture represents a shift from traditional U-Net structures, offering better scaling laws and more stable training for high-resolution outputs.
The model uses a flow-matching framework, which simplifies the path between noise and the final image. This results in cleaner textures and more accurate object placement compared to older latent diffusion techniques. Because it is a dense 32B model, every parameter is active during every inference step. While this demands more VRAM than a Mixture-of-Experts (MoE) or a smaller 12B model, it provides a level of detail and "understanding" of complex prompts that smaller models cannot match.
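The straight-path idea behind flow matching can be sketched in a few lines of NumPy. This is an illustrative toy, not FLUX.2's actual training or sampling code: rectified flow trains the network to predict a constant velocity along the straight line between noise and data, and sampling integrates that velocity field.

```python
import numpy as np

def rectified_flow_pair(x0, x1, t):
    """Point on the straight path between noise x0 and data x1 at time t.

    Rectified flow trains the model to predict the constant velocity
    (x1 - x0) along this line, rather than the curved trajectories of
    classic diffusion schedules.
    """
    xt = (1.0 - t) * x0 + t * x1      # linear interpolation
    v_target = x1 - x0                # velocity target, constant in t
    return xt, v_target

def euler_sample(v_model, x0, steps=25):
    """Integrate dx/dt = v_model(x, t) from t=0 (noise) to t=1 (image)."""
    x, dt = x0, 1.0 / steps
    for i in range(steps):
        t = i * dt
        x = x + dt * v_model(x, t)    # one Euler step along the flow
    return x

# Toy check: with the exact (constant) velocity field, Euler integration
# from the noise sample lands exactly on the data sample.
rng = np.random.default_rng(0)
noise, image = rng.normal(size=4), np.ones(4)
recovered = euler_sample(lambda x, t: image - noise, noise, steps=25)
assert np.allclose(recovered, image)
```

Because the target path is straight, fewer integration steps introduce less error than with curved diffusion trajectories, which is part of why the guidance-distilled "dev" variant can produce good results in 20-25 steps.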
FLUX.2 [dev] incorporates a vision-language model (VLM) with a context window of approximately 32,000 tokens. In practical terms, this allows the model to process extremely long, descriptive prompts and complex layout instructions. It can interpret multi-paragraph descriptions of a scene, maintaining coherence across various elements like lighting, specific character attire, and background details.
A standout technical feature is the native support for multi-reference combinations within a single checkpoint. Traditional models often require LoRAs or ControlNets to maintain character or style consistency. FLUX.2 [dev] can ingest up to 10 reference images simultaneously, allowing users to specify a character from one image, a clothing style from another, and a lighting environment from a third, all without additional fine-tuning.
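One way to picture in-context multi-reference conditioning (the exact FLUX.2 mechanism is not documented here, so shapes and the patching scheme below are illustrative assumptions) is that each reference image is flattened into patch tokens and concatenated with the text tokens into one conditioning sequence the transformer attends over:

```python
import numpy as np

def build_context(text_tokens, reference_images, d_model=64, patch=8):
    """Illustrative sketch: flatten each reference image into patch
    tokens and concatenate them with the text tokens into a single
    conditioning sequence. All shapes are hypothetical."""
    sequences = [text_tokens]
    for img in reference_images:              # img: (H, W, C) float array
        h, w, c = img.shape
        patches = img.reshape(h // patch, patch, w // patch, patch, c)
        patches = patches.transpose(0, 2, 1, 3, 4).reshape(-1, patch * patch * c)
        # project each patch to the model width (random matrix as a stand-in
        # for a learned projection)
        proj = np.random.default_rng(0).normal(size=(patches.shape[1], d_model))
        sequences.append(patches @ proj)
    return np.concatenate(sequences, axis=0)  # (text + all ref tokens, d_model)

text = np.zeros((77, 64))                         # placeholder text tokens
refs = [np.zeros((32, 32, 3)) for _ in range(3)]  # three reference images
ctx = build_context(text, refs)
# 77 text tokens + 3 images * (32/8)^2 = 48 patch tokens -> 125 total
assert ctx.shape == (125, 64)
```

The practical consequence of sequence concatenation is that every reference contributes tokens to the same attention context, so consistency comes from attention rather than from auxiliary adapters like LoRAs or ControlNets.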
FLUX.2 [dev] is built for professional creative workflows where "good enough" is insufficient. It excels in areas where previous open-weight models typically struggled, such as legible typography, photorealistic detail, and precise compositional control.
To run FLUX.2 [dev] locally, your primary bottleneck will be VRAM. As a 32B dense model, the memory footprint is significant, especially once the VAE and the text-encoder stack are loaded alongside the transformer.
For most practitioners, Q4_K_M (4-bit) is the sweet spot: it shrinks the model enough to fit on 16-24 GB cards while retaining the vast majority of the full-precision generation quality. If you are doing professional photography-style work where skin pores and fabric textures are critical, FP8 or Q8_0 on 24 GB+ hardware is recommended.
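The weight footprint at each precision is simple arithmetic over the 32B parameter count. The bits-per-weight figures below are approximations (GGUF K-quants keep scales and some tensors at higher precision, so the effective average is above the nominal bit width):

```python
PARAMS = 32e9  # dense parameter count

# Approximate effective bits per weight for common formats.
BITS_PER_WEIGHT = {
    "BF16/FP16": 16.0,
    "FP8": 8.0,
    "Q8_0": 8.5,
    "Q4_K_M": 4.85,  # approximate K-quant average
}

for fmt, bits in BITS_PER_WEIGHT.items():
    gib = PARAMS * bits / 8 / 2**30
    print(f"{fmt:>9}: ~{gib:5.1f} GiB for the transformer weights alone")
# FP16 lands near 60 GiB, FP8/Q8_0 near 30 GiB, and Q4_K_M near 18 GiB.
# Text encoders, the VAE, activations, and framework overhead add several
# more GiB on top of the raw weight footprint.
```

This is why 4-bit quantization is the practical floor for 16-24 GB consumer cards: the ~18 GiB Q4_K_M footprint fits a 24 GB card outright and a 16 GB card only with partial CPU offloading.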
On an RTX 4090 using optimized environments like ComfyUI or Forge, you can expect a 1024x1024 image in approximately 10–15 seconds using 20–25 steps. Native 4MP generations will take significantly longer, often 45–90 seconds depending on the complexity and quantization level.
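The gap between 1MP and 4MP render times follows from the token count the transformer must attend over. Assuming a FLUX.1-style 8x VAE downscale with 2x2 latent patching (an assumption for FLUX.2, not a confirmed detail):

```python
def latent_tokens(width, height, vae_factor=8, patch=2):
    """Transformer token count for a given resolution, assuming an 8x
    VAE downscale and 2x2 latent patching (assumed, FLUX.1-style)."""
    f = vae_factor * patch
    return (width // f) * (height // f)

base = latent_tokens(1024, 1024)      # 1MP -> 4096 tokens
four_mp = latent_tokens(2048, 2048)   # ~4MP -> 16384 tokens
print(base, four_mp, four_mp // base)
# Self-attention cost grows roughly quadratically with token count, so
# 4x the tokens implies up to ~16x attention cost per step, which is why
# native 4MP renders take far longer than a simple 4x of the 1MP time.
```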
The quickest way to get started is via Ollama (for CLI/backend integration) or ComfyUI, which has robust support for the FLUX.2 architecture and its multi-reference inputs.
FLUX.2 [dev] sits in a unique position between lightweight models and massive enterprise systems.
For developers building local AI tools, FLUX.2 [dev] is currently the benchmark for what is possible with open-weight image generation on consumer-grade GPUs.