14B total / 7B active Mixture-of-Transformer-Experts model unifying multimodal understanding and generation. Dual-encoder (SigLIP-L + FLUX.1 VAE) with specialized language and vision decoder experts.
Access model weights, configuration files, and documentation.
See which devices can run this model and at what quality level.