GritLM (Contextual AI)

GritLM-8x7B

MoE Mixtral-8x7B unified embedding+generation model, best-in-class open generation, competitive on MTEB.

57.9B paramsMoE

A workable 57.9B-parameter MoE embedding model from GritLM (Contextual AI). Treat the modality benchmarks above as the leading indicator of fit — composite scoring across modalities is still maturing.

Generated from this model’s benchmarks and ranking signals. Editor reviews refine it over time.

Model Specifications

Parameters57.9B

Quick Start

Download from Hugging Face

Access model weights, configuration files, and documentation.

Download from Hugging Face

License

Apache 2.0View Full License

Performance & Scoring

Benchmarks

Retrieval

57.5

Classification

61.5

Clustering

50.2

STS

73.2

Overall Score

44.4CC

Benchmark60%

60.6

Popularity25%

30.0

Efficiency15%

3.7

Hardware Compatibility

See which devices can run this model and at what quality level.

Hide F tierOnly featured devices

102 devices


ACEMAGIC M1A Pro (i9-13900HK + ARC A770)ACEMAGIC	SS	8.5 GB
Acer Veriton GN100 AI MiniAcer	SS	8.5 GB
AMD Instinct MI300XAMD	SS	8.5 GB
AMD Instinct MI325XAMD	SS	8.5 GB
AMD Instinct MI355XAMD	SS	8.5 GB
AMD Radeon RX 7800 XTAMD	SS	8.5 GB
AMD Radeon RX 7900 XTAMD	SS	8.5 GB
AMD Radeon RX 7900 XTXAMD	SS	8.5 GB
AMD Radeon RX 9070AMD	SS	8.5 GB
AMD Radeon RX 9070 XTAMD	SS	8.5 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)Apple	SS	8.5 GB
Apple M4Apple	SS	8.5 GB
Apple M4 Max (40-core GPU)Apple	SS	8.5 GB
Apple M4 Pro (14-core CPU, 20-core GPU)Apple	SS	8.5 GB
Apple M5Apple	SS	8.5 GB
Apple M5 Max (18-core CPU, 40-core GPU)Apple	SS	8.5 GB
Apple M5 Pro (18-core CPU, 20-core GPU)Apple	SS	8.5 GB
Apple Mac Mini (M1, 2020)Apple	SS	8.5 GB
Apple Mac Mini (M2, 2023)Apple	SS	8.5 GB
Apple Mac Mini (M2 Pro, 2023)Apple	SS	8.5 GB
Apple Mac Mini (M4, 2024)Apple	SS	8.5 GB
Apple Mac Mini (M4 Pro, 2024)Apple	SS	8.5 GB
Apple Mac Studio (M1 Max, 2022)Apple	SS	8.5 GB
Apple Mac Studio (M1 Ultra, 2022)Apple	SS	8.5 GB
Apple Mac Studio (M2 Max, 2023)Apple	SS	8.5 GB

Rows per page

Page 1 of 5

About This Model

The scaled-up Mixture-of-Experts variant of GritLM, fine-tuned from Mixtral-8x7B via Generative Representational Instruction Tuning to unify generation and embedding in a sparse 47B-parameter model. At release it outperformed all open generative LMs the authors tested while still ranking among the strongest embedding models, demonstrating GRIT's scalability to MoE architectures.

Find the Best Hardware for This Model

Use our hardware calculator to find the optimal device for running this model.

GritLM-8x7B

Our Take

Model Specifications

Quick Start

Download from Hugging Face

License

Performance & Scoring

Benchmarks

Overall Score

Hardware Compatibility

About This Model

Related Models

GritLM-7B

Find the Best Hardware for This Model

Community