Qwen/Alibaba

Qwen3-Embedding-4B

Mid-size 4B Qwen3 embedding model balancing quality and efficiency.

4B params, Dense


Model Specifications

Parameters: 4B
Active Params: 3.6B
Architecture: Dense
Provider: Qwen/Alibaba
Download Size: 36.2 GB

Community

Monthly Downloads: 1.8M
Likes: 254
Last Updated: 10 months ago

Quick Start

Run with Ollama

Copy and paste this command to start running the model locally.

ollama run qwen3-embedding:4b
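An embedding model is usually queried programmatically rather than interactively. The following is a minimal sketch, assuming an Ollama server is listening on its default local address and serves the model under the tag from the command above. The helper names build_embed_request and embed are illustrative and do not come from this page.

```python
import json
import urllib.request

# Assumption: Ollama's default local address and its /api/embed endpoint.
OLLAMA_EMBED_URL = "http://localhost:11434/api/embed"
MODEL_TAG = "qwen3-embedding:4b"  # tag taken from the command above


def build_embed_request(texts, model=MODEL_TAG, url=OLLAMA_EMBED_URL):
    """Build (but do not send) a POST request asking for embeddings of `texts`."""
    payload = json.dumps({"model": model, "input": list(texts)}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(texts, timeout=60):
    """Send the request and return one embedding (list of floats) per input text."""
    with urllib.request.urlopen(build_embed_request(texts), timeout=timeout) as resp:
        return json.load(resp)["embeddings"]


# Example (requires a running Ollama server with the model already pulled):
# vectors = embed(["What is the capital of France?", "Paris is in France."])
# print(len(vectors), len(vectors[0]))
```

Building the request separately from sending it keeps the payload easy to inspect without a running server.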

Download from Hugging Face

Access model weights, configuration files, and documentation.


License

Apache 2.0

Performance & Scoring

Benchmarks

MTEB Overall: 69.5
Retrieval: 69.6
Classification: 72.3
Clustering: 57.1
STS: 80.9

Overall Score

71.7 (A)

Benchmark (60% weight): 69.9
Popularity (25% weight): 85.6
Efficiency (15% weight): 55.6
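The overall score is consistent with a weighted sum of the three components listed above. A minimal sketch of that arithmetic follows, using only the numbers shown on this page. The function name weighted_score is illustrative, and the weighted-sum formula is inferred from the listed weights rather than documented here.

```python
def weighted_score(components):
    """Combine (score, weight) pairs into a single weighted sum.

    Weights are expected to add up to 1.0.
    """
    total_weight = sum(weight for _, weight in components)
    if abs(total_weight - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1.0")
    return sum(score * weight for score, weight in components)


# Component scores and weights as listed on this page.
components = [
    (69.9, 0.60),  # Benchmark
    (85.6, 0.25),  # Popularity
    (55.6, 0.15),  # Efficiency
]

overall = weighted_score(components)
print(round(overall, 1))  # 41.94 + 21.40 + 8.34 = 71.68, which rounds to 71.7
```

The result matches the published overall score of 71.7 after rounding to one decimal place.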

Hardware Compatibility

See which devices can run this model and at what quality level.


83 devices (first 25 listed)

Device	Vendor	Tier	Size
Acer Veriton GN100 AI Mini	Acer	S	2.7 GB
AMD Instinct MI300X	AMD	S	2.7 GB
AMD Instinct MI325X	AMD	S	2.7 GB
AMD Instinct MI355X	AMD	S	2.7 GB
AMD Radeon RX 7600 8GB	AMD	S	2.7 GB
AMD Radeon RX 7700 XT	AMD	S	2.7 GB
AMD Radeon RX 7800 XT	AMD	S	2.7 GB
AMD Radeon RX 7900 XT	AMD	S	2.7 GB
AMD Radeon RX 7900 XTX	AMD	S	2.7 GB
AMD Radeon RX 9070	AMD	S	2.7 GB
AMD Radeon RX 9070 XT	AMD	S	2.7 GB
Apple M3 Ultra (32-core CPU, 80-core GPU)	Apple	S	2.7 GB
Apple M4	Apple	S	2.7 GB
Apple M4 Max (40-core GPU)	Apple	S	2.7 GB
Apple M4 Pro (14-core CPU, 20-core GPU)	Apple	S	2.7 GB
Apple M5	Apple	S	2.7 GB
Apple M5 Max (18-core CPU, 40-core GPU)	Apple	S	2.7 GB
Apple M5 Pro (18-core CPU, 20-core GPU)	Apple	S	2.7 GB
Apple Mac Mini (M1, 2020)	Apple	S	2.7 GB
Apple Mac Mini (M2, 2023)	Apple	S	2.7 GB
Apple Mac Mini (M2 Pro, 2023)	Apple	S	2.7 GB
Apple Mac Mini (M4, 2024)	Apple	S	2.7 GB
Apple Mac Mini (M4 Pro, 2024)	Apple	S	2.7 GB
Apple Mac Studio (M1 Max, 2022)	Apple	S	2.7 GB
Apple Mac Studio (M1 Ultra, 2022)	Apple	S	2.7 GB


About This Model

The mid-size variant of the Qwen3 Embedding series, fine-tuned from Qwen3-4B-Base using the same three-stage recipe as the 8B model: contrastive pre-training, supervised fine-tuning, and model merging. It supports 100+ languages, a 32K-token context, instruction-aware embeddings, and Matryoshka (truncatable) output dimensions, offering a strong balance of quality and inference cost.
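Two of the features above can be illustrated without loading the model. This is a minimal sketch. The truncation follows the general Matryoshka idea of keeping a prefix of the vector and re-normalizing it. The instruction template in format_query is an assumption for illustration only, so check the model's official documentation for the exact prompt format. All function names here are hypothetical.

```python
import math


def truncate_embedding(vector, dim):
    """Matryoshka-style shortening: keep the first `dim` values, rescale to unit length."""
    if not 0 < dim <= len(vector):
        raise ValueError("dim must be between 1 and len(vector)")
    head = vector[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def format_query(task, query):
    """Prefix a query with a task instruction.

    Assumption: this template is illustrative, not confirmed by this page.
    """
    return f"Instruct: {task}\nQuery: {query}"


# Toy vectors standing in for real model output.
doc = truncate_embedding([3.0, 4.0, 12.0], dim=2)
qry = truncate_embedding([6.0, 8.0, -1.0], dim=2)
print(doc)
print(cosine_similarity(doc, qry))
print(format_query("Retrieve relevant passages", "What is MTEB?"))
```

Shorter vectors reduce storage and search cost, usually at some loss in retrieval quality, so the chosen dimension should be validated on representative data.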

Related Models

Qwen3-Embedding-8B (Qwen/Alibaba): 7.6B params, Dense
Qwen3-Embedding-0.6B (Qwen/Alibaba): 0.596B params, Dense

