deepset

Haystack

Modular graph-based component pipelines for RAG, search, and agentic workflows.

Modular RAG and search pipelines

Visit Site View on GitHub Read the Docs Join Discord

GitHub Stars

25.8K

Contributors

387

npm / Week

—

PyPI / Month

962.4K

Maintained by: deepset
First released: Nov 2019
Last commit: 1 wk ago
Language: Python
License: apache-2

Overview

Haystack is an open-source Python framework for building production-ready LLM applications, maintained by deepset (Berlin-based deepset GmbH). First released in 2019, it has evolved into a mature orchestration layer designed for modular RAG, semantic search, and agentic workflows. With 25,325 GitHub stars, 358 contributors, and over 868,000 monthly PyPI downloads, it is one of the most widely adopted frameworks in the Python AI ecosystem.

Haystack occupies the orchestration and retrieval/RAG category, competing directly with LangChain, LlamaIndex, and to a lesser extent CrewAI and LangGraph. What sets Haystack apart is its explicit graph-based pipeline architecture: instead of sequential chains or loosely coupled agents, every interaction between components is typed, validated at connect time, and explicitly directed. This design philosophy comes from deepset’s experience building enterprise search and QA systems. The team prioritizes transparency, debuggability, and production reliability over rapid prototyping convenience.

Haystack is best suited for teams that need to build complex retrieval pipelines with conditional branching, multi-step reasoning, and human oversight. It is Apache 2.0 licensed and fully self-hostable, with a managed cloud option (deepset Cloud) for enterprise deployments.

Architecture and Programming Model

Haystack’s core abstraction is the Pipeline: a directed multigraph of components connected by typed input/output sockets. Each component declares what it consumes and produces (e.g., a Retriever outputs List[Document], a PromptBuilder takes documents and query and produces a str). Connections are validated at pipeline.connect() time, not at runtime. This means misconfiguration errors surface during development, not in production.

Building an application is imperative (code-first) but declarative in structure: you instantiate components, add them to a pipeline, and wire them together. Control flow is explicit: you can create branching paths, conditional routing (e.g., skip retrieval if the query is short), and cycles for iterative refinement. Haystack 2.x supports loops, allowing agents to reflect on their outputs and re-enter earlier stages.

Multi-agent flows are modeled as pipelines that call sub-pipelines or include agent components with tool-use capabilities. Each agent component can host its own nested pipeline, enabling hierarchical orchestration. Human-in-the-loop is built into the pipeline graph: a special HumanValidation component intercepts tool calls and waits for approval, rejection, or parameter modification before proceeding.

Haystack is Python-only. There is no TypeScript SDK, which limits its use in full-stack Node.js environments.

Key Features and Capabilities

Graph-based pipelines enable complex topologies beyond linear chains. You can model parallel retrievers (e.g., BM25 + dense embedding) followed by a reranker, then route results to different generators based on confidence scores. Conditional branches allow “if-else” logic inside the pipeline graph.

70+ modular integrations cover vector stores (Weaviate, Pinecone, Elasticsearch, Qdrant, Chroma), model providers (OpenAI, Anthropic, Mistral, Hugging Face, Cohere, Google), document parsers (PDF, HTML, Markdown, slides), and tools (web search, code execution, calculators). Official integrations are maintained by deepset; community ones live in the haystack-community package.

Human-in-the-loop validation is not an afterthought. The ToolValidator component intercepts any agent tool call and presents it to a human for approval, rejection, or parameter editing before execution. This is essential for high-stakes automation (e.g., financial transactions, medical data handling).

Multi-agent coordination is supported through nested pipelines and the Agent component. Each agent can have its own set of tools and memory, and communicate with other agents via pipeline sockets. This is more structured than emergent agent-to-agent chat patterns: Haystack agents are deterministic and debuggable.

Streaming is supported at the generator level, enabling token-by-token output in chat applications. The pipeline propagates streaming responses through the graph.

Tracing and evaluations are first-class. Haystack emits OpenTelemetry-compatible spans for every pipeline run, which can be viewed in any observability backend (e.g., Grafana, Datadog). The evaluation module provides metrics for RAG pipelines (faithfulness, answer relevance, document recall) and can be integrated into CI/CD.

Memory is handled via ChatMessage stores and the ConversationalMemory component, which persists conversation history across pipeline runs.

Self-hostable and cloud-hosted: Haystack runs anywhere Python runs. deepset Cloud provides a managed platform with hosted inference, monitoring, and collaborative debugging.

Real-World Use Cases

Enterprise RAG over documents is the most mature use case. Teams build pipelines that ingest PDFs, slides, and tables, chunk them, embed them into a vector store, then run hybrid retrieval (keyword + semantic) with reranking. Haystack’s document processing pipeline (clean, split, embed, store) is production-proven at scale.

Semantic search and question answering power internal knowledge bases and customer support portals. The framework’s strong retrieval foundation (sparse/dense/multi-modal) delivers high precision for both fact lookup and open-ended QA.

Autonomous web research agents combine search APIs (SerpAPI, DuckDuckGo), content scraping, and self-evaluation. A pipeline fetches multiple search results, scrapes each page, extracts relevant sections, and uses a generator to synthesize an answer with citations. The agent can loop back to refine its search if the initial answer lacks confidence.

Customer support automation uses human-in-the-loop to validate tool calls (e.g., checking an order shipment status). The pipeline retrieves relevant knowledge base articles, generates a draft response, and passes it to a human for approval before sending.

Internal copilots for code generation, query writing, or data analysis: agents have access to databases, code interpreters, and documentation retrievers. The graph structure lets you add guardrails (e.g., validate SQL queries before execution) without breaking the flow.

Haystack is a poor fit for simple chat completions or linear chains where you just call one LLM. The overhead of building a pipeline for “prompt -> LLM -> output” is unnecessary; frameworks like Pydantic AI or simple openai SDK calls are lighter. It also struggles in JavaScript/Node.js ecosystems.

Getting Started With Haystack

Install the framework:

1pip install haystack-ai

You’ll need an LLM provider key (OpenAI, Anthropic, etc.) and, for retrieval pipelines, a running vector store (Weaviate, Pinecone, or in-memory for prototyping).

The smallest meaningful example: a retrieval-augmented generation pipeline.

1from haystack import Pipeline
2from haystack.components.retrievers.in_memory import InMemoryBM25Retriever
3from haystack.components.builders import PromptBuilder
4from haystack.components.generators import OpenAIGenerator
5from haystack.document_stores.in_memory import InMemoryDocumentStore
6
7# Initialize store and index some documents
8doc_store = InMemoryDocumentStore()
9doc_store.write_documents([{"content": "Haystack is an open-source framework..."}])
10
11pipeline = Pipeline()
12pipeline.add_component("retriever", InMemoryBM25Retriever(doc_store))
13pipeline.add_component("prompt", PromptBuilder(template="Answer based on: {{documents}} \n {{query}}"))
14pipeline.add_component("llm", OpenAIGenerator(model="gpt-4o-mini"))
15
16pipeline.connect("retriever.documents", "prompt.documents")
17pipeline.connect("prompt.prompt", "llm.prompt")
18
19result = pipeline.run({"retriever": {"query": "What is Haystack?"}})
20print(result["llm"]["replies"])

Full documentation lives at [haystack.deepset.ai](https://haystack.deepset.ai). The community is active on Discord and GitHub. For enterprise features (role-based access, monitoring, deployment templates), see [deepset Cloud](https://deepset.ai/cloud).

How It Compares

Haystack vs LangChain: LangChain offers a broader ecosystem (JavaScript support, more integrations, larger community) and a lower initial learning curve via its chain abstraction. However, Haystack’s typed graph pipelines are more disciplined for production. If you need a quick prototype across multiple languages, LangChain is faster. If you need a reliable, debuggable pipeline for enterprise RAG with human-in-the-loop, Haystack wins.

Haystack vs LlamaIndex: LlamaIndex excels at document ingestion, indexing, and retrieval tuning. Its declarative index abstractions make it straightforward for standard RAG. Haystack’s graph pipelines give you more control over the orchestration logic (branching, loops, tool use). For complex agentic workflows that require conditional routing and iterative refinement, Haystack is the better choice. For simple “query a document” use cases, LlamaIndex may be quicker.

Haystack vs CrewAI: CrewAI focuses on role-based multi-agent teams with natural language task descriptions. Haystack’s multi-agent model is more programmatic and explicit. Choose CrewAI if you want agents to “figure out” the plan via LLM reasoning. Choose Haystack if you need deterministic, auditable pipelines where every step is predefined and type-checked.

When to avoid Haystack: you need a TypeScript or JavaScript framework; your use case is a trivial chat completion; you want the most popular ecosystem regardless of architecture. In those cases, consider LangChain (TS), Mastra (TS), or plain OpenAI SDK.

Strengths

Highly flexible and composable directed-graph pipeline architecture.
Extensive integration ecosystem with 70+ vector stores, generators, and tools.
Robust native support for semantic search and complex document retrieval.
First-class managed cloud platform (deepset Cloud) for enterprise deployment.

Trade-offs

Higher learning curve for simple pipelines compared to linear-chain SDKs.
Python-only — lacks a first-class TypeScript SDK.
Migrating from older 1.x architectures to 2.x requires significant refactoring.

Key Features

What the framework gives you out of the box, in plain language.

Multi-Agent
Streaming
Tool Use
Human in the Loop
Memory
Tracing
Evaluations
Self-Hostable
Cloud-Hosted
Type-Safe

Graph-based pipelines
Connect components in a directed graph to build cyclical loops, conditional routing, and complex multi-agent flows.
70+ modular integrations
Easily hook into vector databases, model providers, document parsers, and custom tools with official and community integrations.
Human-in-the-loop validation
Intercept agent tool calls to request human approval, rejection, or parameter modifications before execution.

Where It Shines

The jobs this framework is best suited for.

Enterprise RAG over documents
Build optimized retrieval pipelines over unstructured PDFs, slides, and tables with advanced retrieval and reranking techniques.
Semantic search and QA
Query large document corpora to locate precise answers or semantically relevant sections with high accuracy.
Autonomous web research agents
Assemble agents that call search APIs, scrape content, and self-evaluate their results before answering.

Side-by-Side

Compare Haystack With Another Framework

Add a second or third framework and see stars, downloads, and capabilities lined up next to each other.

Open the Comparator

Related Frameworks

Close alternatives worth a look before you decide.

LangChain

Composable building blocks for LLM apps — chains, agents, retrievers, and integrations.

Composable LLM building blocks

Stars

140.3K

npm / wk

2.4M

PyPI / mo

312.4M

MixedMITLast commit:1 wk ago

LlamaIndex

Data-grounded agents and RAG pipelines, with deep indexing primitives.

Data-grounded RAG agents

Stars

50.5K

npm / wk

—

PyPI / mo

7.5M

MixedMITLast commit:1 wk ago

CrewAI

Multi-agent crews with role-based prompts and explicit task hand-offs.

Role-based multi-agent crews

Stars

54.5K

npm / wk

—

PyPI / mo

11.1M

PythonMITLast commit:1 wk ago

Frequently Asked Questions

What Is an Agent Framework?

An agent framework is the code your team uses to wire large language models into tools, memory, and human checkpoints. It is the connective tissue between an LLM call and a real task, like answering a support ticket or running a multi-step research workflow.

Is Haystack open source?

Haystack ships under the apache-2 license. The source code lives on GitHub, so you can read it, fork it, and run it on your own infrastructure if your team prefers self-hosting.

Which language is Haystack built in?

Haystack is primarily a Python project. Pick a framework that matches the language your team already ships in. The cost of a stack switch is almost always higher than the difference between two frameworks.

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

deepset

Haystack

Modular graph-based component pipelines for RAG, search, and agentic workflows.

Modular RAG and search pipelines

Visit Site View on GitHub Read the Docs Join Discord

GitHub Stars

25.8K

Contributors

387

npm / Week

—

PyPI / Month

962.4K

Maintained by: deepset
First released: Nov 2019
Last commit: 1 wk ago
Language: Python
License: apache-2

Overview

Architecture and Programming Model

Haystack is Python-only. There is no TypeScript SDK, which limits its use in full-stack Node.js environments.

Key Features and Capabilities

Streaming is supported at the generator level, enabling token-by-token output in chat applications. The pipeline propagates streaming responses through the graph.

Memory is handled via ChatMessage stores and the ConversationalMemory component, which persists conversation history across pipeline runs.

Self-hostable and cloud-hosted: Haystack runs anywhere Python runs. deepset Cloud provides a managed platform with hosted inference, monitoring, and collaborative debugging.

Real-World Use Cases

Getting Started With Haystack

Install the framework:

1pip install haystack-ai

You’ll need an LLM provider key (OpenAI, Anthropic, etc.) and, for retrieval pipelines, a running vector store (Weaviate, Pinecone, or in-memory for prototyping).

The smallest meaningful example: a retrieval-augmented generation pipeline.

1from haystack import Pipeline
2from haystack.components.retrievers.in_memory import InMemoryBM25Retriever
3from haystack.components.builders import PromptBuilder
4from haystack.components.generators import OpenAIGenerator
5from haystack.document_stores.in_memory import InMemoryDocumentStore
6
7# Initialize store and index some documents
8doc_store = InMemoryDocumentStore()
9doc_store.write_documents([{"content": "Haystack is an open-source framework..."}])
10
11pipeline = Pipeline()
12pipeline.add_component("retriever", InMemoryBM25Retriever(doc_store))
13pipeline.add_component("prompt", PromptBuilder(template="Answer based on: {{documents}} \n {{query}}"))
14pipeline.add_component("llm", OpenAIGenerator(model="gpt-4o-mini"))
15
16pipeline.connect("retriever.documents", "prompt.documents")
17pipeline.connect("prompt.prompt", "llm.prompt")
18
19result = pipeline.run({"retriever": {"query": "What is Haystack?"}})
20print(result["llm"]["replies"])

How It Compares

Strengths

Highly flexible and composable directed-graph pipeline architecture.
Extensive integration ecosystem with 70+ vector stores, generators, and tools.
Robust native support for semantic search and complex document retrieval.
First-class managed cloud platform (deepset Cloud) for enterprise deployment.

Trade-offs

Higher learning curve for simple pipelines compared to linear-chain SDKs.
Python-only — lacks a first-class TypeScript SDK.
Migrating from older 1.x architectures to 2.x requires significant refactoring.

Key Features

What the framework gives you out of the box, in plain language.

Multi-Agent
Streaming
Tool Use
Human in the Loop
Memory
Tracing
Evaluations
Self-Hostable
Cloud-Hosted
Type-Safe

Graph-based pipelines
Connect components in a directed graph to build cyclical loops, conditional routing, and complex multi-agent flows.
70+ modular integrations
Easily hook into vector databases, model providers, document parsers, and custom tools with official and community integrations.
Human-in-the-loop validation
Intercept agent tool calls to request human approval, rejection, or parameter modifications before execution.

Where It Shines

The jobs this framework is best suited for.

Enterprise RAG over documents
Build optimized retrieval pipelines over unstructured PDFs, slides, and tables with advanced retrieval and reranking techniques.
Semantic search and QA
Query large document corpora to locate precise answers or semantically relevant sections with high accuracy.
Autonomous web research agents
Assemble agents that call search APIs, scrape content, and self-evaluate their results before answering.

Side-by-Side

Compare Haystack With Another Framework

Add a second or third framework and see stars, downloads, and capabilities lined up next to each other.

Open the Comparator

Related Frameworks

Close alternatives worth a look before you decide.

LangChain

Composable building blocks for LLM apps — chains, agents, retrievers, and integrations.

Composable LLM building blocks

Stars

140.3K

npm / wk

2.4M

PyPI / mo

312.4M

MixedMITLast commit:1 wk ago

LlamaIndex

Data-grounded agents and RAG pipelines, with deep indexing primitives.

Data-grounded RAG agents

Stars

50.5K

npm / wk

—

PyPI / mo

7.5M

MixedMITLast commit:1 wk ago

CrewAI

Multi-agent crews with role-based prompts and explicit task hand-offs.

Role-based multi-agent crews

Stars

54.5K

npm / wk

—

PyPI / mo

11.1M

PythonMITLast commit:1 wk ago

Frequently Asked Questions

What Is an Agent Framework?

Is Haystack open source?

Haystack ships under the apache-2 license. The source code lives on GitHub, so you can read it, fork it, and run it on your own infrastructure if your team prefers self-hosting.

Which language is Haystack built in?

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

Haystack

Overview

Architecture and Programming Model

Key Features and Capabilities

Real-World Use Cases

Getting Started With Haystack

How It Compares

Strengths

Trade-offs

Key Features

Graph-based pipelines

70+ modular integrations

Human-in-the-loop validation

Where It Shines

Enterprise RAG over documents

Semantic search and QA

Autonomous web research agents

Compare Haystack With Another Framework

Related Frameworks

LangChain

LlamaIndex

CrewAI

Frequently Asked Questions

What Is an Agent Framework?

Is Haystack open source?

Which language is Haystack built in?

The AI Build Report

Haystack

Overview

Architecture and Programming Model

Key Features and Capabilities

Real-World Use Cases

Getting Started With Haystack

How It Compares

Strengths

Trade-offs

Key Features

Graph-based pipelines

70+ modular integrations

Human-in-the-loop validation

Where It Shines

Enterprise RAG over documents

Semantic search and QA

Autonomous web research agents

Compare Haystack With Another Framework

Related Frameworks

LangChain

LlamaIndex

CrewAI

Frequently Asked Questions

What Is an Agent Framework?

Is Haystack open source?

Which language is Haystack built in?

The AI Build Report