OpenAI

OpenAI Agents SDK

Name: OpenAI Agents SDK
Author: OpenAI

OpenAI's production-ready agent SDK with tracing, handoffs, and structured outputs.

Production agents on OpenAI models

Visit Site View on GitHub Read the Docs

GitHub Stars

26.4K

Contributors

272

npm / Week

—

PyPI / Month

29.8M

Maintained by: OpenAI
First released: Mar 2025
Last commit: 2 days ago
Language: Python
License: MIT

Overview

OpenAI Agents SDK is a lightweight, open-source Python and TypeScript framework for building production-grade multi-agent applications. Released in March 2025 under the MIT license, it is maintained by OpenAI and serves as the official successor to the experimental Swarm project. With over 26,000 GitHub stars, 272 contributors, and nearly 30 million monthly PyPI downloads, it has quickly become one of the most adopted agent frameworks in the ecosystem.

The SDK targets a specific gap: teams that want to move beyond single-turn LLM calls into multi-step, tool-using agent workflows without drowning in abstraction layers. Its design philosophy is minimalism. It provides exactly four core primitives — Agent, Runner, Handoff, and Guardrail — and leaves everything else to standard Python control flow. This makes it a natural fit for engineers who prefer explicit composition over declarative DSLs or graph-based orchestration.

In the agent framework landscape, OpenAI Agents SDK competes directly with LangGraph, CrewAI, and AutoGen. Where LangGraph emphasizes graph-based state machines and CrewAI focuses on role-based teams, OpenAI Agents SDK leans into lightweight, code-first orchestration with deep integration into OpenAI’s model ecosystem and observability stack. It is built by the same team that delivers GPT-4o and the Responses API, which means first-class access to OpenAI-specific features like structured outputs, streaming, and built-in tracing in the OpenAI dashboard.

Architecture and Programming Model

The programming model is imperative and Python-native. You define agents as objects, decorate functions as tools, and orchestrate execution using ordinary if/else blocks, loops, and function calls. There is no graph definition, no YAML config files, and no abstract workflow engine between you and the LLM.

Core abstractions:

Agent: An LLM configured with a system instruction, a list of tools, optional handoff targets, guardrails, and model settings. You instantiate an agent and then run it with a Runner.
Runner: The execution loop that handles tool calls, manages conversation history, and returns the final response. It accepts an agent and an input string or a list of messages.
Handoff: A mechanism that lets an agent delegate to another agent for a specific sub-task, preserving the full conversation context. Handoffs are defined declaratively on the source agent and handled automatically by the runner.
Guardrail: Input or output validation that runs in parallel with agent execution. If a guardrail fails, execution halts immediately with a configurable error message.
Tool: Any Python function decorated with @function_tool becomes a tool with automatic Pydantic-based input validation and JSON schema generation. Tools can also be MCP servers or OpenAI-hosted tools.

Control flow is straightforward: you call Runner.run(agent, input) and the SDK loops — invoking tools, sending results back to the LLM, and repeating until the LLM produces a final output or hits a guardrail. There is no support for cyclic graphs or conditional branching inside the runner; you implement that logic in your own Python code by checking outputs and calling different agents.

The SDK is provider-agnostic in principle — it supports the OpenAI Responses API, Chat Completions API, and over 100 third-party models through the openai Python client. In practice, the best experience is with OpenAI models because tracing, structured outputs, and streaming optimizations are tested and tuned for those endpoints.

Key Features and Capabilities

Multi-Agent Handoffs are a first-class feature. Instead of building a router system from scratch, you define handoffs directly on the agent. When the LLM determines it needs a specialist, the runner transparently transfers control to the target agent with full conversation history. This makes patterns like customer support triage (greeter -> billing -> technical support) a few lines of code.

Built-in Tracing is enabled by default. Every agent run, tool call, guardrail check, and handoff is recorded and visible in the OpenAI dashboard. You can inspect step-by-step execution, latency breakdowns, token usage, and error paths without setting up any external observability tool. For teams already using OpenAI, this eliminates the need to integrate a third-party tracing solution for agent debugging.

Guardrails run input and output checks in parallel with agent execution. They are defined as Python functions that return a GuardrailResult with a pass/fail status and optional error message. This allows you to block unsafe content, enforce format constraints, or validate business rules before the model response reaches the user.

Streaming is supported through Runner.run_streamed(), which yields intermediate events (tool calls, partial text, handoffs) as they happen. This is critical for real-time user experiences like chat interfaces or live dashboards.

Type Safety comes from Pydantic integration. Tool schemas are inferred from Python type hints, and structured outputs can be enforced using output_type on the agent. The TypeScript SDK mirrors this approach with Zod.

Self-Hostable and Cloud-Hosted: The Python package is installable via pip and runs anywhere Python runs. The tracing backend defaults to OpenAI’s cloud, but you can point it to a self-hosted endpoint if needed. The April 2026 update added sandbox agents that run inside isolated containers for long-running tasks.

Real-World Use Cases

Customer-Facing Assistants are the primary use case. Teams deploy GPT-4o agents that handle initial inquiries and hand off to specialist agents (e.g., refunds, technical support) when needed. The built-in tracing gives product teams visibility into failure modes and conversation flow.

Internal Copilots on OpenAI are another common pattern. Companies build agents that answer internal questions about company policies, codebases, or datasets, using guardrails to keep responses on-brand and within compliance boundaries. The SDK’s low overhead makes it easy to spin up a new copilot for each department without heavy scaffolding.

Lightweight Agent Prototypes benefit from the minimal boilerplate. If you need a single agent with a few tools and no multi-agent orchestration, the SDK is often the fastest path to a working prototype compared to frameworks that impose graph or team abstractions.

Code Generation Pipelines: Developers use the SDK to build agents that write code, run it in a sandbox, evaluate the output, and iterate. The sandbox agents feature (April 2026) enables this without exposing the host system.

Poor Fit Cases: Frameworks with deep graph-based reasoning (LangGraph) or role-based crew management (CrewAI) are better for complex, non-linear workflows that require cycles, conditional branching, or human-in-the-loop at every step. OpenAI Agents SDK assumes explicit linear or tree-like delegation.

Getting Started With OpenAI Agents SDK

Install the Python package:

1pip install openai-agents

Set your OpenAI API key as the OPENAI_API_KEY environment variable.

The smallest meaningful example:

1from agents import Agent, Runner
2
3agent = Agent(
4    name="Assistant",
5    instructions="You are a helpful assistant.",
6)
7
8result = Runner.run_sync(agent, "What is the capital of France?")
9print(result.final_output)

This creates an agent with no tools, runs it once, and prints the output. To add a tool, define a function with @function_tool and pass it in the tools list.

Prerequisites: An OpenAI API key or an endpoint for a compatible model. No vector store or external observability tool is required for basic use, but tracing requires internet access to OpenAI’s dashboard.

Documentation lives at [openai.github.io/openai-agents-python](https://openai.github.io/openai-agents-python/). The GitHub repository at [github.com/openai/openai-agents-python](https://github.com/openai/openai-agents-python) contains examples and the TypeScript version.

How It Compares

vs LangChain/LangGraph: LangGraph excels at cyclic, state-machine-like workflows with branching and conditional transitions. OpenAI Agents SDK is far simpler for linear or tree-like delegation. If you need a DAG or dynamic routing based on intermediate results, LangGraph gives you more control. If you want minimal setup for a straightforward multi-agent pipeline, OpenAI Agents SDK wins on developer velocity.

vs CrewAI: CrewAI provides role-based teams with built-in task assignments and process definitions. It is opinionated about how agents collaborate (sequential, hierarchical, etc.). OpenAI Agents SDK is more flexible but requires you to write the coordination logic. For teams that want “set up a team and let it figure out the plan,” CrewAI may be easier. For teams that want deterministic, fine-grained control over handoffs and tool selection, OpenAI Agents SDK is the better fit.

vs AutoGen: AutoGen is agent-centric with strong support for inter-agent conversation patterns and human-in-the-loop. It has a larger feature surface but a steeper learning curve. OpenAI Agents SDK is intentionally smaller and faster to learn, but offers less built-in support for conversation management and multi-turn multi-agent dialogue.

The bottom line: Choose OpenAI Agents SDK when your workflow is roughly linear or tree-shaped, you already use OpenAI models, and you want a framework that stays out of your way. Choose alternatives when you need non-linear graph traversal, role-based assignment, or extensive third-party integrations.

Strengths

Minimal, well-documented API from the model provider.
Built-in tracing in the OpenAI dashboard.
Handoffs make multi-agent flows feel native.
TypeScript SDK available alongside Python.

Trade-offs

Optimised for OpenAI models — other providers work but feel second-class.
Newer than LangGraph and AutoGen for complex orchestration.

Key Features

What the framework gives you out of the box, in plain language.

Multi-Agent
Streaming
Tool Use
Human in the Loop
Memory
Tracing
Evaluations
Self-Hostable
Cloud-Hosted
Type-Safe

Handoffs
First-class multi-agent transfer with preserved context.
Built-in tracing
Every run shows up in the OpenAI dashboard with full step traces.
Guardrails
Run input and output checks in parallel to block unsafe content.

Where It Shines

The jobs this framework is best suited for.

Customer-facing assistants
Production agents using GPT models, with handoffs to specialist agents.
Internal copilots on OpenAI
Agents that use guardrails to keep outputs on-brand and on-topic.
Lightweight agent prototypes
Minimal boilerplate when you only need a single agent with tools.

Side-by-Side

Compare OpenAI Agents SDK With Another Framework

Add a second or third framework and see stars, downloads, and capabilities lined up next to each other.

Open the Comparator

Related Frameworks

Close alternatives worth a look before you decide.

LangChain

Composable building blocks for LLM apps — chains, agents, retrievers, and integrations.

Composable LLM building blocks

Stars

137.0K

npm / wk

2.2M

PyPI / mo

241.8M

MixedMITLast commit:Today

LangGraph

Stateful, graph-based agent workflows with first-class human-in-the-loop.

Complex, stateful agent graphs

Stars

32.3K

npm / wk

—

PyPI / mo

49.0M

PythonMITLast commit:4 days ago

CrewAI

Multi-agent crews with role-based prompts and explicit task hand-offs.

Role-based multi-agent crews

Stars

51.6K

npm / wk

—

PyPI / mo

9.6M

PythonMITLast commit:2 days ago

AutoGen

Conversational multi-agent simulations and orchestration from Microsoft Research.

Conversational multi-agent simulations

Stars

58.1K

npm / wk

—

PyPI / mo

1.5M

PythonMITLast commit:1 mo ago

Frequently Asked Questions

What Is an Agent Framework?

An agent framework is the code your team uses to wire large language models into tools, memory, and human checkpoints. It is the connective tissue between an LLM call and a real task, like answering a support ticket or running a multi-step research workflow.

Is OpenAI Agents SDK open source?

OpenAI Agents SDK ships under the MIT license. The source code lives on GitHub, so you can read it, fork it, and run it on your own infrastructure if your team prefers self-hosting.

Which language is OpenAI Agents SDK built in?

OpenAI Agents SDK is primarily a Python project. Pick a framework that matches the language your team already ships in. The cost of a stack switch is almost always higher than the difference between two frameworks.

Need Help Adopting OpenAI Agents SDK?

We help teams stand up production agents with the right framework for their stack, on a money-back basis if we cannot show ROI.

OpenAI

OpenAI Agents SDK

OpenAI's production-ready agent SDK with tracing, handoffs, and structured outputs.

Production agents on OpenAI models

Visit Site View on GitHub Read the Docs

GitHub Stars

26.4K

Contributors

272

npm / Week

—

PyPI / Month

29.8M

Maintained by: OpenAI
First released: Mar 2025
Last commit: 2 days ago
Language: Python
License: MIT

Overview

Architecture and Programming Model

Core abstractions:

Agent: An LLM configured with a system instruction, a list of tools, optional handoff targets, guardrails, and model settings. You instantiate an agent and then run it with a Runner.
Runner: The execution loop that handles tool calls, manages conversation history, and returns the final response. It accepts an agent and an input string or a list of messages.
Handoff: A mechanism that lets an agent delegate to another agent for a specific sub-task, preserving the full conversation context. Handoffs are defined declaratively on the source agent and handled automatically by the runner.
Guardrail: Input or output validation that runs in parallel with agent execution. If a guardrail fails, execution halts immediately with a configurable error message.
Tool: Any Python function decorated with @function_tool becomes a tool with automatic Pydantic-based input validation and JSON schema generation. Tools can also be MCP servers or OpenAI-hosted tools.

Key Features and Capabilities

Real-World Use Cases

Getting Started With OpenAI Agents SDK

Install the Python package:

1pip install openai-agents

Set your OpenAI API key as the OPENAI_API_KEY environment variable.

The smallest meaningful example:

1from agents import Agent, Runner
2
3agent = Agent(
4    name="Assistant",
5    instructions="You are a helpful assistant.",
6)
7
8result = Runner.run_sync(agent, "What is the capital of France?")
9print(result.final_output)

This creates an agent with no tools, runs it once, and prints the output. To add a tool, define a function with @function_tool and pass it in the tools list.

How It Compares

Strengths

Minimal, well-documented API from the model provider.
Built-in tracing in the OpenAI dashboard.
Handoffs make multi-agent flows feel native.
TypeScript SDK available alongside Python.

Trade-offs

Optimised for OpenAI models — other providers work but feel second-class.
Newer than LangGraph and AutoGen for complex orchestration.

Key Features

What the framework gives you out of the box, in plain language.

Multi-Agent
Streaming
Tool Use
Human in the Loop
Memory
Tracing
Evaluations
Self-Hostable
Cloud-Hosted
Type-Safe

Handoffs
First-class multi-agent transfer with preserved context.
Built-in tracing
Every run shows up in the OpenAI dashboard with full step traces.
Guardrails
Run input and output checks in parallel to block unsafe content.

Where It Shines

The jobs this framework is best suited for.

Customer-facing assistants
Production agents using GPT models, with handoffs to specialist agents.
Internal copilots on OpenAI
Agents that use guardrails to keep outputs on-brand and on-topic.
Lightweight agent prototypes
Minimal boilerplate when you only need a single agent with tools.

Side-by-Side

Compare OpenAI Agents SDK With Another Framework

Add a second or third framework and see stars, downloads, and capabilities lined up next to each other.

Open the Comparator

Related Frameworks

Close alternatives worth a look before you decide.

LangChain

Composable building blocks for LLM apps — chains, agents, retrievers, and integrations.

Composable LLM building blocks

Stars

137.0K

npm / wk

2.2M

PyPI / mo

241.8M

MixedMITLast commit:Today

LangGraph

Stateful, graph-based agent workflows with first-class human-in-the-loop.

Complex, stateful agent graphs

Stars

32.3K

npm / wk

—

PyPI / mo

49.0M

PythonMITLast commit:4 days ago

CrewAI

Multi-agent crews with role-based prompts and explicit task hand-offs.

Role-based multi-agent crews

Stars

51.6K

npm / wk

—

PyPI / mo

9.6M

PythonMITLast commit:2 days ago

AutoGen

Conversational multi-agent simulations and orchestration from Microsoft Research.

Conversational multi-agent simulations

Stars

58.1K

npm / wk

—

PyPI / mo

1.5M

PythonMITLast commit:1 mo ago

Frequently Asked Questions

What Is an Agent Framework?

Is OpenAI Agents SDK open source?

OpenAI Agents SDK ships under the MIT license. The source code lives on GitHub, so you can read it, fork it, and run it on your own infrastructure if your team prefers self-hosting.

Which language is OpenAI Agents SDK built in?

Need Help Adopting OpenAI Agents SDK?

We help teams stand up production agents with the right framework for their stack, on a money-back basis if we cannot show ROI.

OpenAI Agents SDK

Overview

Architecture and Programming Model

Key Features and Capabilities

Real-World Use Cases

Getting Started With OpenAI Agents SDK

How It Compares

Strengths

Trade-offs

Key Features

Handoffs

Built-in tracing

Guardrails

Where It Shines

Customer-facing assistants

Internal copilots on OpenAI

Lightweight agent prototypes

Compare OpenAI Agents SDK With Another Framework

Related Frameworks

LangChain

LangGraph

CrewAI

AutoGen

Frequently Asked Questions

What Is an Agent Framework?

Is OpenAI Agents SDK open source?

Which language is OpenAI Agents SDK built in?

Need Help Adopting OpenAI Agents SDK?

OpenAI Agents SDK

Overview

Architecture and Programming Model

Key Features and Capabilities

Real-World Use Cases

Getting Started With OpenAI Agents SDK

How It Compares

Strengths

Trade-offs

Key Features

Handoffs

Built-in tracing

Guardrails

Where It Shines

Customer-facing assistants

Internal copilots on OpenAI

Lightweight agent prototypes

Compare OpenAI Agents SDK With Another Framework

Related Frameworks

LangChain

LangGraph

CrewAI

AutoGen

Frequently Asked Questions

What Is an Agent Framework?

Is OpenAI Agents SDK open source?

Which language is OpenAI Agents SDK built in?

Need Help Adopting OpenAI Agents SDK?