Model APIs

Responses API vs Chat Completions: choose the right OpenAI API path

Compare OpenAI Responses API and Chat Completions for new apps, agent workflows, tool use, conversation state, structured outputs, file search, web search, and migration planning.

Updated 2026-06-119 min readIntermediate

Read Assistants migration guide Read structured outputs guide

AI Buyer Readiness Scorecard

Turn this guide into procurement, security, ROI, rollout, and governance questions.

Use the scorecard before opening vendor pricing pages. It keeps commercial AI research tied to the workflow, data risk, operating cost, and evidence buyers need before a shortlist becomes a purchase.

Procurement trigger

Define the business event behind the search: budget review, renewal, security review, failed pilot, new workflow, or vendor consolidation.

Data and security review

Check whether prompts, files, logs, embeddings, customer records, regulated data, or source code will touch the AI system.

ROI and operating cost

Estimate seat cost, API usage, implementation time, review effort, support load, fallback work, and expected workflow savings.

Integration and rollout path

Map the tools, identity systems, data sources, approval steps, change management, and users needed for a real deployment.

Governance evidence

Collect policies, evals, audit logs, human review rules, incident response, vendor terms, and owner names before procurement asks.

Best for

Developers starting a new OpenAI integration
Teams deciding whether to migrate from Chat Completions
Product engineers adding tools, file search, web search, or agents
Engineering managers estimating migration risk and API surface changes

Not for

A line-by-line migration script for every codebase
Ignoring regression tests when changing API primitives
Assuming a simple endpoint swap covers tool calls, state, and output handling

Comparison

Choose by workflow, not brand

Option	Best for	Strengths	Tradeoffs	Use when
Responses API	New apps, agentic workflows, typed items, built-in tools, file search, web search, and multimodal use	A newer unified API path with typed input and output items and better alignment with OpenAI's agent-building direction.	Existing chat code may need changes to message mapping, state management, logging, and tool handling.	You are building or refactoring a serious OpenAI integration today.
Chat Completions	Stable legacy chat apps, simple message generation, and code paths that do not need new tools	Familiar, widely used, and often enough for simple chat or generation flows.	Less aligned with newer Responses API features and agent primitives.	The app is stable, low-risk, and migration has not yet been prioritized.
Dual-path migration	Production systems that need side-by-side testing before switching traffic	Lets teams compare quality, latency, cost, state handling, and output contracts before cutover.	Requires extra logging, test harnesses, and a temporary compatibility layer.	The API powers revenue, customer support, or critical internal workflows.

What actually changes

The migration is not only an endpoint name change. Chat Completions uses message arrays as the central mental model. Responses uses typed input and output items, which can include messages, reasoning, function calls, and tool outputs.

Map messages to input and output items instead of assuming one message format everywhere.
Review how your app stores conversation state and prompt history.
Update logs and tracing so developers can debug typed output items.

Use Responses for tools

If the product uses file search, web search, remote MCP servers, function calling, or agent-like workflows, Responses is the cleaner long-term API path.

Test tool calls and tool outputs separately from final answers.
Keep permissions, approvals, and business rules outside the model.
Add eval cases for missing tools, failed tools, and malformed tool arguments.

Migrate with evidence

A safe migration compares the current production behavior against the new path on realistic prompts. The goal is not only to compile but to preserve or improve quality, latency, cost, and observability.

Run old and new paths on the same eval set.
Track output schema validity, cost, latency, and user-visible answer quality.
Cut traffic gradually and keep rollback simple until confidence is high.

Decision Rules

A practical checklist

Use Responses API for new OpenAI projects.

Keep Chat Completions temporarily for stable legacy apps without tool needs.

Move sooner when you need file search, web search, agents, multimodal, or richer output items.

Migrate behind evals, traces, and a rollback plan for production workflows.

Related Guides

Continue the decision path

Read Assistants migration guide

Plan migration before the Assistants API sunset.

Open

Read structured outputs guide

Use schema-first responses after moving API paths.

Open

Assistants API migration guide

Handle the higher-risk migration away from Assistants API.

Open

OpenAI vs Anthropic API

Compare provider strategy after choosing the OpenAI API path.

Open

Structured outputs guide

Make Responses output reliable for downstream code.

Open

Chinese Archive

Aligned deeper reading

AI agent archive

Chinese agent and tool-calling implementation notes.

Open

AI product archive

Chinese AI product migration and workflow notes.

Open

Topic Hubs

Explore the wider search cluster

Topic hub

RAG and models

Plan RAG systems, local LLM deployment, model APIs, cloud AI platforms, vector databases, evaluation, observability, rate limits, and cost optimization.

Open

FAQ

Common questions

Should I use Responses API or Chat Completions for a new app?

Use Responses API for new OpenAI apps unless you have a specific legacy constraint. It is better aligned with tools, agents, typed items, and newer platform direction.

Is migration from Chat Completions difficult?

Simple text generation can be straightforward, but apps with tools, state, streaming, logging, or structured outputs need careful mapping and regression tests.

Does previous_response_id make conversation state free?

No. Conversation state still has cost and design implications. Use provider documentation and your own traces to understand billing, context, and truncation behavior.

Source Links

Primary references used for this guide

Reference

Migrate to the Responses API

Official OpenAI migration guide from Chat Completions to Responses.

Open

Reference

OpenAI conversation state

Official OpenAI documentation for managing conversation state.

Open

Reference

OpenAI tools guide

Official OpenAI documentation for tools, web search, file search, MCP, and function calling.

Open

Build your own evaluation note

The strongest decision is always local to your workflow. Save the vendor links, define a representative task, record the exact prompt or command, and compare the final evidence instead of the marketing claim.

Return to the AI learning map