Why it matters
- Eliminates session management boilerplate — threads and runs handle conversation history and context window management automatically.
- Code Interpreter enables AI that executes Python and analyzes data — far more powerful than text-only assistants for data, math, and analysis tasks.
- File Search (built-in RAG) lets non-engineers add document knowledge to assistants without building vector databases and retrieval pipelines.
- Function calling with a built-in run loop handles multi-step tool use without building state machines: the run pauses in a requires_action state, your code submits tool outputs, and the assistant continues until the task is done.
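The session-management savings come from the thread/run lifecycle: create an assistant once, park the conversation in a server-side thread, and poll a run per turn. A minimal Python sketch, assuming the openai package is installed and OPENAI_API_KEY is set; the model, instructions, and question are illustrative placeholders, and create_and_poll is a convenience helper in recent openai-python releases:

```python
import os

# Illustrative configuration; model and instructions are placeholders.
ASSISTANT_CONFIG = {
    "model": "gpt-4o-mini",
    "instructions": "You are a concise data analyst.",
    "tools": [{"type": "code_interpreter"}],
}

def ask(question: str) -> str:
    """Create an assistant and thread, run one turn, return the reply text."""
    from openai import OpenAI  # requires `pip install openai`
    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    assistant = client.beta.assistants.create(**ASSISTANT_CONFIG)
    thread = client.beta.threads.create()  # conversation history lives server-side
    client.beta.threads.messages.create(
        thread_id=thread.id, role="user", content=question
    )
    # create_and_poll blocks until the run leaves queued/in_progress
    run = client.beta.threads.runs.create_and_poll(
        thread_id=thread.id, assistant_id=assistant.id
    )
    if run.status != "completed":
        raise RuntimeError(f"run ended with status {run.status}")
    messages = client.beta.threads.messages.list(thread_id=thread.id)
    return messages.data[0].content[0].text.value  # newest message first

if __name__ == "__main__" and os.environ.get("OPENAI_API_KEY"):
    print(ask("What is the mean of [2, 4, 4, 4, 5, 5, 7, 9]?"))
```

Note there is no session store in your code: reusing the same thread.id on later turns is all that is needed to continue the conversation.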
Key capabilities
- Persistent threads: Conversation history stored and managed by OpenAI — no session state in your code.
- Code Interpreter: Execute Python in a sandbox; generate charts, analyze files, perform calculations.
- File Search: Upload PDFs, DOCX, and other files; assistant retrieves relevant content (built-in RAG).
- Function calling: Define functions; assistant calls them; results feed back into conversation.
- Multi-step tool use: Assistant automatically retries and combines tool calls to complete complex tasks.
- Streaming: Stream assistant responses token-by-token via the streaming runs API.
- Model selection: Use any OpenAI model (GPT-4o, GPT-4o-mini) per assistant.
- File management: Upload, store, and reference files across multiple threads and assistants.
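The function-calling capability above is a round trip: when a run needs a tool, its status becomes requires_action, your code executes the named function locally, and submit_tool_outputs feeds the result back so the run can continue. A sketch assuming the openai package; get_ticket_status is a hypothetical local tool, and submit_tool_outputs_and_poll is a helper in recent openai-python releases:

```python
import json

def get_ticket_status(ticket_id: str) -> dict:
    """Hypothetical local tool the assistant may call."""
    return {"ticket_id": ticket_id, "status": "open"}

# JSON Schema tool definition passed when creating the assistant.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_ticket_status",
        "description": "Look up a support ticket by id.",
        "parameters": {
            "type": "object",
            "properties": {"ticket_id": {"type": "string"}},
            "required": ["ticket_id"],
        },
    },
}]

def dispatch(tool_call) -> str:
    """Map one assistant tool call onto the local function; return JSON output."""
    args = json.loads(tool_call.function.arguments)
    if tool_call.function.name == "get_ticket_status":
        return json.dumps(get_ticket_status(**args))
    raise ValueError(f"unknown tool {tool_call.function.name}")

def run_with_tools(client, thread_id: str, assistant_id: str):
    """Drive one run, answering tool calls until it completes."""
    run = client.beta.threads.runs.create_and_poll(
        thread_id=thread_id, assistant_id=assistant_id
    )
    while run.status == "requires_action":
        outputs = [
            {"tool_call_id": c.id, "output": dispatch(c)}
            for c in run.required_action.submit_tool_outputs.tool_calls
        ]
        run = client.beta.threads.runs.submit_tool_outputs_and_poll(
            thread_id=thread_id, run_id=run.id, tool_outputs=outputs
        )
    return run
```

The while loop is what the doc means by multi-step tool use: a single run may pause for tool output several times before completing.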
Technical notes
- API: POST /v1/assistants, /v1/threads, /v1/runs
- SDK: openai.beta.assistants in the Python and Node.js SDKs
- Models: GPT-4o, GPT-4o-mini, GPT-4 Turbo
- Tools: Code Interpreter ($0.03/session), File Search ($0.20/GB/day), Function Calling (free)
- Pricing: Model tokens + tool usage (see above)
- Context: Automatic context management; oldest messages truncated if over limit
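For the streaming runs API noted above, the Python SDK exposes a context-manager helper. A sketch assuming the openai package; text_deltas is a convenience iterator on the SDK's stream object in recent openai-python releases, so check the version you have installed:

```python
def stream_reply(client, thread_id: str, assistant_id: str) -> str:
    """Stream a run token-by-token, printing as it arrives; return the full text."""
    chunks = []
    with client.beta.threads.runs.stream(
        thread_id=thread_id, assistant_id=assistant_id
    ) as stream:
        for delta in stream.text_deltas:  # yields text fragments as they arrive
            chunks.append(delta)
            print(delta, end="", flush=True)
    return "".join(chunks)
```

The same client and thread objects from a polling flow work here; only the run creation call changes.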
Ideal for
- Customer-facing AI assistants that need persistent conversation history across multiple sessions.
- Data analysis applications where users upload files and ask questions about them.
- Multi-step task automation where the AI needs to execute code, call APIs, and iterate on results.
Not ideal for
- Simple single-turn completions — Chat Completions API is faster, cheaper, and simpler.
- Applications requiring sub-second latency — Assistants API has overhead from state management.
- Teams who need full control over RAG, retrieval, and conversation management — build your own with LangChain or LlamaIndex.
See also
- LangChain — Build similar AI agents with more control and customization.
- OpenAI Python SDK — The SDK used to call the Assistants API.
- Vercel AI SDK — TypeScript SDK for building AI chat UIs that can connect to Assistants API.