Outlines is a Python library for guaranteed structured LLM output. With standard LLMs, you prompt for JSON and sometimes get invalid JSON, extra text, or schema violations. Outlines uses constrained generation — it intercepts the LLM's token probabilities and sets probabilities to zero for tokens that would violate your schema. The result is mathematically guaranteed valid output according to your Pydantic model or JSON schema.

Outlines is completely free and open source (Apache 2.0 license). It runs locally with open-source models via transformers. For commercial APIs (OpenAI, Anthropic), structured output features may be handled differently — some providers now offer native JSON mode, but Outlines remains the most reliable approach for local models.

How does constrained generation work?

During LLM token generation, each step the model produces a probability distribution over all vocabulary tokens. Outlines analyzes your schema and, at each generation step, masks tokens that would lead to invalid output. For example, when generating a JSON object with an 'age' field of type integer, Outlines prevents the model from generating non-digit characters at that position. This is done via finite-state machine matching, not regex post-processing.

When should I use Outlines vs. OpenAI's JSON mode?

OpenAI's JSON mode reduces invalid JSON but doesn't guarantee schema compliance. Outlines guarantees both valid JSON and schema conformance for every single output. For local models (Llama, Mistral via transformers or vLLM), Outlines is the primary structured output solution. For OpenAI API, use Outlines with instructor or OpenAI's structured outputs feature (which uses a similar constrained decoding approach).

Outlines | db.fyi

Why it matters

Mathematically guaranteed valid output eliminates an entire class of production bugs: JSON parse errors, schema validation failures, and missing required fields.
Works with local models (Llama, Mistral, Phi) where you have full control over the generation process — not limited to commercial API JSON modes.
Pydantic model integration is Pythonic and concise — define your data model once, use it for both LLM output and application logic.
9K+ GitHub stars shows widespread adoption among teams building structured data extraction, form filling, and classification pipelines.

Key capabilities

Pydantic model output: Define a Pydantic class; Outlines guarantees LLM output can be parsed into that class.
JSON schema constraint: Provide a JSON schema; output is guaranteed to be valid JSON matching the schema.
Regex constraint: Force output to match a regex pattern — useful for extracting formatted data (dates, IDs, codes).
Grammar constraint: Define a context-free grammar; output is guaranteed to conform to the grammar.
Choice constraint: Limit output to one of a predefined set of options — perfect for classification tasks.
vLLM integration: Outlines integrates with vLLM for high-throughput structured generation at scale.
Multiple backends: transformers, llamacpp, mlx-lm (Apple Silicon); commercial APIs via instructor.

Technical notes

License: Apache 2.0 (open source)
GitHub: github.com/dottxt-ai/outlines (9K+ stars)
Install: pip install outlines
Backends: Hugging Face transformers, llamacpp, mlx-lm, vLLM (server)
Schema formats: Pydantic models, JSON schema, regex, EBNF grammars
Python versions: 3.8+
Company: .txt (dottxt); created by the team behind the structured generation research

Ideal for

Teams building data extraction pipelines where LLM output must conform to strict schemas (invoices, forms, medical records).
Classification and routing systems where the LLM must output one of a fixed set of labels.
Developers using local open-source models who need structured output without OpenAI's JSON mode.

Not ideal for

OpenAI API users who just need basic JSON — OpenAI's native structured outputs feature is simpler.
Free-form text generation tasks where structure isn't needed — Outlines adds overhead for unconstrained use cases.
Teams using cloud-only APIs without local model access (Outlines' strongest features apply to local model generation).

Outlines

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also

FAQ

Alternatives

Integrations

Built on

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also

Outlines

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also

FAQ

What is Outlines?

Is Outlines free?

How does constrained generation work?

When should I use Outlines vs. OpenAI's JSON mode?

Alternatives

Integrations

Built on

Related tools

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also