Why it matters
- Access Claude, GPT-4o, Llama 3, Mistral, Gemini, and 100+ other models through one endpoint with one API key.
- Often cheaper than direct provider pricing — OpenRouter negotiates volume discounts and passes savings through.
- Model comparison built-in: see real-time throughput, latency, and cost-per-token for each model to make informed routing decisions.
- OpenAI-compatible API means zero code changes — just swap the base URL.
Key capabilities
- 100+ model catalog: Claude 3.5 Sonnet, GPT-4o, Llama 3.1, Mistral Large, Gemini Pro, Qwen, DeepSeek, and more.
- OpenAI-compatible API: `POST /v1/chat/completions` works with any OpenAI SDK client.
- Pay-per-use: No subscriptions — purchase credits and pay per million tokens, priced per model.
- Model routing: Automatic fallback to backup models when primary is down or at rate limits.
- Context window management: Automatically routes to models with sufficient context for your input.
- Free model tier: Selected models (e.g., Llama 3, Mistral) available at no cost, subject to daily rate limits.
- Usage analytics: Track spending per model, per app, and over time from the dashboard.
- Prompt caching: Automatic caching for supported models (e.g., Claude) to cut input-token costs when prompts share long repeated prefixes.
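Because the API surface is OpenAI-compatible, a request can be issued with nothing beyond the Python standard library. The sketch below is illustrative; the API key and model slug are placeholders.

```python
import json
import urllib.request

OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_request(api_key: str, model: str, user_message: str) -> urllib.request.Request:
    """Build an OpenAI-style chat completion request aimed at OpenRouter."""
    payload = {
        # Models are addressed as provider/model-name, e.g. "anthropic/claude-3.5-sonnet".
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        OPENROUTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # OpenRouter key, not a provider key
            "Content-Type": "application/json",
        },
    )

# To actually send it (requires a valid key and network access):
# with urllib.request.urlopen(
#     build_request("sk-or-...", "anthropic/claude-3.5-sonnet", "Hello")
# ) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

The same request shape works through any OpenAI SDK — only the base URL and key change.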
Technical notes
- Base URL: `https://openrouter.ai/api/v1`
- Auth: `Authorization: Bearer <openrouter-api-key>`; model specified in the `model` field as `provider/model-name`
- SDKs: No SDK required; use any OpenAI SDK with a custom `base_url`
- Pricing: Pay-per-token; typically at or below direct provider pricing; free-tier models available
- Rate limits: Varies by model and credit tier
- Provider coverage: Anthropic, OpenAI, Google, Meta, Mistral, Cohere, Together AI, Perplexity, and 20+ providers
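OpenRouter performs failover routing on the server side; for intuition, the pattern it automates looks roughly like the client-side sketch below, where `call_model` is a hypothetical stand-in for one completion request.

```python
def complete_with_fallback(models, prompt, call_model):
    """Try each model slug in order; return (model, completion) from the first success.

    `call_model` is a hypothetical callable that issues one completion request
    and raises RuntimeError on outage or rate limiting.
    """
    last_error = None
    for model in models:
        try:
            return model, call_model(model, prompt)
        except RuntimeError as exc:
            last_error = exc  # model unavailable; fall through to the next one
    raise RuntimeError(f"all models failed: {last_error}")
```

With a list like `["openai/gpt-4o", "meta-llama/llama-3.1-70b-instruct"]`, a rate-limited primary silently falls through to the backup; OpenRouter does the equivalent without any client-side code.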
Ideal for
- Developers who want to experiment with multiple LLMs without managing multiple API keys or provider contracts.
- Startups building on multiple models who want a single invoice and unified usage dashboard.
- Teams needing automatic failover to backup models for production reliability.
Not ideal for
- High-volume production workloads where direct provider pricing is cheaper at scale.
- Organizations with strict data sovereignty requirements: every request transits OpenRouter's infrastructure.
- Teams who need self-hosted LLM routing for compliance reasons — use LiteLLM Proxy instead.
See also
- LiteLLM — Self-hosted open-source alternative for multi-provider LLM routing.
- Portkey — AI gateway with routing, observability, and guardrails as a hosted service.
- Anthropic API — Direct access to Claude models from Anthropic.