Overview
Command R+ is Cohere's flagship model, designed from the ground up for enterprise retrieval-augmented generation (RAG) workflows. Released in April 2024, it is not trying to win headline benchmark competitions — it is built for the specific, practical challenge that most enterprise AI deployments face: grounding model responses in real documents, preventing hallucinations, and delivering accurate, cited answers from large document collections.
Built for Enterprise RAG
Most frontier models are general-purpose; Command R+ is purpose-built for RAG. Cohere has optimised the model specifically for:
- Grounded generation: Producing answers that are derived from and attributed to the documents provided, rather than the model's parametric memory.
- Citation generation: Including inline citations that point to specific documents and passages, enabling users and auditors to verify every claim.
- Multi-hop retrieval reasoning: Combining information from multiple retrieved documents to answer questions that no single document can answer alone.
- Faithfulness to source material: Resisting the tendency to hallucinate or blend retrieved content with ungrounded model knowledge.
This is the capability profile that enterprises actually need for legal document review, medical literature search, technical knowledge bases, and compliance-sensitive deployments.
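The grounding-and-citation contract described above can be sketched as a small verification step: given a generated answer, its citations, and the documents that were supplied, confirm that every cited span is in-bounds and points at a real document. The `Citation` shape below is illustrative only, loosely modeled on the start/end/document-id fields that citation-producing APIs return; it is a sketch of the idea, not Cohere's SDK.

```python
from dataclasses import dataclass

@dataclass
class Citation:
    # Character span in the generated answer, plus the source documents backing it.
    start: int
    end: int
    document_ids: list

def verify_citations(answer: str, citations: list, documents: dict) -> bool:
    """Return True only if every citation span is in-bounds and every
    referenced document ID exists in the supplied document set."""
    for c in citations:
        if not (0 <= c.start < c.end <= len(answer)):
            return False
        if not all(doc_id in documents for doc_id in c.document_ids):
            return False
    return True

documents = {
    "doc_0": "The refund window is 30 days.",
    "doc_1": "Refunds require a receipt.",
}
answer = "Refunds are available within 30 days and require a receipt."
citations = [
    Citation(start=29, end=36, document_ids=["doc_0"]),  # "30 days"
    Citation(start=41, end=58, document_ids=["doc_1"]),  # "require a receipt"
]
print(verify_citations(answer, citations, documents))  # → True
```

A check like this is what makes cited generation auditable: a claim whose span or document ID fails verification can be flagged rather than trusted.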
The Cohere Ecosystem: Embed + Rerank + Generate
Command R+ is designed to work natively with the full Cohere retrieval stack, which is one of the most mature in the industry:
- Cohere Embed: State-of-the-art text embeddings for vector search. Documents are embedded into a vector store (Pinecone, Weaviate, Qdrant, etc.).
- Cohere Rerank: A cross-encoder reranking model that re-scores retrieved candidates for relevance before passing them to the generator. This step dramatically improves the quality of what Command R+ receives.
- Command R+: Generates a grounded answer from the reranked, retrieved context — with citations.
This pipeline (embed → retrieve → rerank → generate, with retrieval itself performed by the vector store) represents best practice for enterprise RAG, and Command R+ is the only major generative model co-designed with the full retrieval stack by a single vendor.
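The shape of that pipeline can be sketched with toy stand-ins: a bag-of-words "embedding" in place of Cohere Embed, and a term-overlap scorer in place of Cohere Rerank. Everything here is illustrative; a real deployment would call the actual embedding and reranking models.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; stands in for a real embedding model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, docs: list, k: int = 3) -> list:
    # Stage 1-2: embed the query and pull the top-k nearest documents.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def rerank(query: str, candidates: list, k: int = 2) -> list:
    # Stage 3: re-score each (query, candidate) pair jointly; stands in
    # for a cross-encoder reranker.
    q_terms = set(query.lower().split())
    return sorted(candidates,
                  key=lambda d: len(q_terms & set(d.lower().split())),
                  reverse=True)[:k]

docs = [
    "Command R+ supports a 128K context window.",
    "The office cafeteria opens at 8 am.",
    "Cohere Rerank re-scores retrieved candidates.",
]
query = "What context window does Command R+ support?"
context = rerank(query, retrieve(query, docs))
print(context[0])  # → "Command R+ supports a 128K context window."
```

The final `context` list is what a generator like Command R+ would receive as grounding documents; the rerank stage exists because nearest-neighbor retrieval alone often surfaces near-misses.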
128K Context Window
With a 128,000-token context window, Command R+ can ingest substantial amounts of retrieved context in a single pass — dozens of full documents, lengthy contract sections, or extensive knowledge base articles. This reduces the need for the aggressive chunking and retrieval optimisation that smaller-context models require.
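Even with a large window, retrieved documents must fit inside a token budget. A minimal sketch of that packing step, using a rough ~4-characters-per-token heuristic (an assumption for illustration; a real deployment should count tokens with the model's actual tokenizer):

```python
def pack_context(documents, budget_tokens=128_000, est_tokens=lambda t: len(t) // 4):
    """Greedily pack documents into the context window until the token
    budget would be exceeded. The 4-chars-per-token estimate is a rough
    heuristic, not an exact count."""
    packed, used = [], 0
    for doc in documents:
        cost = est_tokens(doc)
        if used + cost > budget_tokens:
            break  # stop at the first document that would overflow the budget
        packed.append(doc)
        used += cost
    return packed, used

# ~100K, ~25K, and ~5K estimated tokens respectively.
docs = ["A" * 400_000, "B" * 100_000, "C" * 20_000]
packed, used = pack_context(docs)
print(len(packed), used)  # → 2 125000
```

With smaller-context models this packing loop triggers far earlier, forcing finer chunking and more rounds of retrieval.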
Multilingual Support
Command R+ is trained with strong multilingual capabilities, supporting ten languages at high quality: English, French, Spanish, Italian, German, Portuguese, Japanese, Korean, Arabic, and Chinese. Enterprise RAG applications frequently span multiple languages — internal documents in one language, user queries in another — and Command R+ handles this natively.
Function Calling
Beyond RAG, Command R+ supports function calling for agentic workflows — connecting to APIs, querying databases, and executing multi-step tasks. This enables compound AI systems where retrieval and tool use are combined in a single model.
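The application-side half of function calling is a dispatch loop: the model emits a structured tool call, the application executes it, and the result is fed back for the next model turn. The sketch below assumes a simple JSON call format and a hypothetical `lookup_order` tool purely for illustration; the actual wire format is defined by the API you deploy against.

```python
import json

# Hypothetical tool registry. In a real system these tools would be
# described to the model, which then decides when to call them.
TOOLS = {
    "lookup_order": lambda order_id: {"order_id": order_id, "status": "shipped"},
}

def run_tool_call(call_json: str) -> dict:
    """Dispatch a model-emitted call of the assumed form
    {"name": ..., "parameters": {...}} and return the tool's result,
    which would be passed back to the model on the next turn."""
    call = json.loads(call_json)
    tool = TOOLS[call["name"]]
    return tool(**call["parameters"])

result = run_tool_call('{"name": "lookup_order", "parameters": {"order_id": "A-17"}}')
print(result)  # → {'order_id': 'A-17', 'status': 'shipped'}
```

In a compound system, the same loop interleaves retrieval with tool calls, so one model can both cite documents and act on external systems.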
Deployment and Availability
- Cohere API: Direct access with pay-per-token pricing.
- Microsoft Azure AI: Available as a managed deployment on Azure, useful for organisations standardised on Microsoft's cloud.
- Amazon Bedrock: Available through AWS's managed AI model service.
- Self-hosted: Cohere offers private deployment options for enterprises with data residency requirements.
Best Use Cases
- Enterprise knowledge bases: Internal Q&A systems that must answer from company documents with citations.
- Legal and compliance review: Grounded analysis of contracts, regulations, and case law with source attribution.
- Medical and scientific literature search: Evidence-based answers from clinical documents, research papers, and guidelines.
- Customer support: Grounded responses drawn from product documentation and support knowledge bases, without hallucinated or unsupported claims.
- Regulatory compliance: Answering questions about policy documents where accuracy and citation are non-negotiable.