Why it matters
- Unified API abstracts embedding provider differences — switch from OpenAI to Cohere by changing one parameter, not rewriting integration code.
- Built-in caching prevents re-embedding identical or near-identical text — significant cost reduction for large document corpora with repeated content.
- Batching optimizations handle large document processing efficiently — embed millions of documents without managing API rate limits manually.
- Provider fallbacks prevent single-provider outages from breaking RAG and search pipelines.
Key capabilities
- Multi-provider routing: Access OpenAI, Cohere, and sentence-transformers models through one API endpoint.
- Caching: Avoid re-embedding identical text — serve cached vectors for repeated content.
- Batching: Efficient batch processing for large document corpora.
- Provider fallbacks: Automatic failover if a provider is unavailable or rate-limited.
- Text and image embeddings: Supports both modalities for multimodal search applications.
- Simple SDK: Python and JavaScript clients for quick integration.
- Cost optimization: Route to cheaper models for low-stakes use cases; premium models for accuracy-critical search.
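The routing, caching, and fallback capabilities above can be sketched client-side. The sketch below is illustrative only, not Embedded's actual implementation: the provider callables are stubs, and the exact-match cache is a plain in-memory dict.

```python
import hashlib

# Hypothetical sketch of the unified-API pattern: exact-match caching
# plus ordered provider fallback. Real provider calls are stubbed out.

class EmbeddingRouter:
    def __init__(self, providers):
        # providers: list of (name, callable) pairs, tried in order
        self.providers = providers
        self.cache = {}  # exact-match cache: (model, sha256(text)) -> vector

    def embed(self, text, model="text-embedding-3-small"):
        key = (model, hashlib.sha256(text.encode()).hexdigest())
        if key in self.cache:            # repeated content: serve cached vector
            return self.cache[key]
        last_err = None
        for name, call in self.providers:
            try:
                vector = call(text, model)
                self.cache[key] = vector
                return vector
            except Exception as err:     # provider down or rate-limited: fall through
                last_err = err
        raise RuntimeError("all providers failed") from last_err


# Stub providers simulating an outage on the first, success on the second.
def fake_openai(text, model):
    raise ConnectionError("simulated outage")

def fake_cohere(text, model):
    return [0.1, 0.2, 0.3]  # stand-in vector

router = EmbeddingRouter([("openai", fake_openai), ("cohere", fake_cohere)])
v1 = router.embed("hello world")  # falls back to the second provider
v2 = router.embed("hello world")  # served from cache, no provider call
```

Semantic (near-identical) caching would replace the exact hash key with a similarity lookup over previously embedded text, which is beyond this sketch.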
Technical notes
- Models: OpenAI text-embedding-3-small/large, Cohere Embed v3, sentence-transformers, custom models
- API: REST API with OpenAI-compatible format where possible
- Caching: Semantic and exact-match caching
- Languages: Python SDK, JavaScript SDK, REST
- Output: Float32 vectors; configurable dimensionality for supported models
- Pricing: Free tier; pay-per-embedding for production; Enterprise custom
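Per the notes above, the REST API follows the OpenAI embeddings format where possible. A request body in that format, and the shape of the response it returns, look like the following; the response dict here is a hand-written example of the OpenAI-compatible schema, not output captured from Embedded.

```python
import json

# OpenAI-compatible embeddings request: a model name plus a batch of inputs.
# (Consult the service's docs for the actual base URL and auth header.)
payload = {
    "model": "text-embedding-3-small",
    "input": ["first document", "second document"],  # batched in one call
}
body = json.dumps(payload)

# An OpenAI-compatible response nests float vectors under data[i]["embedding"],
# with "index" mapping each vector back to its position in the input batch.
example_response = {
    "object": "list",
    "data": [
        {"object": "embedding", "index": 0, "embedding": [0.01, -0.02]},
        {"object": "embedding", "index": 1, "embedding": [0.03, 0.04]},
    ],
    "model": "text-embedding-3-small",
}
vectors = [item["embedding"] for item in example_response["data"]]
```

Because the format matches OpenAI's, existing OpenAI SDK code can typically be pointed at a compatible endpoint by overriding the base URL rather than rewriting request handling.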
Ideal for
- Teams building RAG pipelines who want the flexibility to switch embedding providers without rewriting integration code.
- Applications with large document corpora where caching significantly reduces embedding costs.
- Developers who want a simpler integration layer over multiple embedding providers.
Not ideal for
- Teams that have standardized on a single embedding provider — direct integration is simpler with no intermediary.
- Cutting-edge embedding research where access to the latest models immediately on release matters.
- On-premise or air-gapped deployments — Embedded is a cloud API service.
See also
- Cohere Embed — State-of-the-art multilingual embeddings; top MTEB benchmark performance.
- Voyage AI — Domain-specific embeddings for code, finance, and legal; strong for specialized domains.
- OpenAI Embeddings — text-embedding-3 models; high quality, widely supported by vector databases.