Why it matters
- 160+ data connectors reduce the need for custom ingestion code — connect to Notion, Slack, PDFs, GitHub, and databases with a few lines of code.
- Production-tested retrieval patterns include advanced techniques (hybrid search, HyDE, auto-merging, re-ranking) that typically outperform naive chunk-and-embed approaches.
- LlamaParse provides structured PDF parsing that preserves tables, headers, and lists — critical for high-quality RAG over complex documents.
- Active enterprise ecosystem with LlamaCloud for managed deployment and professional support.
Key capabilities
- Data loaders: 160+ connectors via LlamaHub; PDF, Notion, Slack, GitHub, databases.
- Node parsers: Intelligent document chunking with multiple strategies.
- Vector indices: VectorStoreIndex compatible with 20+ vector databases.
- Query engines: RAG pipelines with retrieval, synthesis, and citation.
- Retrieval modes: Semantic, BM25, hybrid, MMR, and custom retrievers.
- Advanced RAG: HyDE, auto-merging, parent document retrieval, re-ranking.
- Agents: ReAct, OpenAI function calling, and custom agent loops.
- LlamaParse: Cloud PDF parsing service with table/structure extraction.
- Streaming: Real-time streaming query engines.
- TypeScript: LlamaIndex.TS for JavaScript/TypeScript applications.
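The hybrid retrieval mode above combines semantic and BM25 rankings. One common way to merge two ranked lists is reciprocal rank fusion (RRF); the sketch below is a plain-Python illustration of that idea, not LlamaIndex's internal implementation (the `k=60` constant and the document IDs are illustrative):

```python
def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[str]:
    # Reciprocal rank fusion: each document earns 1 / (k + rank) per list,
    # so documents ranked well by BOTH retrievers float to the top.
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Semantic search and BM25 disagree; fusion rewards the overlap.
semantic = ["doc_a", "doc_b", "doc_c"]
bm25 = ["doc_b", "doc_d", "doc_a"]
print(rrf_fuse([semantic, bm25]))  # doc_b first: ranked highly by both lists
```

A larger `k` flattens the rank contribution, making fusion less sensitive to exact positions.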
Technical notes
- Language: Python (primary); TypeScript (LlamaIndex.TS)
- License: MIT
- Install:
pip install llama-index
- GitHub: github.com/run-llama/llama_index
- Stars: 32K+
- LLMs: OpenAI, Anthropic, Cohere, Hugging Face, local (Ollama, LM Studio)
- Vector stores: Pinecone, Weaviate, Chroma, Qdrant, pgvector, 20+ more
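All of the vector stores above serve the same core operation that VectorStoreIndex relies on: nearest-neighbor search over embeddings. A dependency-free toy sketch of cosine-similarity top-k lookup — illustrative only; production stores use approximate indexes, and the 2-D vectors here are made up:

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    # Cosine similarity: dot product normalized by vector magnitudes.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k(query: list[float], store: dict[str, list[float]], k: int = 2) -> list[str]:
    # Rank stored vectors by similarity to the query, best first.
    return sorted(store, key=lambda doc_id: cosine(query, store[doc_id]), reverse=True)[:k]

# Tiny in-memory "vector store" with hand-made 2-D embeddings.
store = {
    "pricing.md": [0.9, 0.1],
    "roadmap.md": [0.1, 0.9],
    "faq.md": [0.7, 0.3],
}
print(top_k([1.0, 0.0], store))  # pricing.md and faq.md point the same way as the query
```

Real stores replace the exhaustive scan with ANN structures (HNSW, IVF) so lookup stays fast at millions of vectors.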
Usage example
from llama_index.core import VectorStoreIndex, SimpleDirectoryReader
from llama_index.llms.openai import OpenAI
# Load documents and build RAG index
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
# Query with natural language
query_engine = index.as_query_engine(
    llm=OpenAI(model="gpt-4o"),
    similarity_top_k=5,
)
response = query_engine.query("What are the key findings from the research papers?")
print(response)
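The similarity_top_k=5 setting above returns the five most similar chunks, which can be near-duplicates. The MMR retrieval mode listed under Key capabilities trades a little similarity for diversity; this plain-Python sketch shows the greedy MMR selection rule (not LlamaIndex's implementation — the lambda_ weight and all scores are made up):

```python
def mmr_select(relevance: dict[str, float],
               pairwise: dict[tuple[str, str], float],
               k: int = 2, lambda_: float = 0.7) -> list[str]:
    # Maximal marginal relevance: greedily pick documents that are relevant
    # to the query but dissimilar to documents already selected.
    selected: list[str] = []
    candidates = set(relevance)
    while candidates and len(selected) < k:
        def mmr_score(d: str) -> float:
            redundancy = max((pairwise.get((d, s), pairwise.get((s, d), 0.0))
                              for s in selected), default=0.0)
            return lambda_ * relevance[d] - (1 - lambda_) * redundancy
        best = max(candidates, key=mmr_score)
        selected.append(best)
        candidates.remove(best)
    return selected

# chunk_b is nearly a duplicate of chunk_a, so MMR skips it in favor of chunk_c.
relevance = {"chunk_a": 0.95, "chunk_b": 0.93, "chunk_c": 0.80}
pairwise = {("chunk_a", "chunk_b"): 0.99, ("chunk_a", "chunk_c"): 0.10,
            ("chunk_b", "chunk_c"): 0.12}
print(mmr_select(relevance, pairwise))  # picks chunk_a, then chunk_c
```

Lowering lambda_ pushes harder toward diversity; lambda_=1.0 reduces to plain top-k by relevance.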
Ideal for
- Python developers building RAG applications who need production-quality retrieval with many data sources.
- Teams with complex document ingestion needs (PDFs with tables, mixed formats, many source systems).
- Enterprise RAG deployments that need advanced retrieval techniques (hybrid search, re-ranking) for high-quality answers.
Not ideal for
- TypeScript/JavaScript-first teams — use LlamaIndex.TS or Vercel AI SDK + custom retrieval.
- Simple single-file RAG with no complex retrieval needs — direct LLM SDK may be simpler.
- Agent-heavy workflows without significant retrieval — LangChain has a stronger agent and tool ecosystem.
See also
- LlamaIndex.TS — TypeScript version for Node.js and Next.js applications.
- Haystack — alternative Python framework for RAG and NLP pipelines with similar capabilities.
- Weaviate — Vector database with native LlamaIndex integration.