Why it matters
- Developed by LangChain — the deepest observability integration for the LangChain ecosystem, with zero-config tracing for LangChain apps.
- Full-stack trace visibility: see the exact prompts, token counts, latency, and cost incurred at every step of a chain or agent run.
- Online evaluation lets you attach custom evaluators that automatically grade outputs on quality, accuracy, or custom criteria.
- Dataset management turns production traces into test suites — continuously test prompt changes against real production examples.
Key capabilities
- Full trace capture: Record every LLM call, chain step, tool use, and retrieval in a hierarchical trace tree.
- Latency and cost tracking: See token counts, latency, and model cost for every call — identify bottlenecks and expensive steps.
- Prompt versioning: Save and compare prompt templates; test different versions against the same dataset.
- Dataset curation: Create test datasets from production traces; annotate examples as ground truth.
- Automated evaluation: Define evaluators (LLM-based or custom Python) that grade output quality automatically.
- Human annotation: UI for human labelers to rate and annotate model outputs for feedback datasets.
- Regression testing: Run evaluation suites in CI/CD to catch prompt regressions before deployment.
- LangChain integration: Automatic tracing with one environment variable (LANGCHAIN_TRACING_V2=true).
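For a LangChain app, the zero-config tracing described above amounts to setting a couple of environment variables before any chains run. A minimal sketch (the API key and project name below are placeholders, not real values):

```python
import os

# Zero-config LangSmith tracing for a LangChain app: set these before
# constructing any chains. The key is a placeholder, not a real credential.
os.environ["LANGCHAIN_TRACING_V2"] = "true"
os.environ["LANGCHAIN_API_KEY"] = "lsv2_pt_..."   # placeholder key
os.environ["LANGCHAIN_PROJECT"] = "my-app"        # optional: groups traces by project

# From here on, every chain/agent run is traced to LangSmith automatically;
# no decorators or callbacks are needed in application code.
```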
Technical notes
- SDK: Python and JavaScript/TypeScript (pip install langsmith); @traceable decorator and Client API
- LangChain integration: Automatic with LANGCHAIN_TRACING_V2=true and LANGCHAIN_API_KEY
- Data storage: Hosted on LangChain's cloud (US); no self-hosted option (use LangFuse for that)
- Pricing: Free (5,000 traces/mo); Plus $39/mo; Teams $299/mo; Enterprise custom
- API: REST API for programmatic access to traces, datasets, and evaluations
- Founded: 2023 (LangChain Inc., spun out from Harrison Chase's work); San Francisco
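Outside of LangChain itself, the @traceable decorator from the Python SDK is the usual entry point for tracing arbitrary functions. A minimal sketch with a no-op fallback so it runs even where langsmith is not installed; the function body is illustrative, standing in for a real LLM call:

```python
# Sketch of the @traceable pattern from the langsmith Python SDK.
# In a real setup, `pip install langsmith` plus LANGCHAIN_API_KEY are required;
# here a no-op stand-in keeps the example self-contained.
try:
    from langsmith import traceable
except ImportError:
    def traceable(func=None, **kwargs):
        # No-op stand-in: behaves like the decorator but records nothing.
        if func is not None:
            return func
        return lambda f: f

@traceable(run_type="chain")
def summarize(text: str) -> str:
    # Placeholder for an LLM call; the decorated call appears as a
    # run in the LangSmith trace tree when tracing is enabled.
    return text[:40]

print(summarize("A short document to summarize."))
```

Each decorated function shows up as a node in the hierarchical trace tree, nesting under whatever traced function called it.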
Ideal for
- Teams using LangChain who want native observability with zero additional configuration.
- Production AI apps where debugging complex multi-step agent failures is a regular occurrence.
- ML teams building systematic evaluation pipelines with automated regression detection.
Not ideal for
- Organizations with strict data residency requirements — LangSmith is cloud-only (use LangFuse for self-hosted).
- Teams primarily using non-Python LLM stacks where LangSmith's SDK adds friction.
- Budget-conscious teams — LangFuse open-source offers comparable features for free.
See also
- LangFuse — Open-source LLM observability with self-hosted deployment option.
- Weights & Biases — ML experiment tracking platform with W&B Weave for LLM tracing.
- Helicone — Lightweight LLM proxy with observability (simpler than LangSmith).