When building an AI feature, choosing an LLM API affects cost, latency, and control.
Criteria
- Latency: Groq and some dedicated inference providers offer very low latency, while OpenAI and Anthropic typically respond in 200–500 ms.
- Cost: Open models served via Together, Replicate, or Groq can be cheaper per token; OpenAI and Anthropic offer usage-based pricing with free tiers.
- Features: Function calling, vision, long context, and fine-tuning vary by provider.
- Vendor lock-in: Use a unified SDK (e.g. the Vercel AI SDK) to swap providers without rewriting application code.
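The lock-in point above can be sketched as a thin abstraction layer that your app code depends on instead of any vendor SDK. A minimal sketch, assuming nothing about real client libraries; the `Provider` interface and the stub classes are hypothetical stand-ins:

```python
from typing import Protocol


class Provider(Protocol):
    """The only interface app code sees; vendor SDKs hide behind it."""

    def complete(self, prompt: str) -> str: ...


class OpenAIStub:
    """Stand-in for a wrapper around a real OpenAI client (hypothetical)."""

    def complete(self, prompt: str) -> str:
        return f"[openai] {prompt}"


class GroqStub:
    """Stand-in for a wrapper around a real Groq client (hypothetical)."""

    def complete(self, prompt: str) -> str:
        return f"[groq] {prompt}"


def answer(provider: Provider, question: str) -> str:
    # App code is written against Provider, so swapping vendors
    # is a one-line change at the call site, not a rewrite.
    return provider.complete(question)
```

A unified SDK such as the Vercel AI SDK plays the role of `Provider` here: the same generation call works across model providers, so only the model reference changes.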
Suggested flow
- Prototype: OpenAI or Anthropic for fast iteration.
- Scale / cost: Add Groq or Together for high-volume or open models.
- Privacy / on-prem: Ollama or self-hosted open models keep data in your own infrastructure.
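The flow above amounts to a simple routing rule: pick the provider by project stage. The stage names and provider strings below are illustrative, mirroring the list rather than any real API:

```python
def pick_provider(stage: str) -> str:
    """Map a project stage to a provider, following the suggested flow.

    Stage names and provider choices are illustrative, not prescriptive.
    """
    routes = {
        "prototype": "openai",  # or "anthropic": fastest to integrate
        "scale": "groq",        # or "together": cheaper high-volume / open models
        "privacy": "ollama",    # self-hosted open models, data stays on-prem
    }
    if stage not in routes:
        raise ValueError(f"unknown stage: {stage}")
    return routes[stage]
```

Behind a unified SDK, a switch like this is a config change rather than a code migration.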
Use the Compare page on db.fyi to see APIs side by side.