Skip to main content

Best LLM APIs and inference

Curated list of LLM APIs and inference platforms for developers.

#ToolCategoryPricingVisit
1OpenAI API

GPT-4o, embeddings, DALL-E, and Whisper API — the most widely used LLM API powering the majority of AI applications

Code / DevToolsFreemiumVisit
2Anthropic API

Claude models API — 200K-token context, tool use, vision, and computer use for building production AI applications

Code / DevToolsFreemiumVisit
3Groq

Ultra-fast LLM inference API — run Llama, Mixtral, and Gemma at 500+ tokens/second on custom LPU hardware

LLM FrameworksFreemiumVisit
4Together AI

Open-source LLM inference and fine-tuning API — run Llama, Mistral, and 100+ models with competitive pricing

LLM FrameworksFreemiumVisit
5Cohere

Enterprise LLM API — embeddings, RAG, and generation with enterprise security, compliance, and private cloud deployment

LLM FrameworksFreemiumVisit
6ChatGPT / GPT-4

OpenAI's flagship AI model — GPT-4o for chat, code, vision, and reasoning with 200M+ weekly active users

TextFreemiumVisit
7Claude

Anthropic's long-context AI assistant

TextFreemiumVisit
8Replicate

Run thousands of open-source ML models via API — LLMs, image generation, audio, and video without GPU management

LLM FrameworksFreemiumVisit
9Hugging Face

The AI community hub — 900K+ models, 200K+ datasets, Inference API, and Spaces for the open-source ML ecosystem

LLM FrameworksFreeVisit
10Ollama

Run LLMs locally — pull and run Llama, Mistral, Gemma, and 100+ models with one command and OpenAI-compatible API

LLM FrameworksFreeVisit
11Vercel AI SDK

TypeScript AI SDK for React and Node — streaming, tool use, and multi-provider LLM integration with 1M+ npm downloads/week

LLM FrameworksFreeVisit
12LangChain

Open-source LLM application framework — chains, agents, RAG, and 700+ integrations with 127K GitHub stars

LLM FrameworksFreeVisit
13LlamaIndex

Python RAG framework — connect LLMs to 160+ data sources with production-grade retrieval pipelines

LLM FrameworksFreeVisit