Best LLM APIs and inference
Curated list of LLM APIs and inference platforms for developers.
| # | Tool | Category | Pricing | Visit |
|---|---|---|---|---|
| 1 | OpenAI API GPT-4o, embeddings, DALL-E, and Whisper API — the most widely used LLM API powering the majority of AI applications | Code / DevTools | Freemium | Visit |
| 2 | Anthropic API Claude models API — 200K-token context, tool use, vision, and computer use for building production AI applications | Code / DevTools | Freemium | Visit |
| 3 | Groq Ultra-fast LLM inference API — run Llama, Mixtral, and Gemma at 500+ tokens/second on custom LPU hardware | LLM Frameworks | Freemium | Visit |
| 4 | Together AI Open-source LLM inference and fine-tuning API — run Llama, Mistral, and 100+ models with competitive pricing | LLM Frameworks | Freemium | Visit |
| 5 | Cohere Enterprise LLM API — embeddings, RAG, and generation with enterprise security, compliance, and private cloud deployment | LLM Frameworks | Freemium | Visit |
| 6 | ChatGPT / GPT-4 OpenAI's flagship AI model — GPT-4o for chat, code, vision, and reasoning with 200M+ weekly active users | Text | Freemium | Visit |
| 7 | Claude Anthropic's long-context AI assistant | Text | Freemium | Visit |
| 8 | Replicate Run thousands of open-source ML models via API — LLMs, image generation, audio, and video without GPU management | LLM Frameworks | Freemium | Visit |
| 9 | Hugging Face The AI community hub — 900K+ models, 200K+ datasets, Inference API, and Spaces for the open-source ML ecosystem | LLM Frameworks | Free | Visit |
| 10 | Ollama Run LLMs locally — pull and run Llama, Mistral, Gemma, and 100+ models with one command and OpenAI-compatible API | LLM Frameworks | Free | Visit |
| 11 | Vercel AI SDK TypeScript AI SDK for React and Node — streaming, tool use, and multi-provider LLM integration with 1M+ npm downloads/week | LLM Frameworks | Free | Visit |
| 12 | LangChain Open-source LLM application framework — chains, agents, RAG, and 700+ integrations with 127K GitHub stars | LLM Frameworks | Free | Visit |
| 13 | LlamaIndex Python RAG framework — connect LLMs to 160+ data sources with production-grade retrieval pipelines | LLM Frameworks | Free | Visit |