Replicate

Run thousands of open-source ML models via API: LLMs, image generation, audio, and video, with no GPU management

LLM Frameworks · Freemium

Replicate is a cloud platform for running open-source machine learning models through a REST API. Its catalog holds thousands of community models, among them Llama, Stable Diffusion XL (SDXL), Whisper, and Flux, and a single API call runs a model from that catalog with no GPU servers to manage. Pricing is pay-per-prediction with no minimums. You can also push and host your own custom models.
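As a rough illustration of what "a single API call" could look like, here is a minimal Python sketch that uses only the standard library. The endpoint URL, the Bearer-token header, the REPLICATE_API_TOKEN environment variable, and the {"version", "input"} body shape are assumptions, not details taken from this page, so check them against Replicate's own API reference. The helper names build_prediction_request and run_prediction are hypothetical.

```python
import json
import os
import urllib.request

# Assumed endpoint; verify against Replicate's HTTP API reference.
PREDICTIONS_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(version, model_input, token):
    """Build (but do not send) a POST request that would start one prediction."""
    # Assumed body shape: a model version id plus that model's input fields.
    body = json.dumps({"version": version, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        PREDICTIONS_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


def run_prediction(version, model_input, token=None):
    """Send the request and return the decoded JSON response."""
    # Assumed environment variable name for the API token.
    token = token or os.environ["REPLICATE_API_TOKEN"]
    request = build_prediction_request(version, model_input, token)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling run_prediction("<model-version-id>", {"prompt": "a watercolor fox"}) would send the request; the version id here is a placeholder and the input fields depend on the model chosen. Building the request separately from sending it keeps the payload easy to inspect without network access or a token.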

FAQ

Alternatives

Integrations

None listed.

Built on

None listed.