Refact.ai is an AI coding assistant with both cloud and self-hosted options. It provides inline code completion, an AI chat panel, and code refactoring suggestions in VS Code and JetBrains. The distinctive feature is the self-hosted option: teams can run Refact's backend server on their own GPU infrastructure using open-source models (Code Llama, StarCoder, WizardCoder) — code never leaves the company's network.

Refact.ai has a free cloud tier with basic code completion using hosted models. The Pro plan (~$10/user/mo) unlocks more models and higher usage limits. The self-hosted version (Refact Enterprise) requires a license but allows using your own GPU infrastructure with no per-token API costs — economical for large engineering teams.

How does Refact's self-hosting work?

Refact provides a Docker-based self-hosted server that runs open-source code models (Code Llama 34B, StarCoder2, WizardCoder). You deploy the server on your GPU infrastructure (on-premise or private cloud), configure the VS Code/JetBrains plugin to point to your server, and all requests stay internal. This is the key differentiator for organizations in finance, defense, healthcare, or any industry with strict data residency requirements.

What models does Refact.ai support?

Cloud tier: GPT-4, Claude 3, and Refact's proprietary 1.6B code completion model. Self-hosted: Code Llama (7B, 13B, 34B), StarCoder2, WizardCoder, and any HuggingFace-compatible model. The 1.6B Refact model is particularly notable — it achieves competitive HumanEval scores at a fraction of the size, enabling fast completion on consumer-grade GPUs.

Refact.ai | db.fyi

Why it matters

True self-hosted option with complete data privacy — code never leaves the network, critical for defense, finance, and healthcare customers.
Proprietary 1.6B Refact model achieves strong HumanEval scores with a tiny model — enables fast completion on consumer GPUs (RTX 3080).
VS Code and JetBrains coverage reaches the full developer population — not VS Code-only like many competitors.
Open source server (github.com/smallcloudai/refact) means self-hosted teams can audit and customize the backend.

Key capabilities

Inline completion: Context-aware code completion in VS Code and JetBrains IDEs.
AI chat: Conversational AI chat for code explanation, debugging, and generation.
Refactoring: Suggest improvements, extract functions, and simplify code.
Self-hosted server: Docker-based server running open-source models on private GPU infrastructure.
Model selection: Cloud (GPT-4, Claude, Refact-1.6B) or self-hosted (Code Llama, StarCoder2).
Custom fine-tuning: Enterprise option for fine-tuning on internal codebase (self-hosted).
Codebase indexing: Understand repository context for more accurate completions.
Team management: User access control, usage analytics, and permission settings.

Technical notes

IDE plugins: VS Code, JetBrains (IntelliJ, PyCharm, GoLand, etc.)
Self-hosted: Docker; NVIDIA GPU (8GB+ VRAM recommended); github.com/smallcloudai/refact
Cloud models: GPT-4, Claude 3, Refact-1.6B
Self-hosted models: Code Llama (7B/13B/34B), StarCoder2, WizardCoder
Pricing: Free (cloud, limited); Pro ~$10/user/mo; Enterprise (self-hosted, custom)
Founded: 2022; Tallinn, Estonia

Ideal for

Engineering organizations with strict data privacy requirements that prevent using cloud-based coding AIs.
Financial services, healthcare, defense, and government teams who need on-premise AI coding assistance.
Teams with existing GPU infrastructure who want to maximize it for developer productivity.

Not ideal for

Teams without GPU infrastructure for self-hosted deployment — cloud tier is competitive but not class-leading.
Maximum AI coding quality without data privacy constraints — Cursor or GitHub Copilot have stronger cloud-side capabilities.
Solo developers — the self-hosted value proposition doesn't apply at individual scale.

Refact.ai

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also

FAQ

Alternatives

Integrations

Built on

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also

Refact.ai

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also

FAQ

What is Refact.ai?

Is Refact.ai free?

How does Refact's self-hosting work?

What models does Refact.ai support?

Alternatives

Integrations

Built on

Related tools

Why it matters

Key capabilities

Technical notes

Ideal for

Not ideal for

See also