Why it matters
- With 21K+ GitHub stars, Tabby is the most popular self-hosted alternative to GitHub Copilot.
- Complete data privacy: code never leaves your infrastructure, which is critical for defense, finance, healthcare, and proprietary IP protection.
- Consumer GPU support (RTX 3080+) makes self-hosting accessible to teams without enterprise hardware budgets.
- Open source means the code can be audited — organizations with strict compliance requirements can verify exactly how code is processed.
Key capabilities
- Code completion: Context-aware multi-line completion powered by StarCoder2, CodeLlama, or other models.
- Self-hosted server: Docker-based server deployment; REST API for extensions.
- VS Code extension: in-editor completions via the official Tabby extension.
- JetBrains extension: in-editor completions in IntelliJ-based IDEs via the Tabby plugin.
- Model management: Web admin UI for downloading, switching, and managing coding models.
- Multi-user support: Team usage tracking and analytics (Tabby Enterprise).
- Repository indexing: Index your codebase for context-aware, repo-specific completions.
- OpenAI-compatible API: drop-in replacement for services that expect the OpenAI completion API format.
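As a rough sketch of what the OpenAI-compatible interface looks like in practice, the request below assumes a Tabby server already running on localhost port 8080; the host, port, and endpoint path are all deployment-dependent assumptions, not guaranteed routes.

```shell
# Hypothetical completion request against a locally hosted Tabby server.
# The host, port, and /v1/completions path are assumptions -- check your
# server's documentation for the routes your version actually exposes.
curl -s http://localhost:8080/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"prompt": "def fibonacci(n):", "max_tokens": 64, "temperature": 0.2}'
```

Because the API mirrors the OpenAI completion format, existing client libraries can often be pointed at the local server simply by overriding their base URL.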
Technical notes
- License: Apache 2.0 (open source)
- GitHub: github.com/TabbyML/tabby (21K+ stars)
- Server: Docker (docker pull tabbyml/tabby) or native binary
- GPU: NVIDIA CUDA, Apple Silicon (Metal), or CPU (slow)
- Models: StarCoder2, CodeLlama, DeepSeek-Coder, WizardCoder, and more
- IDE plugins: VS Code, JetBrains
- Pricing: Free (self-hosted); Tabby Enterprise for team features
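The notes above translate into a minimal deployment sketch. The model name, serve flags, port mapping, and volume path below are illustrative assumptions; consult the Tabby documentation for the exact options your version supports.

```shell
# Pull the server image named in the Technical notes.
docker pull tabbyml/tabby

# Launch with NVIDIA GPU support on port 8080. The model name,
# --device flag, and data volume are assumptions -- adjust to your
# hardware and to the models your Tabby version ships with.
docker run -d --gpus all -p 8080:8080 \
  -v "$HOME/.tabby:/data" \
  tabbyml/tabby serve --model StarCoder2-3B --device cuda
```

On Apple Silicon, the same command would drop `--gpus all` and swap the device flag for Metal; CPU-only runs work but, as noted above, are too slow for interactive completion.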
Ideal for
- Organizations with strict data privacy requirements that prevent using cloud-based coding AIs.
- Teams with on-premise GPU infrastructure who want to run coding assistance on existing hardware.
- Open-source advocates who want transparency into how their coding assistant works.
Not ideal for
- Teams without GPU hardware — CPU inference is too slow for interactive use, making cloud services the more practical option.
- Teams chasing maximum completion quality — hosted services backed by GPT-4 or Claude 3.5 typically outperform local models.
- Non-technical deployment teams — self-hosting requires Docker and some infrastructure setup.
See also
- Refact.ai — Another self-hosted coding AI; includes AI chat features alongside completion.
- Continue.dev — VS Code/JetBrains extension connecting to local and cloud models; more configurable.
- Codeium — Free cloud coding AI; no self-hosting needed but data goes to Codeium's servers.