Why it matters
- Supports an unusually wide range of model formats (GGUF, GPTQ, AWQ, EXL2) — run quantized models regardless of the format they were released in.
- Granular control over generation: temperature, repetition penalty, top-p, min-p, top-k, CFG, and 20+ other parameters.
- Extensions ecosystem adds RAG, TTS, voice input, and image generation — making it a complete AI toolkit.
- A common choice for researchers comparing model behavior under specific generation settings.
Key capabilities
- Multi-format model support: GGUF (llama.cpp), GPTQ, AWQ, EXL2, GGML (legacy), and Transformers float16.
- Multiple backends: llama.cpp, ExLlamaV2, AutoGPTQ, AutoAWQ, and HuggingFace Transformers.
- Chat modes: Instruct mode, chat mode, and character roleplay with custom persona cards.
- Generation parameter control: 20+ sampling parameters — temperature, repetition penalty, top-p, DRY, Mirostat, CFG, etc.
- Extensions: Community extensions for TTS, voice input, image generation (SD), RAG, and more.
- LoRA loading: Apply LoRA adapters on top of base models for fine-tuned behavior.
- API server: OpenAI-compatible REST API + additional endpoints for Ooba-specific features.
- Training tab: Basic LoRA fine-tuning on custom datasets from the UI.
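The OpenAI-compatible API makes it easy to script against a running instance. A minimal sketch of building a chat-completion request, assuming the server is listening on its default `localhost:5000` (the endpoint path follows the OpenAI convention; the default parameters here are illustrative):

```python
import json
import urllib.request

API_URL = "http://localhost:5000/v1/chat/completions"  # default port; adjust if changed

def build_chat_request(prompt, temperature=0.7, max_tokens=256):
    """Build an OpenAI-style chat completion request (not yet sent)."""
    body = {
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# With a server actually running, send it like this:
# with urllib.request.urlopen(build_chat_request("Hello!")) as resp:
#     reply = json.load(resp)["choices"][0]["message"]["content"]
```

Because the payload shape matches OpenAI's, existing OpenAI client libraries can usually be pointed at the local base URL instead of hand-rolling requests like this.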
Technical notes
- Install: Python 3.11+; GPU drivers; install script handles environment setup
- Hardware: NVIDIA GPU (CUDA), AMD GPU (ROCm), Apple Silicon (MPS), or CPU
- License: AGPL-3.0 — open source; commercial use is permitted, but the AGPL requires sharing source when the software is offered over a network, which some commercial deployments avoid
- Backends: ExLlamaV2 (best for GPTQ/EXL2), llama.cpp (best for GGUF), Transformers (most compatible)
- API: OpenAI-compatible at localhost:5000; also SSE streaming
- Model download: Manual download from HuggingFace and copy to models/ folder (more complex than LM Studio)
- Maintained by: oobabooga (GitHub username); actively maintained community project
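The SSE streaming mode noted above delivers tokens as `data: {...}` lines in the OpenAI delta format. A rough parsing sketch using fabricated sample lines rather than a live connection (the exact delta schema should be verified against a running server):

```python
import json

def parse_sse_line(line):
    """Extract the text delta from one SSE 'data:' line, or None."""
    line = line.strip()
    if not line.startswith("data:"):
        return None
    payload = line[len("data:"):].strip()
    if payload == "[DONE]":  # OpenAI-style end-of-stream sentinel
        return None
    delta = json.loads(payload)["choices"][0].get("delta", {})
    return delta.get("content")

# Illustrative stream fragment (not captured from a real server):
sample = [
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "data: [DONE]",
]
text = "".join(t for t in (parse_sse_line(l) for l in sample) if t)
print(text)  # -> Hello
```

In a real client the lines would come from iterating over a streaming HTTP response instead of a list.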
Ideal for
- Power users and ML researchers who need precise control over model loading, quantization formats, and generation parameters.
- Developers building around local LLMs who want one of the most flexible and extensible local inference UIs available.
- Enthusiasts who want to experiment with LoRA loading, custom sampling strategies, and model comparison.
Not ideal for
- Beginners — complex installation process; LM Studio or GPT4All are far more approachable.
- CPU-only machines — performance is acceptable, but much of the backend and quantization flexibility goes unused without a GPU.
- Production API serving with many concurrent users — vLLM handles high concurrency better.