Why it matters
- YAML-based configuration makes fine-tuning accessible to ML practitioners who don't want to write custom training loops.
- Comprehensive training method support (LoRA, QLoRA, full, RLHF, DPO) in one framework covers the full fine-tuning spectrum.
- Built on community knowledge: the OpenAccess AI Collective incorporates best practices from the open-source AI community.
- Widely used for producing popular open-source fine-tuned models — many HuggingFace Hub models were trained with Axolotl.
Key capabilities
- Training methods: LoRA, QLoRA, full fine-tuning, RLHF, DPO, ORPO, ReLoRA.
- Model support: Llama 2/3, Mistral, Mixtral, Falcon, Phi, Gemma, MPT, and more.
- YAML configuration: Define the entire training run in a simple, readable YAML file.
- Multi-GPU: FSDP and DeepSpeed integration for distributed training.
- Dataset formats: Alpaca, ShareGPT, Completion, Instruction, and custom formats.
- Flash Attention 2: Automatic integration for memory-efficient training.
- Gradient checkpointing: Reduce memory usage for larger batch sizes.
- W&B integration: Weights & Biases logging for training metrics tracking.
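The capabilities above are all driven from a single YAML file. A minimal sketch of a QLoRA run (field names follow Axolotl's config schema; the base model, dataset path, and hyperparameters here are illustrative, not recommendations):

```yaml
base_model: meta-llama/Llama-2-7b-hf

# QLoRA: 4-bit base weights plus trainable low-rank adapters
load_in_4bit: true
adapter: qlora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_linear: true

datasets:
  - path: ./data/train.jsonl   # illustrative path
    type: alpaca

sequence_len: 2048
micro_batch_size: 2
gradient_accumulation_steps: 8
num_epochs: 3
learning_rate: 0.0002
optimizer: adamw_bnb_8bit
lr_scheduler: cosine

# Memory-efficiency features mentioned above
flash_attention: true
gradient_checkpointing: true

output_dir: ./outputs/qlora-llama2
wandb_project: my-finetune   # optional W&B logging
```

A config like this is typically launched via Accelerate, e.g. `accelerate launch -m axolotl.cli.train config.yml`; swapping `adapter` and the quantization flags switches between LoRA, QLoRA, and full fine-tuning without rewriting any training code.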
Technical notes
- License: Apache 2.0 (open source)
- GitHub: github.com/OpenAccess-AI-Collective/axolotl (8K+ stars)
- Install: pip install axolotl, or use the Docker image
- GPU: NVIDIA (CUDA required); multi-GPU via FSDP/DeepSpeed
- Python: 3.9+
- Models: Llama 2/3, Mistral, Mixtral, Gemma, Phi-3, Falcon, MPT, and more
- Dataset formats: Alpaca, ShareGPT, completion, custom JSON/JSONL
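As a concrete example of the dataset formats listed above, Alpaca-style data is JSONL with instruction/input/output fields; a two-record sketch (contents illustrative):

```json
{"instruction": "Summarize the following text.", "input": "Axolotl is a YAML-driven fine-tuning framework.", "output": "Axolotl lets you fine-tune LLMs via config files."}
{"instruction": "What is LoRA?", "input": "", "output": "LoRA is a parameter-efficient fine-tuning method that trains small low-rank adapter matrices."}
```

A file like this is referenced from the config's datasets section with `type: alpaca`; ShareGPT-style conversation data uses a different record shape and its own type value.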
Ideal for
- ML practitioners who want to fine-tune LLMs on custom datasets without writing training code from scratch.
- Researchers experimenting with different training methods (LoRA vs. DPO vs. RLHF) on the same base model.
- Teams producing specialized models (domain-specific, instruction-tuned, preference-aligned) for open-source release or internal use.
Not ideal for
- Teams without GPU access — Axolotl requires CUDA GPUs; use Predibase for managed training.
- Maximum training speed — Unsloth's hand-written Triton kernels are significantly faster for supported models.
- Non-technical users who need a UI — Axolotl is CLI-based; Predibase or Together AI have GUIs.