hub / github.com/unslothai/unsloth

github.com/unslothai/unsloth @main sqlite

repository ↗ · DeepWiki ↗ · compare 2 versions

20,737 symbols 74,195 edges 1,540 files 4,948 documented · 24%

README

Unsloth Studio lets you run and train models locally.

Features • Quickstart • Notebooks • Documentation

⚡ Get started

macOS, Linux, WSL:

curl -fsSL https://unsloth.ai/install.sh | sh

Windows:

irm https://unsloth.ai/install.ps1 | iex

Community:

⭐ Features

Unsloth Studio (Beta) lets you run and train text, audio, embedding, vision models on Windows, Linux and macOS.

Inference

Search + download + run models including GGUF, LoRA adapters, safetensors
Export models: Save or export models to GGUF, 16-bit safetensors and other formats.
Tool calling: Support for self-healing tool calling and web search
Code execution: lets LLMs test code in Claude artifacts and sandbox environments
API inference endpoint: Deploy and run local LLMs in Claude Code, Codex tools with Unsloth
Auto set inference settings and customize chat templates.
We work directly with teams behind gpt-oss, Qwen3, Llama 4, Mistral, Gemma 1-3, and Phi-4, where we’ve fixed bugs that improve model accuracy.
Chat with images, audio, PDFs, code, DOCX and more. Connect API providers (OpenAI, Anthropic) or servers (vLLM, Ollama).

Training

Train and RL 500+ models up to 2x faster with up to 70% less VRAM, with no accuracy loss.
Custom Triton and mathematical kernels. See some collabs we did with PyTorch and Hugging Face.
Data Recipes: Auto-create datasets from PDF, CSV, DOCX etc. Edit data in a visual-node workflow.
Reinforcement Learning (RL): The most efficient RL library, using 80% less VRAM for GRPO, FP8 etc.
Supports full fine-tuning, RL, pretraining, 4-bit, 16-bit and, FP8 training.
Observability: Monitor training live, track loss and GPU usage and customize graphs.
Multi-GPU training is supported, with major improvements coming soon.

📥 Install

Unsloth can be used in two ways: through Unsloth Studio, the web UI, or through Unsloth Core, the code-based version. Each has different requirements.

Unsloth Studio (web UI)

Unsloth Studio (Beta) works on Windows, Linux, WSL and macOS.

CPU: Supported for Chat and Data Recipes currently
NVIDIA: Training works on RTX 30/40/50, Blackwell, DGX Spark, Station and more
macOS: Training, MLX and GGUF inference are ALL supported.
AMD: Chat + Data works. Train with Unsloth Core. Studio support is out soon.
Multi-GPU: Available now, with a major upgrade on the way

macOS, Linux, WSL:

curl -fsSL https://unsloth.ai/install.sh | sh

Use the same command to update.

Windows:

irm https://unsloth.ai/install.ps1 | iex

Use the same command to update.

Launch

unsloth studio -p 8888

For cloud or global access, add -H 0.0.0.0. By default, Unsloth is accessible only locally.

To reach Studio over HTTPS, use unsloth studio --secure. Studio stays bound to localhost and is reached only through a free Cloudflare tunnel, which publishes it at a public https://*.trycloudflare.com URL (it fails closed if the tunnel can't start, so the raw port is never exposed). This makes Studio reachable from the internet, so anyone with the link and API key can use it and run code: keep your API key private (see Remote access below).

Docker

Use our Docker image unsloth/unsloth container. Run:

docker run -d -e JUPYTER_PASSWORD="mypassword" \
  -p 8888:8888 -p 8000:8000 -p 2222:22 \
  -v $(pwd)/work:/workspace/work \
  --gpus all \
  unsloth/unsloth
  ```

#### Developer, Nightly, Uninstall
To see developer, nightly and uninstallation etc. instructions, see [advanced installation](#-advanced-installation).

### Unsloth Core (code-based)
#### Linux, WSL:
```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
uv venv unsloth_env --python 3.13
source unsloth_env/bin/activate
uv pip install unsloth --torch-backend=auto

Windows:

winget install -e --id Python.Python.3.13
winget install --id=astral-sh.uv  -e
uv venv unsloth_env --python 3.13
.\unsloth_env\Scripts\activate
uv pip install unsloth --torch-backend=auto

For Windows, pip install unsloth works only if you have PyTorch installed. Read our Windows Guide. You can use the same Docker image as Unsloth Studio.

AMD, Intel:

For RTX 50x, B200, 6000 GPUs: uv pip install unsloth --torch-backend=auto. Read our guides for: Blackwell and DGX Spark.

To install Unsloth on AMD and Intel GPUs, follow our AMD Guide and Intel Guide.

📒 Free Notebooks

Train for free with our notebooks. You can use our new free Unsloth Studio notebook to run and train models for free in a web UI. Read our guide. Add dataset, run, then deploy your trained model.

Model	Free Notebooks	Performance	Memory use
Gemma 4 (E2B)	▶️ Start for free	1.5x faster	50% less
Qwen3.5 (4B)	▶️ Start for free	1.5x faster	60% less
gpt-oss (20B)	▶️ Start for free	2x faster	70% less
Qwen3.5 GSPO	▶️ Start for free	2x faster	70% less
gpt-oss (20B): GRPO	▶️ Start for free	2x faster	80% less
Qwen3: Advanced GRPO	▶️ Start for free	2x faster	70% less
embeddinggemma (300M)	▶️ Start for free	2x faster	20% less
Mistral Ministral 3 (3B)	▶️ Start for free	1.5x faster	60% less
Llama 3.1 (8B) Alpaca	▶️ Start for free	2x faster	70% less
Llama 3.2 Conversational	▶️ Start for free	2x faster	70% less
Orpheus-TTS (3B)	▶️ Start for free	1.5x faster	50% less

See all our notebooks for: Kaggle, GRPO, TTS, embedding & Vision
See all our models and all our notebooks
See detailed documentation for Unsloth here

🦥 Unsloth News

Connections: Connect any API provider (OpenAI, Anthropic) or server (vLLM, Ollama). Guide
MTP: Run Qwen3.6 MTP in Unsloth. MTP settings are autoset specific to your hardware. Guide
API inference endpoint: Deploy and run local LLMs in Claude Code, Codex tools. Guide
Qwen3.6: Qwen3.6-35B-A3B can now be trained and run in Unsloth Studio. Blog
Gemma 4: Run and train Google’s new models directly in Unsloth. Blog
Introducing Unsloth Studio: our new web UI for running and training LLMs. Blog
Qwen3.5 - 0.8B, 2B, 4B, 9B, 27B, 35-A3B, 112B-A10B are now supported. Guide + notebooks
Train MoE LLMs 12x faster with 35% less VRAM - DeepSeek, GLM, Qwen and gpt-oss. Blog
Embedding models: Unsloth now supports ~1.8-3.3x faster embedding fine-tuning. Blog • Notebooks
New 7x longer context RL vs. all other setups, via our new batching algorithms. Blog
New RoPE & MLP Triton Kernels & Padding Free + Packing: 3x faster training & 30% less VRAM. Blog
500K Context: Training a 20B model with >500K context is now possible on an 80GB GPU. Blog
FP8 & Vision RL: You can now do FP8 & VLM GRPO on consumer GPUs. FP8 Blog • Vision RL

📥 Advanced Installation

The below advanced instructions are for Unsloth Studio. For Unsloth Core advanced installation, view our docs.

Developer / Nightly / Experimental installs: macOS, Linux, WSL:

The developer install builds from the main branch, which is the latest (nightly) source.

git clone https://github.com/unslothai/unsloth
cd unsloth
./install.sh --local
unsloth studio -p 8888

To install into an isolated location (its own virtual env, auth/, studio.db, cache and llama.cpp build), set UNSLOTH_STUDIO_HOME and pass it again at launch:

UNSLOTH_STUDIO_HOME="$PWD/.studio" ./install.sh --local
UNSLOTH_STUDIO_HOME="$PWD/.studio" unsloth studio -p 8888

Then to update :

cd unsloth && git pull
./install.sh --local
unsloth studio -p 8888

Developer / Nightly / Experimental installs: Windows PowerShell:

The developer install builds from the main branch, which is the latest (nightly) source. ```powershell git clon

Extension points exported contracts — how you extend this code

SpeechRecognitionResultList (Interface)

* Minimal Web Speech API (Speech Recognition) types for browsers that support it. * Full types: @types/dom-speech-recog

studio/frontend/src/speech-recognition.d.ts

CodeExecutionArgs (Interface)

* Renders synthetic `_toolEvent` chunks from `_stream_anthropic` for the * `code_execution_20250825` tool. The backend

studio/frontend/src/components/assistant-ui/tool-ui-code-execution.tsx

ImageGenerationArgs (Interface)

* Renders the synthetic `_toolEvent` chunks emitted by * `_stream_openai_responses` when OpenAI's Responses-API `image_

studio/frontend/src/components/assistant-ui/tool-ui-image-generation.tsx

Tombstone (Interface)

* Tombstones mask deleted threads in the Dexie read fallback. Each carries a * `deletedAt` so old entries can be GC'd,

studio/frontend/src/features/chat/utils/chat-thread-tombstones.ts

ServerUsage (Interface)

Server-side usage data from llama-server (via stream_options.include_usage).

studio/frontend/src/features/chat/api/chat-adapter.ts

Core symbols most depended-on inside this repo

get

called by 1706

studio/backend/core/inference/api_monitor.py

set

called by 758

studio/frontend/src/features/chat/chat-settings-sheet.tsx

Field

called by 651

studio/frontend/src/components/ui/field.tsx

called by 635

studio/frontend/src/lib/utils.ts

list

called by 466

studio/frontend/src/features/chat/runtime-provider.tsx

filter

called by 404

studio/backend/run.py

get

called by 403

unsloth/import_fixes.py

has

called by 282

studio/frontend/src/features/hub/lib/lru-map.ts

Shape

Function 13,530

Method 5,053

Class 1,493

Interface 345

Route 316

Languages

Python81%

TypeScript19%

Modules by API surface

tests/studio/install/test_rocm_support.py401 symbols

tests/studio/install/test_selection_logic.py322 symbols

studio/backend/tests/test_transformers_version.py294 symbols

studio/backend/routes/inference.py282 symbols

studio/install_llama_prebuilt.py241 symbols

studio/backend/core/inference/llama_cpp.py232 symbols

studio/backend/tests/test_openai_tool_passthrough.py227 symbols

studio/backend/tests/test_vram_estimation.py197 symbols

studio/backend/tests/test_openai_auto_switch.py196 symbols

studio/backend/tests/test_gemini_provider.py168 symbols

unsloth/import_fixes.py163 symbols

studio/backend/tests/test_kv_cache_estimation.py160 symbols

Used by 1 indexed graphs manifest dependencies, hub-wide

github.com/marimo-team/marimo

Dependencies from manifests, versioned

@assistant-ui/core0.1.17 · 1×

@assistant-ui/react0.12.28 · 1×

@assistant-ui/tap0.5.10 · 1×

@base-ui/react1.2.0 · 1×

@biomejs/biome1.9.4 · 1×

@dagrejs/dagre2.0.4 · 1×

@dagrejs/graphlib3.0.4 · 1×

@eslint/js9.39.1 · 1×

@fontsource-variable/figtree5.2.10 · 1×

@fontsource-variable/inter5.2.8 · 1×

@fontsource-variable/space-grotesk5.2.10 · 1×

@hugeicons/core-free-icons4.1.1 · 1×

Datastores touched

appDatabase · 1 repos

For agents

$ claude mcp add unsloth \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact