hub / github.com/mosaicml/composer

github.com/mosaicml/composer @v0.32.1 sqlite

repository ↗ · DeepWiki ↗ · release v0.32.1 ↗

3,517 symbols 16,719 edges 391 files 1,165 documented · 33%

README

<a href="https://github.com/mosaicml/composer#gh-light-mode-only" class="only-light">
  <img src="https://github.com/mosaicml/composer/raw/v0.32.1/docs/source/_static/logo-light-mode.png" width="50%"/>
</a>

<a href="https://github.com/mosaicml/composer#gh-dark-mode-only" class="only-dark">
  <img src="https://github.com/mosaicml/composer/raw/v0.32.1/docs/source/_static/logo-dark-mode.png" width="50%"/>
</a>

Supercharge your Model Training

Deep Learning Framework for Training at Scale

[Website] - [Getting Started] - [Docs] - [We're Hiring!]

<a href="https://pypi.org/project/mosaicml/">
    <img alt="PyPi Version" src="https://img.shields.io/pypi/pyversions/mosaicml">
</a>
<a href="https://pypi.org/project/mosaicml/">
    <img alt="PyPi Package Version" src="https://img.shields.io/pypi/v/mosaicml">
</a>
<a href="https://pepy.tech/project/mosaicml/">
    <img alt="PyPi Downloads" src="https://static.pepy.tech/personalized-badge/mosaicml?period=month&units=international_system&left_color=grey&right_color=blue&left_text=Downloads/month">
</a>
<a href="https://docs.mosaicml.com/projects/composer/en/stable/">
    <img alt="Documentation" src="https://readthedocs.org/projects/composer/badge/?version=stable">
</a>
<a href="https://dub.sh/mcomm">
    <img alt="Chat @ Slack" src="https://img.shields.io/badge/slack-chat-2eb67d.svg?logo=slack">
</a>
<a href="https://github.com/mosaicml/composer/blob/dev/LICENSE">
    <img alt="License" src="https://img.shields.io/badge/License-Apache%202.0-green.svg?logo=slack">
</a>

👋 Welcome

Composer is an open-source deep learning training library by MosaicML. Built on top of PyTorch, the Composer library makes it easier to implement distributed training workflows on large-scale clusters.

We built Composer to be optimized for scalability and usability, integrating best practices for efficient, multi-node training. By abstracting away low-level complexities like parallelism techniques, distributed data loading, and memory optimization, you can focus on training modern ML models and running experiments without slowing down.

We recommend using Composer to speedup your experimentation workflow if you’re training neural networks of any size, including:

Large Language Models (LLMs)
Diffusion models
Embedding models (e.g. BERT)
Transformer-based models
Convolutional Neural Networks (CNNs)

Composer is heavily used by the MosaicML research team to train state-of-the-art models like MPT, and we open-sourced this library to enable the ML community to do the same. This framework is used by organizations in both the tech industry and the academic sphere and is continually updated with new features, bug fixes, and stability improvements for production workloads.

🔑 Key Features

Composer is to give you better workflows with the ability to maximize scale and customizability.

We designed Composer from the ground up for modern deep learning workloads. Gone are the days of AlexNet and ResNet, when state-of-the-art models could be trained on a couple of desktop GPUs. Today, developing the latest and greatest deep learning models often requires cluster-scale hardware — but with Composer’s help, you’ll hardly notice the difference.

The heart of Composer is our Trainer abstraction: a highly optimized PyTorch training loop designed to allow both you and your model to iterate faster. Our trainer has simple ways for you to configure your parallelization scheme, data loaders, metrics, loggers, and more.

Scalability

Whether you’re training on 1 GPU or 512 GPUs, 50MB or 10TB of data - Composer is built to keep your workflow simple.

FSDP: For large models that are too large to fit on GPUs, Composer has integrated PyTorch FullyShardedDataParallelism into our trainer and made it simple to efficiently parallelize custom models. We’ve found FSDP is competitive performance-wise with much more complex parallelism strategies. Alternatively, Composer also supports standard PyTorch distributed data parallelism (DDP) execution.
Elastic sharded checkpointing: Save on eight GPUs, resume on sixteen. Composer supports elastic sharded checkpointing, so you never have to worry if your sharded saved state is compatible with your new hardware setup.
Data streaming: Working with large datasets? Download datasets from cloud blob storage on the fly by integrating with MosaicML StreamingDataset during model training.

Customizability

Other high-level deep learning trainers provide simplicity at the cost of rigidity. When you want to add your own features, their abstractions get in your way. Composer, on the other hand, provides simple ways for you to customize our Trainer to your needs.

Composer’s training loop has a series of events that occur at each stage in the training process.

Fig. 1: Composer’s training loop has a series of events that occur at each stage in the training process. Callbacks are functions that users write to run at specific events. For example, our Learning Rate Monitor Callback logs the learning rate at every BATCH_END event.

Callbacks: Composer’s callback system allows you to insert custom logic at any point in the training loop. We’ve written callbacks to monitor memory usage, log and visualize images, and estimate your model’s remaining training time, to name a few. This feature is popular among researchers who want to implement and experiment with custom training techniques.
Speedup algorithms: We draw from the latest research to create a collection of algorithmic speedups. Stack these speedups into MosaicML recipes to boost your training speeds. Our team has open-sourced the optimal combinations of speedups for different types of models.
- 8x speedup: Stable Diffusion
  - $200k original SD2 cost —> $50k (Blog)
- 7x speedup: ResNet-50 on ImageNet
  - 3h33m —> 25m on 8xA100 (Blog)
- 8.8x speedup: BERT-Base Pretraining
  - 10h —> 1.13h on 8xA100 (Blog)
- 5.4x speedup: DeepLab v3 on ADE20K
  - 3h30m —> 39m on 8xA100 (Blog)

Better workflows

Composer is built to automate away low-level pain points and headaches so you can focus on the important (and fun) parts of deep learning and iterate faster.

Auto-resumption: Failed training run? Have no fear — just re-run your code, and Composer will automatically resume from your latest saved checkpoint.
CUDA OOM Prevention: Say goodbye to out-of-memory errors. Set your microbatch size to “auto”, and Composer will automatically select the biggest one that fits on your GPUs.
Time Abstractions: Ever messed up your conversion between update steps, epochs, samples, and tokens? Specify your training duration with custom units (epochs, batches, samples, and tokens) in your training loop with our Time class.

Integrations

Integrate with the tools you know and love for experiment tracking and data streaming.

Cloud integrations: Our Checkpointing and logging features have first-class support for remote storage and loading from Cloud bucket (OCI, GCP, AWS S3).
*Experiment tracking:* Weights and Biases, MLFlow, CometML, and neptune.ai — the choice is yours, easily log your data to your favorite platform.

🚀 Getting Started

📍Prerequisites

Composer is designed for users who are comfortable with Python and have basic familiarity with deep learning fundamentals and PyTorch.

*******Software requirements:******* A recent version of PyTorch.

*******Hardware requirements:******* System with CUDA-compatible GPUs (AMD + RoCM coming soon!). Composer can run on CPUs, but for full benefits, we recommend using it on hardware accelerators.

💾 Installation

Composer can be installed with pip:

pip install mosaicml

To simplify the environment setup for Composer, we also provide a set of pre-built Docker images. We highly recommend you use our Docker images.

🏁 Quick Start

Here is a code snippet demonstrating our Trainer on the MNIST dataset.

import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import datasets, transforms
from torch.utils.data import DataLoader

from composer import Trainer
from composer.models import ComposerClassifier
from composer.algorithms import LabelSmoothing, CutMix, ChannelsLast

class Model(nn.Module):
    """Toy convolutional neural network architecture in pytorch for MNIST."""

    def __init__(self, num_classes: int = 10):
        super().__init__()

        self.num_classes = num_classes

        self.conv1 = nn.Conv2d(1, 16, (3, 3), padding=0)
        self.conv2 = nn.Conv2d(16, 32, (3, 3), padding=0)
        self.bn = nn.BatchNorm2d(32)
        self.fc1 = nn.Linear(32 * 16, 32)
        self.fc2 = nn.Linear(32, num_classes)

    def forward(self, x):
        out = self.conv1(x)
        out = F.relu(out)
        out = self.conv2(out)
        out = self.bn(out)
        out = F.relu(out)
        out = F.adaptive_avg_pool2d(out, (4, 4))
        out = torch.flatten(out, 1, -1)
        out = self.fc1(out)
        out = F.relu(out)
        return self.fc2(out)

transform = transforms.Compose([transforms.ToTensor()])
dataset = datasets.MNIST("data", train=True, download=True, transform=transform)
train_dataloader = DataLoader(dataset, batch_size=128)

trainer = Trainer(
    model=ComposerClassifier(module=Model(), num_classes=10),
    train_dataloader=train_dataloader,
    max_duration="2ep",
    algorithms=[
        LabelSmoothing(smoothing=0.1),
        CutMix(alpha=1.0),
        ChannelsLast(),
    ],
)
trainer.fit()

Next, check out our Getting Started Colab for a walk-through of Composer’s main features. In this tutorial, we will cover the basics of the Composer Trainer:

Dataloader
Trainer
Optimizer and Scheduler
Logging
Training a baseline model
Speeding up training

📚 Learn more

Once you’ve completed the Quick Start, you can go through the below tutorials or our documentation to further familiarize yourself with Composer.

If you have any questions, please feel free to reach out to us on our Community Slack!

Here are some resources actively maintained by the Composer community to help you get started:

Resource	Details
Training BERTs with Composer and 🤗	A Colab Notebook showing how to train BERT models with Composer and 🤗!
Pretraining and Finetuning an LLM Tutorial	A tutorial from MosaicML’s LLM Foundry, using MosaicML Composer, StreamingDataset, and MCLI on training and evaluating LLMs.
Migrating from PyTorch Lightning	A tutorial is to illustrating a path from working in PyTorch Lightning to working in Composer.

Core symbols most depended-on inside this repo

fit

called by 229

composer/trainer/trainer.py

get

called by 151

composer/core/time.py

write

called by 90

composer/metrics/map.py

exists

called by 67

composer/utils/compression.py

run_event

called by 66

composer/core/engine.py

from_timestring

called by 61

composer/core/time.py

ensure_tuple

called by 52

composer/utils/iter_helpers.py

tensor_to_device

called by 52

composer/devices/device.py

Shape

Function 1,587

Method 1,550

Class 353

Route 27

Languages

Python100%

TypeScript1%

Modules by API surface

tests/trainer/test_trainer.py92 symbols

tests/common/models.py72 symbols

composer/core/time.py71 symbols

tests/trainer/test_checkpoint.py68 symbols

tests/utils/test_batch_helpers.py65 symbols

composer/trainer/trainer.py49 symbols

composer/core/state.py49 symbols

tests/common/datasets.py47 symbols

composer/optim/scheduler.py47 symbols

composer/core/callback.py45 symbols

tests/fixtures/fixtures.py44 symbols

tests/models/test_hf_model.py42 symbols

For agents

$ claude mcp add composer \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact