hub / github.com/qubvel-org/segmentation_models.pytorch

github.com/qubvel-org/segmentation_models.pytorch @v0.5.0 sqlite

repository ↗ · DeepWiki ↗ · release v0.5.0 ↗

811 symbols 2,276 edges 130 files 154 documented · 19%

README

logo
Python library with Neural Networks for Image Semantic
Segmentation based on PyTorch.

The main features of the library are:

Super simple high-level API (just two lines to create a neural network)
12 encoder-decoder model architectures (Unet, Unet++, Segformer, DPT, ...)
800+ pretrained convolution- and transform-based encoders, including timm support
Popular metrics and losses for training routines (Dice, Jaccard, Tversky, ...)
ONNX export and torch script/trace/compile friendly

Community-Driven Project, Supported By

withoutBG API https://withoutbg.com High-quality background removal API

📚 Project Documentation 📚

Visit Read The Docs Project Page or read the following README to know more about Segmentation Models Pytorch (SMP for short) library

⏳ Quick start

1. Create your first Segmentation model with SMP

The segmentation model is just a PyTorch torch.nn.Module, which can be created as easy as:

import segmentation_models_pytorch as smp

model = smp.Unet(
    encoder_name="resnet34",        # choose encoder, e.g. mobilenet_v2 or efficientnet-b7
    encoder_weights="imagenet",     # use `imagenet` pre-trained weights for encoder initialization
    in_channels=1,                  # model input channels (1 for gray-scale images, 3 for RGB, etc.)
    classes=3,                      # model output channels (number of classes in your dataset)
)

see table with available model architectures
see table with available encoders and their corresponding weights

2. Configure data preprocessing

All encoders have pretrained weights. Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder.

from segmentation_models_pytorch.encoders import get_preprocessing_fn

preprocess_input = get_preprocessing_fn('resnet18', pretrained='imagenet')

Congratulations! You are done! Now you can train your model with your favorite framework!

💡 Examples

Name	Link	Colab
Train pets binary segmentation on OxfordPets	Notebook
Train cars binary segmentation on CamVid	Notebook
Train multiclass segmentation on CamVid	Notebook
Train clothes binary segmentation by @ternaus	Repo
Load and inference pretrained Segformer	Notebook
Load and inference pretrained DPT	Notebook
Load and inference pretrained DPT	Notebook
Save and load models locally / to HuggingFace Hub	Notebook
Export trained model to ONNX	Notebook

📦 Models and encoders

Architectures

Architecture	Paper	Documentation	Checkpoints
Unet	paper	docs
Unet++	paper	docs
MAnet	paper	docs
Linknet	paper	docs
FPN	paper	docs
PSPNet	paper	docs
PAN	paper	docs
DeepLabV3	paper	docs
DeepLabV3+	paper	docs
UPerNet	paper	docs	checkpoints
Segformer	paper	docs	checkpoints
DPT	paper	docs	checkpoints

Encoders

The library provides a wide range of pretrained encoders (also known as backbones) for segmentation models. Instead of using features from the final layer of a classification model, we extract intermediate features and feed them into the decoder for segmentation tasks.

All encoders come with pretrained weights, which help achieve faster and more stable convergence when training segmentation models.

Given the extensive selection of supported encoders, you can choose the best one for your specific use case, for example:
- Lightweight encoders for low-latency applications or real-time inference on edge devices (mobilenet/mobileone).
- High-capacity architectures for complex tasks involving a large number of segmented classes, providing superior accuracy (convnext/swin/mit).

By selecting the right encoder, you can balance efficiency, performance, and model complexity to suit your project needs.

All encoders and corresponding pretrained weight are listed in the documentation: - table with natively ported encoders - table with timm encoders supported

🔁 Models API

Input channels

The input channels parameter allows you to create a model that can process a tensor with an arbitrary number of channels. If you use pretrained weights from ImageNet, the weights of the first convolution will be reused: - For the 1-channel case, it would be a sum of the weights of the first convolution layer. - Otherwise, channels would be populated with weights like new_weight[:, i] = pretrained_weight[:, i % 3], and then scaled with new_weight * 3 / new_in_channels.

model = smp.FPN('resnet34', in_channels=1)
mask = model(torch.ones([1, 1, 64, 64]))

Auxiliary classification output

All models support aux_params parameters, which is default set to None. If aux_params = None then classification auxiliary output is not created, else model produce not only mask, but also label output with shape NC. Classification head consists of GlobalPooling->Dropout(optional)->Linear->Activation(optional) layers, which can be configured by aux_params as follows:

aux_params=dict(
    pooling='avg',             # one of 'avg', 'max'
    dropout=0.5,               # dropout ratio, default is None
    activation='sigmoid',      # activation function, default is None
    classes=4,                 # define number of output labels
)
model = smp.Unet('resnet34', classes=4, aux_params=aux_params)
mask, label = model(x)

Depth

Depth parameter specify a number of downsampling operations in encoder, so you can make your model lighter if specify smaller depth.

model = smp.Unet('resnet34', encoder_depth=4)

🛠 Installation

PyPI version:

```bash $ pip install segmentation-models-pytorch


The latest version from GitHub:

```bash
$ pip install git+https://github.com/qubvel/segmentation_models.pytorch

🏆 Competitions won with the library

Segmentation Models package is widely used in image segmentation competitions. [Here](https://github.com/qubvel/segmentation_models.pytorch/blob/main/HALLOFF

Core symbols most depended-on inside this repo

forward

called by 15

segmentation_models_pytorch/utils/metrics.py

_compute_metric

called by 15

segmentation_models_pytorch/metrics/functional.py

_get_sample

called by 12

tests/encoders/base.py

get_encoder

called by 11

segmentation_models_pytorch/encoders/__init__.py

initialize

called by 10

segmentation_models_pytorch/base/model.py

save_pretrained

called by 9

segmentation_models_pytorch/base/hub_mixin.py

_get_sample

called by 8

tests/models/base.py

check_run_test_on_diff_or_main

called by 7

tests/utils.py

Shape

Method 449

Class 208

Function 154

Languages

Python100%

Modules by API surface

segmentation_models_pytorch/encoders/mix_transformer.py37 symbols

segmentation_models_pytorch/metrics/functional.py34 symbols

segmentation_models_pytorch/encoders/_efficientnet.py32 symbols

segmentation_models_pytorch/encoders/_inceptionv4.py31 symbols

segmentation_models_pytorch/encoders/_inceptionresnetv2.py26 symbols

segmentation_models_pytorch/encoders/_dpn.py23 symbols

segmentation_models_pytorch/decoders/dpt/decoder.py22 symbols

segmentation_models_pytorch/encoders/mobileone.py18 symbols

segmentation_models_pytorch/decoders/deeplabv3/decoder.py18 symbols

segmentation_models_pytorch/base/modules.py18 symbols

segmentation_models_pytorch/encoders/_senet.py17 symbols

tests/models/base.py16 symbols

For agents

$ claude mcp add segmentation_models.pytorch \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact