hub / github.com/huggingface/sentence-transformers

github.com/huggingface/sentence-transformers @v5.6.0 sqlite

repository ↗ · DeepWiki ↗ · release v5.6.0 ↗

3,083 symbols 14,865 edges 453 files 1,298 documented · 42%

README

[][#github-license] [][#pypi-package] [][#pypi-package] [][#docs-package]

Sentence Transformers: Embeddings, Retrieval, and Reranking

This framework provides an easy method to compute embeddings for accessing, using, and training state-of-the-art embedding and reranker models. It can be used to compute embeddings using Sentence Transformer models (quickstart), to calculate similarity scores using Cross-Encoder (a.k.a. reranker) models (quickstart) or to generate sparse embeddings using Sparse Encoder models (quickstart). This unlocks a wide range of applications, including semantic search, semantic textual similarity, and paraphrase mining.

A wide selection of over 15,000 pre-trained Sentence Transformers models are available for immediate use on 🤗 Hugging Face, including many of the state-of-the-art models from the Massive Text Embeddings Benchmark (MTEB) leaderboard. Additionally, it is easy to train or finetune your own embedding models, reranker models or sparse encoder models using Sentence Transformers, enabling you to create custom models for your specific use cases.

For the full documentation, see www.SBERT.net.

Installation

We recommend Python 3.10+, PyTorch 1.11.0+, and transformers v4.41.0+.

pip install -U sentence-transformers

See Installation in the docs for uv, conda, source, and editable installs, CUDA setup, and extras ([image], [audio], [video], [train], [onnx], [openvino], [dev]).

Getting Started

See Quickstart in our documentation.

Embedding Models

First download a pretrained embedding a.k.a. Sentence Transformer model.

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

Then provide some texts to the model.

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# => (3, 384)

And that's already it. We now have numpy arrays with the embeddings, one for each text. We can use these to compute similarities.

similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.6660, 0.1046],
#         [0.6660, 1.0000, 0.1411],
#         [0.1046, 0.1411, 1.0000]])

Reranker Models

First download a pretrained reranker a.k.a. Cross Encoder model.

from sentence_transformers import CrossEncoder

# 1. Load a pretrained CrossEncoder model
model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L6-v2")

Then provide some texts to the model.

# The texts for which to predict similarity scores
query = "How many people live in Berlin?"
passages = [
    "Berlin had a population of 3,520,031 registered inhabitants in an area of 891.82 square kilometers.",
    "Berlin has a yearly total of about 135 million day visitors, making it one of the most-visited cities in the European Union.",
    "In 2013 around 600,000 Berliners were registered in one of the more than 2,300 sport and fitness clubs.",
]

# 2a. predict scores for pairs of texts
scores = model.predict([(query, passage) for passage in passages])
print(scores)
# => [8.607139 5.506266 6.352977]

And we're good to go. You can also use model.rank to avoid having to perform the reranking manually:

# 2b. Rank a list of passages for a query
ranks = model.rank(query, passages, return_documents=True)

print("Query:", query)
for rank in ranks:
    print(f"- #{rank['corpus_id']} ({rank['score']:.2f}): {rank['text']}")
"""
Query: How many people live in Berlin?
- #0 (8.61): Berlin had a population of 3,520,031 registered inhabitants in an area of 891.82 square kilometers.
- #2 (6.35): In 2013 around 600,000 Berliners were registered in one of the more than 2,300 sport and fitness clubs.
- #1 (5.51): Berlin has a yearly total of about 135 million day visitors, making it one of the most-visited cities in the European Union.
"""

Sparse Encoder Models

First download a pretrained sparse embedding a.k.a. Sparse Encoder model.


from sentence_transformers import SparseEncoder

# 1. Load a pretrained SparseEncoder model
model = SparseEncoder("naver/splade-cocondenser-ensembledistil")

# The sentences to encode
sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]

# 2. Calculate sparse embeddings by calling model.encode()
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 30522] - sparse representation with vocabulary size dimensions

# 3. Calculate the embedding similarities
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[   35.629,     9.154,     0.098],
#         [    9.154,    27.478,     0.019],
#         [    0.098,     0.019,    29.553]])

# 4. Check sparsity stats
stats = SparseEncoder.sparsity(embeddings)
print(f"Sparsity: {stats['sparsity_ratio']:.2%}")
# Sparsity: 99.84%

Pre-Trained Models

We provide a large list of pretrained models for more than 100 languages. Some models are general purpose models, while others produce embeddings for specific use cases.

Training

Tip: Using an AI coding agent (Claude Code, Codex, Cursor, Gemini CLI, ...)? Install the train-sentence-transformers Hugging Face Agent Skill via hf skills add train-sentence-transformers [--claude] [--global] and ask your agent to fine-tune a model on your data.

This framework allows you to fine-tune your own sentence embedding methods, so that you get task-specific sentence embeddings. You have various options to choose from in order to get perfect sentence embeddings for your specific task.

Embedding Models
Sentence Transformer > Training Overview
Sentence Transformer > Training Examples or training examples on GitHub.
Reranker Models
Cross Encoder > Training Overview
Cross Encoder > Training Examples or training examples on GitHub.
Sparse Embedding Models
Sparse Encoder > Training Overview
Sparse Encoder > Training Examples or training examples on GitHub.

Some highlights across the different types of training are:

Support of various transformer networks including BERT, RoBERTa, XLM-R, DistilBERT, Electra, BART, ...
Multilingual and multi-task learning
Evaluation during training to find optimal model
20+ loss functions for embedding models, 10+ loss functions for reranker models and 10+ loss functions for sparse embedding models, allowing you to tune models specifically for semantic search, paraphrase mining, semantic similarity comparison, clustering, triplet loss, contrastive loss, etc.

Companion Blog Posts

The following Hugging Face blog posts complement this documentation with narrative walkthroughs and full training examples:

Training guides:

Training and Finetuning Embedding Models: end-to-end training of bi-encoder embedding models.
Training and Finetuning Reranker Models: training Cross Encoder models for the second stage of retrieve-and-rerank pipelines.
Training and Finetuning Sparse Embedding Models: training SPLADE and other sparse encoders.

Multimodal:

Multimodal Embedding & Reranker Models: using text, image, audio, and video models through a single API.
Training and Finetuning Multimodal Embedding & Reranker Models: training multimodal models, with a Visual Document Retrieval walkthrough.

Efficiency techniques:

Introduction to Matryoshka Embedding Models: variable-size embeddings that can be truncated with minimal quality loss.
Train 400x faster Static Embedding Models: CPU-friendly embedding models without attention.
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval: post-training compression of embedding vectors.

Application Examples

You can use this framework for:

Computing Sentence Embeddings
Dense Embeddings
Sparse Embeddings
Semantic Textual Similarity
Dense STS
Sparse STS
Semantic Search
Dense Search
Sparse Search
Retrieve & Re-Rank
Dense only Retrieval
Sparse/Dense/Hybrid Retrieval
Clustering
Paraphrase Mining
Translated Sentence Mining
Multilingual Image Search, Clustering & Duplicate Detection

and many more use-cases.

For all examples, see examples/sentence_transformer/applications.

Development setup

After cloning the repo (or a fork) to your machine, in a virtual environment, run:

python -m pip install -e ".[dev]"

pre-commit install

To test your changes, run:

pytest

Citing & Authors

If you find this repository helpful, feel free to cite our publication Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks:

```bibtex @inproceedings{reimers-2019-sentence-bert, title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks", author = "Reimers, Nils and Gurevych, Iryna", booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing", month = "11", y

Core symbols most depended-on inside this repo

push_to_hub

called by 114

sentence_transformers/base/model.py

save_pretrained

called by 104

sentence_transformers/base/model.py

encode

called by 86

sentence_transformers/sparse_encoder/model.py

encode

called by 81

sentence_transformers/sentence_transformer/model.py

mean

called by 69

sentence_transformers/base/model_card.py

mine_hard_negatives

called by 69

sentence_transformers/util/hard_negatives.py

predict

called by 66

sentence_transformers/cross_encoder/model.py

infer_modality

called by 55

sentence_transformers/base/modality.py

Shape

Method 1,601

Function 1,084

Class 360

Route 38

Languages

Python99%

TypeScript1%

Modules by API surface

tests/base/test_modality.py200 symbols

tests/base/modules/test_transformer.py182 symbols

tests/base/test_model_card.py138 symbols

tests/base/test_model.py95 symbols

tests/base/modules/test_router.py85 symbols

sentence_transformers/base/model_card.py69 symbols

sentence_transformers/base/modules/transformer.py57 symbols

tests/util/test_hard_negatives.py56 symbols

sentence_transformers/base/model.py51 symbols

tests/cross_encoder/test_model.py50 symbols

tests/sentence_transformer/test_model.py48 symbols

tests/util/test_decorators.py38 symbols

Used by 21 indexed graphs manifest dependencies, hub-wide

github.com/666ghj/BettaFish

github.com/Aider-AI/aider

github.com/MaartenGr/BERTopic

github.com/MaartenGr/KeyBERT

github.com/MemoriLabs/Memori

github.com/PKU-YuanGroup/Helios

github.com/Shubhamsaboo/awesome-llm-apps

github.com/TideDra/zotero-arxiv-daily

github.com/agent0ai/agent-zero

github.com/arc53/DocsGPT

… +11 more

Dependencies from manifests, versioned

huggingface-hub0.23.0 · 1×

numpy1.20.0 · 1×

scikit-learn0.22.0 · 1×

scipy1.0.0 · 1×

torch1.11.0 · 1×

tqdm4.0.0 · 1×

typing_extensions4.5.0 · 1×

For agents

$ claude mcp add sentence-transformers \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact