feat: sentence transformers
README.md CHANGED
@@ -2902,7 +2902,9 @@ base_model:
 
 # ModernBERT Embed
 
-ModernBERT Embed is an embedding model trained from [ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base), bringing the new advances of ModernBERT to embeddings!
+ModernBERT Embed is an embedding model trained from [ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base), bringing the new advances of ModernBERT to embeddings!
+
+Trained on the [Nomic Embed](https://arxiv.org/abs/2402.01613) weakly-supervised and supervised datasets, `modernbert-embed` also supports Matryoshka Representation Learning dimensions of 256, reducing memory by 3x with minimal performance loss.
 
 ## Performance
 
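The new paragraph in the hunk above advertises Matryoshka Representation Learning at 256 dimensions. As a minimal sketch of how that is typically used, and not part of this commit, the snippet below keeps only the first 256 dimensions of the full embeddings and re-normalizes them; it assumes the leading dimensions form the Matryoshka prefix and that cosine similarity is the intended metric.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("nomic-ai/modernbert-embed")

sentences = [
    "search_query: What is TSNE?",
    "search_query: Who is Laurens van der Maaten?",
]

# Full-size embeddings as a (num_sentences, dim) numpy array.
full = model.encode(sentences)

# Keep the first 256 Matryoshka dimensions and re-normalize so that
# dot products are again cosine similarities.
truncated = full[:, :256]
truncated = truncated / np.linalg.norm(truncated, axis=1, keepdims=True)

print(truncated.shape)          # (2, 256)
print(truncated @ truncated.T)  # cosine similarity matrix of the truncated embeddings
```

Recent sentence-transformers releases also expose a `truncate_dim` argument on the `SentenceTransformer` constructor; the manual slice above just makes the mechanics explicit.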
@@ -2958,6 +2960,24 @@ embeddings = F.normalize(embeddings, p=2, dim=1)
 print(embeddings)
 ```
 
+### Sentence Transformers
+
+```python
+from sentence_transformers import SentenceTransformer
+
+model = SentenceTransformer(
+    "nomic-ai/modernbert-embed",
+)
+
+# Verify that everything works as expected
+embeddings = model.encode(['search_query: What is TSNE?', 'search_query: Who is Laurens van der Maaten?'])
+print(embeddings.shape)
+
+similarities = model.similarity(embeddings, embeddings)
+print(similarities)
+```
+
+
 ## Training
 
 Click the Nomic Atlas map below to visualize a 5M sample of our contrastive pretraining data!
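As a follow-up to the added Sentence Transformers snippet: the `search_query:` strings indicate that inputs carry task prefixes. The sketch below is not part of this commit; it assumes the model follows the Nomic Embed convention of `search_query:` for queries and `search_document:` for passages (plausible since it is trained on the Nomic Embed datasets, but confirm the exact prefixes against the model card), and the document texts are invented for illustration.

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("nomic-ai/modernbert-embed")

# Hypothetical corpus; the 'search_document:' prefix is assumed from the
# Nomic Embed convention and should be checked against the model card.
documents = [
    "search_document: t-SNE is a technique for visualizing high-dimensional data.",
    "search_document: ModernBERT is a long-context encoder-only transformer.",
]
query = "search_query: What is TSNE?"

doc_embeddings = model.encode(documents)
query_embedding = model.encode([query])

# similarity() returns a (num_queries, num_documents) score matrix,
# so the highest-scoring column is the best match for the query.
scores = model.similarity(query_embedding, doc_embeddings)
print(scores)
print(documents[int(scores.argmax())])
```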