add merged models gtr
Browse files- .gitattributes +8 -0
- README.md +51 -12
- config.json +3 -0
- modules.json +3 -0
- pytorch_model.bin +3 -0
- sentence-transformers/.gitattributes +5 -0
- sentence-transformers/config_sentence_transformers.json +3 -0
- sentence-transformers/convert-gtr.ipynb +3 -0
- sentence-transformers/convert_to_fp16.py +3 -0
- sentence-transformers/pytorch_model.bin +3 -0
- sentence-transformers/sentence_bert_config.json +3 -0
- special_tokens_map.json +3 -0
- spiece.model +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
spiece.model filter=lfs diff=lfs merge=lfs -text
|
37 |
+
tokenizer_config.json filter=lfs diff=lfs merge=lfs -text
|
38 |
+
config.json filter=lfs diff=lfs merge=lfs -text
|
39 |
+
pytorch_model.bin filter=lfs diff=lfs merge=lfs -text
|
40 |
+
sentence-transformers filter=lfs diff=lfs merge=lfs -text
|
41 |
+
modules.json filter=lfs diff=lfs merge=lfs -text
|
42 |
+
special_tokens_map.json filter=lfs diff=lfs merge=lfs -text
|
43 |
+
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -1,12 +1,51 @@
|
|
1 |
-
---
|
2 |
-
|
3 |
-
language:
|
4 |
-
-
|
5 |
-
tags:
|
6 |
-
-
|
7 |
-
|
8 |
-
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
pipeline_tag: sentence-similarity
|
3 |
+
language: en
|
4 |
+
license: apache-2.0
|
5 |
+
tags:
|
6 |
+
- sentence-transformers
|
7 |
+
- feature-extraction
|
8 |
+
- sentence-similarity
|
9 |
+
- transformers
|
10 |
+
---
|
11 |
+
|
12 |
+
# sentence-transformers/gtr-t5-base
|
13 |
+
|
14 |
+
This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space. The model was specifically trained for the task of sematic search.
|
15 |
+
|
16 |
+
This model was converted from the Tensorflow model [gtr-base-1](https://tfhub.dev/google/gtr/gtr-base/1) to PyTorch. When using this model, have a look at the publication: [Large Dual Encoders Are Generalizable Retrievers](https://arxiv.org/abs/2112.07899). The tfhub model and this PyTorch model can produce slightly different embeddings, however, when run on the same benchmarks, they produce identical results.
|
17 |
+
|
18 |
+
The model uses only the encoder from a T5-base model. The weights are stored in FP16.
|
19 |
+
|
20 |
+
|
21 |
+
## Usage (Sentence-Transformers)
|
22 |
+
|
23 |
+
Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
|
24 |
+
|
25 |
+
```
|
26 |
+
pip install -U sentence-transformers
|
27 |
+
```
|
28 |
+
|
29 |
+
Then you can use the model like this:
|
30 |
+
|
31 |
+
```python
|
32 |
+
from sentence_transformers import SentenceTransformer
|
33 |
+
sentences = ["This is an example sentence", "Each sentence is converted"]
|
34 |
+
|
35 |
+
model = SentenceTransformer('sentence-transformers/gtr-t5-base')
|
36 |
+
embeddings = model.encode(sentences)
|
37 |
+
print(embeddings)
|
38 |
+
```
|
39 |
+
|
40 |
+
The model requires sentence-transformers version 2.2.0 or newer.
|
41 |
+
|
42 |
+
## Evaluation Results
|
43 |
+
|
44 |
+
For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name=sentence-transformers/gtr-t5-base)
|
45 |
+
|
46 |
+
|
47 |
+
|
48 |
+
## Citing & Authors
|
49 |
+
|
50 |
+
If you find this model helpful, please cite the respective publication:
|
51 |
+
[Large Dual Encoders Are Generalizable Retrievers](https://arxiv.org/abs/2112.07899)
|
config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2c505a1fdbd38cf5f6d3ad747f9338345d41a0dc029b305f1525c2bef839d032
|
3 |
+
size 1381
|
modules.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:41251094dfd3e7ff81a85ba97b3f4788df47e4039f1bc5037575b43c028beb40
|
3 |
+
size 461
|
pytorch_model.bin
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d0115a9109a9809523f1ef8d28756657458ececb829fa9faba410b42f1e12409
|
3 |
+
size 221653921
|
sentence-transformers/.gitattributes
ADDED
@@ -0,0 +1,5 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
config_sentence_transformers.json filter=lfs diff=lfs merge=lfs -text
|
2 |
+
convert-gtr.ipynb filter=lfs diff=lfs merge=lfs -text
|
3 |
+
convert_to_fp16.py filter=lfs diff=lfs merge=lfs -text
|
4 |
+
pytorch_model.bin filter=lfs diff=lfs merge=lfs -text
|
5 |
+
sentence_bert_config.json filter=lfs diff=lfs merge=lfs -text
|
sentence-transformers/config_sentence_transformers.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f631e4bd49910419948c5e9134ab07e0d63b22b8cebbe6875f99a0712bcadd4d
|
3 |
+
size 122
|
sentence-transformers/convert-gtr.ipynb
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:665ac3935da4e9b2bc7ebd3355201ca4484b5777b44aa9351c9eb798dd4bb4bb
|
3 |
+
size 90620
|
sentence-transformers/convert_to_fp16.py
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1dbd5bfc49c82ef29c7b91f00562069f11551b024782fc4260609329addedfd3
|
3 |
+
size 198
|
sentence-transformers/pytorch_model.bin
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fd78a39e92ec5e96213ad3284d95360ba13c020bbda0841479b369612b19964c
|
3 |
+
size 219303530
|
sentence-transformers/sentence_bert_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ec8e29d6dcb61b611b7d3fdd2982c4524e6ad985959fa7194eacfb655a8d0d51
|
3 |
+
size 53
|
special_tokens_map.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4720c0fddbe4c5991334f85ad7073d9bd0a294a8ba4641a2f8dab614ca825949
|
3 |
+
size 1786
|
spiece.model
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d60acb128cf7b7f2536e8f38a5b18a05535c9e14c7a355904270e15b0945ea86
|
3 |
+
size 791656
|
tokenizer.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3dd4249b57e28dda22219397c091a4a11655c537e0d8a9b5e2efcf43f39c8773
|
3 |
+
size 1387554
|
tokenizer_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6973652f30c2c8359a70d8fe41f9af4f13b3225d9520dbecfe0b7f0c69f997f3
|
3 |
+
size 1923
|