DylanJHJ commited on
Commit
57e2c14
·
1 Parent(s): 8be182a

add merged models gtr

Browse files
.gitattributes CHANGED
@@ -33,3 +33,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ spiece.model filter=lfs diff=lfs merge=lfs -text
37
+ tokenizer_config.json filter=lfs diff=lfs merge=lfs -text
38
+ config.json filter=lfs diff=lfs merge=lfs -text
39
+ pytorch_model.bin filter=lfs diff=lfs merge=lfs -text
40
+ sentence-transformers filter=lfs diff=lfs merge=lfs -text
41
+ modules.json filter=lfs diff=lfs merge=lfs -text
42
+ special_tokens_map.json filter=lfs diff=lfs merge=lfs -text
43
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,12 +1,51 @@
1
- ---
2
- license: apache-2.0
3
- language:
4
- - en
5
- tags:
6
- - dense retrieval
7
- ---
8
-
9
- This model checkpoint is identical to https://huggingface.co/sentence-transformers/gtr-t5-base.
10
- But include the model's dense layer.
11
-
12
- Check the model class detail at: ...
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: sentence-similarity
3
+ language: en
4
+ license: apache-2.0
5
+ tags:
6
+ - sentence-transformers
7
+ - feature-extraction
8
+ - sentence-similarity
9
+ - transformers
10
+ ---
11
+
12
+ # sentence-transformers/gtr-t5-base
13
+
14
+ This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space. The model was specifically trained for the task of sematic search.
15
+
16
+ This model was converted from the Tensorflow model [gtr-base-1](https://tfhub.dev/google/gtr/gtr-base/1) to PyTorch. When using this model, have a look at the publication: [Large Dual Encoders Are Generalizable Retrievers](https://arxiv.org/abs/2112.07899). The tfhub model and this PyTorch model can produce slightly different embeddings, however, when run on the same benchmarks, they produce identical results.
17
+
18
+ The model uses only the encoder from a T5-base model. The weights are stored in FP16.
19
+
20
+
21
+ ## Usage (Sentence-Transformers)
22
+
23
+ Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
24
+
25
+ ```
26
+ pip install -U sentence-transformers
27
+ ```
28
+
29
+ Then you can use the model like this:
30
+
31
+ ```python
32
+ from sentence_transformers import SentenceTransformer
33
+ sentences = ["This is an example sentence", "Each sentence is converted"]
34
+
35
+ model = SentenceTransformer('sentence-transformers/gtr-t5-base')
36
+ embeddings = model.encode(sentences)
37
+ print(embeddings)
38
+ ```
39
+
40
+ The model requires sentence-transformers version 2.2.0 or newer.
41
+
42
+ ## Evaluation Results
43
+
44
+ For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name=sentence-transformers/gtr-t5-base)
45
+
46
+
47
+
48
+ ## Citing & Authors
49
+
50
+ If you find this model helpful, please cite the respective publication:
51
+ [Large Dual Encoders Are Generalizable Retrievers](https://arxiv.org/abs/2112.07899)
config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2c505a1fdbd38cf5f6d3ad747f9338345d41a0dc029b305f1525c2bef839d032
3
+ size 1381
modules.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:41251094dfd3e7ff81a85ba97b3f4788df47e4039f1bc5037575b43c028beb40
3
+ size 461
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0115a9109a9809523f1ef8d28756657458ececb829fa9faba410b42f1e12409
3
+ size 221653921
sentence-transformers/.gitattributes ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ config_sentence_transformers.json filter=lfs diff=lfs merge=lfs -text
2
+ convert-gtr.ipynb filter=lfs diff=lfs merge=lfs -text
3
+ convert_to_fp16.py filter=lfs diff=lfs merge=lfs -text
4
+ pytorch_model.bin filter=lfs diff=lfs merge=lfs -text
5
+ sentence_bert_config.json filter=lfs diff=lfs merge=lfs -text
sentence-transformers/config_sentence_transformers.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f631e4bd49910419948c5e9134ab07e0d63b22b8cebbe6875f99a0712bcadd4d
3
+ size 122
sentence-transformers/convert-gtr.ipynb ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:665ac3935da4e9b2bc7ebd3355201ca4484b5777b44aa9351c9eb798dd4bb4bb
3
+ size 90620
sentence-transformers/convert_to_fp16.py ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1dbd5bfc49c82ef29c7b91f00562069f11551b024782fc4260609329addedfd3
3
+ size 198
sentence-transformers/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fd78a39e92ec5e96213ad3284d95360ba13c020bbda0841479b369612b19964c
3
+ size 219303530
sentence-transformers/sentence_bert_config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec8e29d6dcb61b611b7d3fdd2982c4524e6ad985959fa7194eacfb655a8d0d51
3
+ size 53
special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4720c0fddbe4c5991334f85ad7073d9bd0a294a8ba4641a2f8dab614ca825949
3
+ size 1786
spiece.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d60acb128cf7b7f2536e8f38a5b18a05535c9e14c7a355904270e15b0945ea86
3
+ size 791656
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3dd4249b57e28dda22219397c091a4a11655c537e0d8a9b5e2efcf43f39c8773
3
+ size 1387554
tokenizer_config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6973652f30c2c8359a70d8fe41f9af4f13b3225d9520dbecfe0b7f0c69f997f3
3
+ size 1923