Discrepancy in max tokens
Browse filesthe config states 512, which one is correct?
README.md
CHANGED
@@ -114,7 +114,7 @@ organized by Hugging Face. We developed this model as part of the project:
|
|
114 |
Our model is intended to be used as a sentence and short paragraph encoder. Given an input text, it outputs a vector which captures
|
115 |
the semantic information. The sentence vector may be used for information retrieval, clustering or sentence similarity tasks.
|
116 |
|
117 |
-
By default, input text longer than
|
118 |
|
119 |
|
120 |
## Training procedure
|
|
|
114 |
Our model is intended to be used as a sentence and short paragraph encoder. Given an input text, it outputs a vector which captures
|
115 |
the semantic information. The sentence vector may be used for information retrieval, clustering or sentence similarity tasks.
|
116 |
|
117 |
+
By default, input text longer than 512 word pieces is truncated.
|
118 |
|
119 |
|
120 |
## Training procedure
|