cointegrated
commited on
Commit
•
5122e9f
1
Parent(s):
fdac820
Update README.md
Browse files
README.md
CHANGED
@@ -1,4 +1,5 @@
|
|
1 |
This is a version of paraphrase detector by DeepPavlov ([details in the documentation](http://docs.deeppavlov.ai/en/master/features/overview.html#ranking-model-docs)) ported to the `Transformers` format.
|
|
|
2 |
All credit goes to the authors of DeepPavlov.
|
3 |
|
4 |
The model has been trained on the dataset from http://paraphraser.ru/.
|
@@ -24,4 +25,6 @@ print(compare_texts('Сегодня на улице хорошая погода'
|
|
24 |
# [0.16524374 0.8347562 ]
|
25 |
```
|
26 |
|
27 |
-
P.S. In the DeepPavlov repository, the tokenizer uses `max_seq_length=64`.
|
|
|
|
|
|
1 |
This is a version of paraphrase detector by DeepPavlov ([details in the documentation](http://docs.deeppavlov.ai/en/master/features/overview.html#ranking-model-docs)) ported to the `Transformers` format.
|
2 |
+
|
3 |
All credit goes to the authors of DeepPavlov.
|
4 |
|
5 |
The model has been trained on the dataset from http://paraphraser.ru/.
|
|
|
25 |
# [0.16524374 0.8347562 ]
|
26 |
```
|
27 |
|
28 |
+
P.S. In the DeepPavlov repository, the tokenizer uses `max_seq_length=64`.
|
29 |
+
This model, however, uses `model_max_length=512`.
|
30 |
+
Therefore, the results on long texts may be inadequate.
|