Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ licenses:
|
|
11 |
|
12 |
## General concept of the model
|
13 |
|
14 |
-
This model is trained on the dataset of inappropriate messages of the Russian language. The concept of inappropriateness is described [in this article ](https://
|
15 |
|
16 |
The model was trained, validated and tested only on the samples with 100% confidence, which allowed to get the following metrics on test set:
|
17 |
|
|
|
11 |
|
12 |
## General concept of the model
|
13 |
|
14 |
+
This model is trained on the dataset of inappropriate messages of the Russian language. The concept of inappropriateness is described [in this article ](https://www.aclweb.org/anthology/2021.bsnlp-1.4/) presented at the workshop for Balto-Slavic NLP at the EACL-2021 conference. Please note that this article describes the first version of the dataset, while the model is trained on the extended version of the dataset open-sourced on our [GitHub](https://github.com/skoltech-nlp/inappropriate-sensitive-topics/blob/main/Version2/appropriateness/Appropriateness.csv) or on [kaggle](https://www.kaggle.com/nigula/russianinappropriatemessages). The properties of the dataset is the same as the one described in the article, the only difference is the size.
|
15 |
|
16 |
The model was trained, validated and tested only on the samples with 100% confidence, which allowed to get the following metrics on test set:
|
17 |
|