Update README.md
Browse files
README.md
CHANGED
@@ -37,12 +37,20 @@ The model was trained, validated and tested only on the samples with 100% confid
|
|
37 |
If you find this repository helpful, feel free to cite our publication:
|
38 |
|
39 |
```
|
40 |
-
@inproceedings{babakov-etal-2021-
|
41 |
-
title = "Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation",
|
42 |
-
author = "Babakov, Nikolay
|
43 |
-
|
44 |
-
|
|
|
|
|
|
|
|
|
45 |
year = "2021",
|
46 |
-
address = "
|
|
|
|
|
|
|
|
|
47 |
}
|
48 |
```
|
|
|
37 |
If you find this repository helpful, feel free to cite our publication:
|
38 |
|
39 |
```
|
40 |
+
@inproceedings{babakov-etal-2021-detecting,
|
41 |
+
title = "Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company{'}s Reputation",
|
42 |
+
author = "Babakov, Nikolay and
|
43 |
+
Logacheva, Varvara and
|
44 |
+
Kozlova, Olga and
|
45 |
+
Semenov, Nikita and
|
46 |
+
Panchenko, Alexander",
|
47 |
+
booktitle = "Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing",
|
48 |
+
month = apr,
|
49 |
year = "2021",
|
50 |
+
address = "Kiyv, Ukraine",
|
51 |
+
publisher = "Association for Computational Linguistics",
|
52 |
+
url = "https://www.aclweb.org/anthology/2021.bsnlp-1.4",
|
53 |
+
pages = "26--36",
|
54 |
+
abstract = "Not all topics are equally {``}flammable{''} in terms of toxicity: a calm discussion of turtles or fishing less often fuels inappropriate toxic dialogues than a discussion of politics or sexual minorities. We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labelling a dataset for appropriateness. While toxicity in user-generated data is well-studied, we aim at defining a more fine-grained notion of inappropriateness. The core of inappropriateness is that it can harm the reputation of a speaker. This is different from toxicity in two respects: (i) inappropriateness is topic-related, and (ii) inappropriate message is not toxic but still unacceptable. We collect and release two datasets for Russian: a topic-labelled dataset and an appropriateness-labelled dataset. We also release pre-trained classification models trained on this data.",
|
55 |
}
|
56 |
```
|