Oskar van der Wal committed: Update app.py
app.py
CHANGED
@@ -77,7 +77,8 @@ iface = gradio.Interface(
 CrowS-Pairs is a dataset with pairs of more and less stereotypical examples for different social groups (e.g., gender and nationality stereotypes).
 We sample 10 random pairs from CrowS-Pairs and show whether the stereotypical example gets a higher score ('is more likely').
 If GPT-2 systematically prefers the stereotypical examples, it has probably learnt these stereotypes from the training data.
-**DISCLAIMER: How to measure bias in language models is not trivial and an active area of research
+**DISCLAIMER: How to measure bias in language models is not trivial and an active area of research.
+CrowS-Pairs is only one bias benchmark, and here you can probably find some examples that are nonsensical, with typos, or containing stereotypes that are only relevant in the American cultural context.**
 """,
 )
 
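
The description text in this hunk says the app samples 10 random pairs and checks whether the stereotypical sentence gets the higher score. The rest of app.py is not shown here, so the following is only a minimal sketch of that comparison under stated assumptions: the names `stereotype_preference_rate`, `toy_score`, and `toy_pairs` are hypothetical, and `toy_score` is a placeholder standing in for a real language-model score such as a GPT-2 log-likelihood.

```python
import random
from typing import Callable, Sequence, Tuple

# Each pair is (more stereotypical sentence, less stereotypical sentence).
Pair = Tuple[str, str]


def stereotype_preference_rate(
    pairs: Sequence[Pair],
    score: Callable[[str], float],
    n_samples: int = 10,
    seed: int = 0,
) -> float:
    """Sample up to n_samples pairs and return the fraction in which the
    more stereotypical sentence receives the strictly higher score."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = rng.sample(list(pairs), k=min(n_samples, len(pairs)))
    preferred = sum(1 for stereo, anti in sample if score(stereo) > score(anti))
    return preferred / len(sample)


def toy_score(sentence: str) -> float:
    """Placeholder scorer (assumption): shorter sentences score higher.
    A real app would return a model-based sentence score instead."""
    return -float(len(sentence))


# Hypothetical stand-in data; the real pairs would come from CrowS-Pairs.
toy_pairs = [
    (f"sentence {i} variant A", f"sentence {i} variant B, longer")
    for i in range(12)
]

if __name__ == "__main__":
    rate = stereotype_preference_rate(toy_pairs, toy_score)
    print(f"Stereotypical sentence preferred in {rate:.0%} of sampled pairs")
```

A rate well above 50% over many samples would point in the direction the description mentions, but as the disclaimer notes, a single benchmark and a sample of 10 pairs is weak evidence on its own.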