Commit
·
dd930a6
1
Parent(s):
c707c2b
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,81 @@
|
|
1 |
---
|
2 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
license:
|
5 |
+
- mit
|
6 |
+
library_name: "pytorch"
|
7 |
+
multilinguality:
|
8 |
+
- monolingual
|
9 |
+
pretty_name: perturber
|
10 |
+
datasets:
|
11 |
+
- panda
|
12 |
+
tags:
|
13 |
+
- counterfactual
|
14 |
+
- perturb
|
15 |
+
- fairness
|
16 |
+
- nlp
|
17 |
+
- demographic
|
18 |
+
- diverse
|
19 |
+
- gender
|
20 |
+
- non-binary
|
21 |
+
- race
|
22 |
+
- age
|
23 |
+
metrics:
|
24 |
+
- bleu
|
25 |
---
|
26 |
+
|
27 |
+
# Model Card for the perturber
|
28 |
+
|
29 |
+
## Table of Contents
|
30 |
+
- [Table of Contents](#table-of-contents)
|
31 |
+
- [Model Description](#model-description)
|
32 |
+
- [Model Summary](#dataset-summary)
|
33 |
+
- [Languages](#languages)
|
34 |
+
- [Dataset Structure](#dataset-structure)
|
35 |
+
- [Data Instances](#data-instances)
|
36 |
+
- [Data Fields](#data-fields)
|
37 |
+
- [Data Splits](#data-splits)
|
38 |
+
- [Dataset Creation](#dataset-creation)
|
39 |
+
- [Curation Rationale](#curation-rationale)
|
40 |
+
- [Source Data](#source-data)
|
41 |
+
- [Annotations](#annotations)
|
42 |
+
- [Personal and Sensitive Information](#personal-and-sensitive-information)
|
43 |
+
- [Considerations for Using the Data](#considerations-for-using-the-data)
|
44 |
+
- [Social Impact of Dataset](#social-impact-of-dataset)
|
45 |
+
- [Discussion of Biases](#discussion-of-biases)
|
46 |
+
- [Other Known Limitations](#other-known-limitations)
|
47 |
+
- [Additional Information](#additional-information)
|
48 |
+
- [Dataset Curators](#dataset-curators)
|
49 |
+
- [Licensing Information](#licensing-information)
|
50 |
+
- [Citation Information](#citation-information)
|
51 |
+
- [Contributions](#contributions)
|
52 |
+
|
53 |
+
## Model Description
|
54 |
+
|
55 |
+
The perturber is a seq2seq controlled generation model that rewrites text along a specified demographic axis and attribute.
|
56 |
+
|
57 |
+
The perturber takes in (i) a source text snippet, (ii) a word in the snippet referring to a demographic group, and (iii) a new target demographic attribute, and generates a perturbed snippet that refers to the target demographic attribute, while preserving overall meaning.
|
58 |
+
|
59 |
+
- **Repository:** https://github.com/facebookresearch/ResponsibleNLP/
|
60 |
+
- **Paper:** https://arxiv.org/abs/2205.12586
|
61 |
+
- **Point of Contact:** [email protected], [email protected], [email protected], [email protected]
|
62 |
+
- **License:** MIT
|
63 |
+
|
64 |
+
### Model Summary
|
65 |
+
|
66 |
+
The perturber is a finetuned BART model (Lewis et al., 2020) with 24 layers, 1024 hidden size, 406M parameters, and 16 attention heads. To train the perturber in the original paper, we finetune BART on PANDA using the ParlAI library.
|
67 |
+
|
68 |
+
This model release is separately trained using the HuggingFace transformers library, with the same parameters as the ParlAI model.
|
69 |
+
|
70 |
+
Currently the perturber supports text rewriting along three axes and several attributes:
|
71 |
+
- **gender:** man, woman, non-binary
|
72 |
+
- **race:** black, white, asian, hispanic, native-american, pacific-islander
|
73 |
+
- **age:** child, young, middle-aged, senior, adult
|
74 |
+
|
75 |
+
### Uses
|
76 |
+
|
77 |
+
The perturber is intended for use by fairness researchers and engineers working on demographic debiasing applications. The perturber supports counterfactual perturbation augmentation.
|
78 |
+
|
79 |
+
### Contributions
|
80 |
+
|
81 |
+
Thanks to [@Rebecca-Qian](https://github.com/Rebecca-Qian) for adding this model.
|