rebeccaqian commited on
Commit
dd930a6
·
1 Parent(s): c707c2b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +79 -1
README.md CHANGED
@@ -1,3 +1,81 @@
1
  ---
2
- license: mit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ license:
5
+ - mit
6
+ library_name: "pytorch"
7
+ multilinguality:
8
+ - monolingual
9
+ pretty_name: perturber
10
+ datasets:
11
+ - panda
12
+ tags:
13
+ - counterfactual
14
+ - perturb
15
+ - fairness
16
+ - nlp
17
+ - demographic
18
+ - diverse
19
+ - gender
20
+ - non-binary
21
+ - race
22
+ - age
23
+ metrics:
24
+ - bleu
25
  ---
26
+
27
+ # Model Card for the perturber
28
+
29
+ ## Table of Contents
30
+ - [Table of Contents](#table-of-contents)
31
+ - [Model Description](#model-description)
32
+ - [Model Summary](#dataset-summary)
33
+ - [Languages](#languages)
34
+ - [Dataset Structure](#dataset-structure)
35
+ - [Data Instances](#data-instances)
36
+ - [Data Fields](#data-fields)
37
+ - [Data Splits](#data-splits)
38
+ - [Dataset Creation](#dataset-creation)
39
+ - [Curation Rationale](#curation-rationale)
40
+ - [Source Data](#source-data)
41
+ - [Annotations](#annotations)
42
+ - [Personal and Sensitive Information](#personal-and-sensitive-information)
43
+ - [Considerations for Using the Data](#considerations-for-using-the-data)
44
+ - [Social Impact of Dataset](#social-impact-of-dataset)
45
+ - [Discussion of Biases](#discussion-of-biases)
46
+ - [Other Known Limitations](#other-known-limitations)
47
+ - [Additional Information](#additional-information)
48
+ - [Dataset Curators](#dataset-curators)
49
+ - [Licensing Information](#licensing-information)
50
+ - [Citation Information](#citation-information)
51
+ - [Contributions](#contributions)
52
+
53
+ ## Model Description
54
+
55
+ The perturber is a seq2seq controlled generation model that rewrites text along a specified demographic axis and attribute.
56
+
57
+ The perturber takes in (i) a source text snippet, (ii) a word in the snippet referring to a demographic group, and (iii) a new target demographic attribute, and generates a perturbed snippet that refers to the target demographic attribute, while preserving overall meaning.
58
+
59
+ - **Repository:** https://github.com/facebookresearch/ResponsibleNLP/
60
+ - **Paper:** https://arxiv.org/abs/2205.12586
61
62
+ - **License:** MIT
63
+
64
+ ### Model Summary
65
+
66
+ The perturber is a finetuned BART model (Lewis et al., 2020) with 24 layers, 1024 hidden size, 406M parameters, and 16 attention heads. To train the perturber in the original paper, we finetune BART on PANDA using the ParlAI library.
67
+
68
+ This model release is separately trained using the HuggingFace transformers library, with the same parameters as the ParlAI model.
69
+
70
+ Currently the perturber supports text rewriting along three axes and several attributes:
71
+ - **gender:** man, woman, non-binary
72
+ - **race:** black, white, asian, hispanic, native-american, pacific-islander
73
+ - **age:** child, young, middle-aged, senior, adult
74
+
75
+ ### Uses
76
+
77
+ The perturber is intended for use by fairness researchers and engineers working on demographic debiasing applications. The perturber supports counterfactual perturbation augmentation.
78
+
79
+ ### Contributions
80
+
81
+ Thanks to [@Rebecca-Qian](https://github.com/Rebecca-Qian) for adding this model.