Update README.md
README.md
CHANGED
@@ -107,7 +107,7 @@ Use the code below to get started with the model.
 - This section provides information about throughput, start/end time, checkpoint size if relevant, etc.
 
 [More Information Needed]
-
+-->
 ## Evaluation
 
 This section describes the evaluation protocols and provides the results.
@@ -116,26 +116,24 @@ Use the code below to get started with the model.
 
 #### Testing Data
 
-This
-
-[More Information Needed]
+This model was tested on `jigsaw_toxic_pred` testset.
 
+<!--
 #### Factors
 
 These are the things the evaluation is disaggregating by, e.g., subpopulations or domains.
 
 [More Information Needed]
-
+-->
 #### Metrics
 
-
-
-[More Information Needed]
+Model was evaluated using `perplexity` (on the MLM task).
 
 ### Results
 
-
+Perplexity: _1.03_
 
+<!--
 #### Summary
 
 
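
Note on the added metric: the change reports perplexity on the MLM task but does not show how it was computed. A common convention is to exponentiate the mean cross-entropy loss over the masked tokens; whether the author used exactly this is an assumption. The sketch below illustrates that convention only. The function name `perplexity` and the sample losses are hypothetical and are not taken from the model card.

```python
import math


def perplexity(token_nlls):
    """Return exp(mean negative log-likelihood), with losses in nats.

    `token_nlls` is assumed to hold one cross-entropy value per masked token.
    """
    if not token_nlls:
        raise ValueError("need at least one masked-token loss")
    return math.exp(sum(token_nlls) / len(token_nlls))


# Hypothetical per-token losses for illustration; not results from this model.
example_nlls = [0.1, 0.3, 0.2]
ppl = perplexity(example_nlls)
print(f"perplexity = {ppl:.4f}")
```

Under this convention, a perplexity of 1.03 would correspond to a mean masked-token loss of roughly 0.03 nats, since ln(1.03) ≈ 0.0296.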