Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v4 config and test split of mathemakitten/winobias_antistereotype_test_cot_v4
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot_v4 config and test split of the [mathemakitten/winobias_antistereotype_test_cot_v4](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot_v4) dataset by
@mathemakitten
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot_v4-math-54ae93-2018366736).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot_v4).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot_v4).
@@ -111,6 +111,23 @@ model-index:
|
|
111 |
type: loss
|
112 |
value: 1.539870785999474
|
113 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
114 |
---
|
115 |
|
116 |
# OPT : Open Pre-trained Transformer Language Models
|
|
|
111 |
type: loss
|
112 |
value: 1.539870785999474
|
113 |
verified: true
|
114 |
+
- task:
|
115 |
+
type: zero-shot-classification
|
116 |
+
name: Zero-Shot Text Classification
|
117 |
+
dataset:
|
118 |
+
name: mathemakitten/winobias_antistereotype_test_cot_v4
|
119 |
+
type: mathemakitten/winobias_antistereotype_test_cot_v4
|
120 |
+
config: mathemakitten--winobias_antistereotype_test_cot_v4
|
121 |
+
split: test
|
122 |
+
metrics:
|
123 |
+
- name: Accuracy
|
124 |
+
type: accuracy
|
125 |
+
value: 0.3131067961165049
|
126 |
+
verified: true
|
127 |
+
- name: Loss
|
128 |
+
type: loss
|
129 |
+
value: 1.4315469591985621
|
130 |
+
verified: true
|
131 |
---
|
132 |
|
133 |
# OPT : Open Pre-trained Transformer Language Models
|