leaderboard-pt-pr-bot commited on
Commit
12a2bfc
1 Parent(s): c210cd8

Fixing some errors of the leaderboard evaluation results in the ModelCard yaml

Browse files

The name of a few benchmarks are incorrect on the model metadata. This commit fixes some minor errors of the [last PR](1) on the ModelCard YAML metadata.

Files changed (1) hide show
  1. README.md +27 -1
README.md CHANGED
@@ -83,6 +83,19 @@ model-index:
83
  - type: f1_macro
84
  value: 48.84
85
  name: f1-macro
 
 
 
 
 
 
 
 
 
 
 
 
 
86
  - type: pearson
87
  value: 37.56
88
  name: pearson
@@ -110,7 +123,7 @@ model-index:
110
  name: Text Generation
111
  dataset:
112
  name: HateBR Binary
113
- type: eduagarcia/portuguese_benchmark
114
  split: test
115
  args:
116
  num_few_shot: 25
@@ -118,6 +131,19 @@ model-index:
118
  - type: f1_macro
119
  value: 85.0
120
  name: f1-macro
 
 
 
 
 
 
 
 
 
 
 
 
 
121
  - type: f1_macro
122
  value: 67.43
123
  name: f1-macro
 
83
  - type: f1_macro
84
  value: 48.84
85
  name: f1-macro
86
+ source:
87
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=lrds-code/boana-7b-instruct
88
+ name: Open Portuguese LLM Leaderboard
89
+ - task:
90
+ type: text-generation
91
+ name: Text Generation
92
+ dataset:
93
+ name: Assin2 STS
94
+ type: eduagarcia/portuguese_benchmark
95
+ split: test
96
+ args:
97
+ num_few_shot: 15
98
+ metrics:
99
  - type: pearson
100
  value: 37.56
101
  name: pearson
 
123
  name: Text Generation
124
  dataset:
125
  name: HateBR Binary
126
+ type: ruanchaves/hatebr
127
  split: test
128
  args:
129
  num_few_shot: 25
 
131
  - type: f1_macro
132
  value: 85.0
133
  name: f1-macro
134
+ source:
135
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=lrds-code/boana-7b-instruct
136
+ name: Open Portuguese LLM Leaderboard
137
+ - task:
138
+ type: text-generation
139
+ name: Text Generation
140
+ dataset:
141
+ name: PT Hate Speech Binary
142
+ type: hate_speech_portuguese
143
+ split: test
144
+ args:
145
+ num_few_shot: 25
146
+ metrics:
147
  - type: f1_macro
148
  value: 67.43
149
  name: f1-macro