leaderboard-pt-pr-bot committed on
Commit
7da319b
1 Parent(s): 91dfcdb

Fixing some errors in the leaderboard evaluation results in the ModelCard YAML

The names of a few benchmarks are incorrect in the model metadata. This commit fixes some minor errors from the [last PR](1) in the ModelCard YAML metadata.
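
One way to catch this kind of mismatch before it lands is to compare each entry's `dataset.type` against an expected value for its benchmark name. The following is a minimal sketch, assuming the `model-index` results have already been parsed into Python dicts. The `EXPECTED_DATASET_TYPE` table and the `find_type_mismatches` helper are hypothetical names, and the table holds only the values visible in this commit's diff.

```python
# Hypothetical lookup table: benchmark name -> dataset type, taken only
# from the values that appear in this commit's diff.
EXPECTED_DATASET_TYPE = {
    "Assin2 STS": "eduagarcia/portuguese_benchmark",
    "HateBR Binary": "ruanchaves/hatebr",
    "PT Hate Speech Binary": "hate_speech_portuguese",
}


def find_type_mismatches(results):
    """Return (name, found, expected) for entries whose dataset type differs."""
    mismatches = []
    for entry in results:
        dataset = entry.get("dataset", {})
        name = dataset.get("name")
        expected = EXPECTED_DATASET_TYPE.get(name)
        # Benchmarks missing from the table are skipped, not flagged.
        if expected is not None and dataset.get("type") != expected:
            mismatches.append((name, dataset.get("type"), expected))
    return mismatches


# The HateBR entry as it looked before this commit.
before = [{"dataset": {"name": "HateBR Binary",
                       "type": "eduagarcia/portuguese_benchmark"}}]
print(find_type_mismatches(before))
```

Benchmarks that are not in the table are ignored on purpose, so the check only reports entries it has an expectation for.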

Files changed (1)
  1. README.md +27 -1
README.md CHANGED
@@ -67,6 +67,19 @@ model-index:
     - type: f1_macro
       value: 85.55
       name: f1-macro
+    source:
+      url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicolasdec/cabra13b
+      name: Open Portuguese LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: Assin2 STS
+      type: eduagarcia/portuguese_benchmark
+      split: test
+      args:
+        num_few_shot: 15
+    metrics:
     - type: pearson
       value: 57.19
       name: pearson
@@ -94,7 +107,7 @@ model-index:
       name: Text Generation
     dataset:
       name: HateBR Binary
-      type: eduagarcia/portuguese_benchmark
+      type: ruanchaves/hatebr
       split: test
       args:
         num_few_shot: 25
@@ -102,6 +115,19 @@ model-index:
     - type: f1_macro
       value: 77.78
       name: f1-macro
+    source:
+      url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicolasdec/cabra13b
+      name: Open Portuguese LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: PT Hate Speech Binary
+      type: hate_speech_portuguese
+      split: test
+      args:
+        num_few_shot: 25
+    metrics:
     - type: f1_macro
       value: 64.13
       name: f1-macro
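
Before this change, the last hunk's `metrics` list held two `f1_macro` values (77.78 and 64.13) with no `task` or `dataset` block separating them. A rough sketch of a check for that symptom follows, assuming the results are already parsed into Python dicts. `audit_results` and `REQUIRED_KEYS` are hypothetical names, not part of any Hugging Face tooling, and a repeated metric type is only a heuristic: it would not flag merged entries whose metric types differ, such as `f1_macro` followed by `pearson`.

```python
from collections import Counter

# Assumption: the four keys each result entry carries in this diff.
REQUIRED_KEYS = ("task", "dataset", "metrics", "source")


def audit_results(results):
    """Return human-readable warnings for suspicious result entries."""
    warnings = []
    for index, entry in enumerate(results):
        name = entry.get("dataset", {}).get("name", f"entry {index}")
        for key in REQUIRED_KEYS:
            if key not in entry:
                warnings.append(f"{name}: missing '{key}'")
        # A metric type listed twice hints that two benchmarks were merged.
        counts = Counter(m.get("type") for m in entry.get("metrics", []))
        for metric_type, count in counts.items():
            if count > 1:
                warnings.append(
                    f"{name}: metric '{metric_type}' appears {count} times")
    return warnings


# Illustrative entry shaped like the pre-fix HateBR block (no source,
# two f1_macro values in one list).
merged = [{
    "task": {"type": "text-generation", "name": "Text Generation"},
    "dataset": {"name": "HateBR Binary", "type": "ruanchaves/hatebr"},
    "metrics": [
        {"type": "f1_macro", "value": 77.78, "name": "f1-macro"},
        {"type": "f1_macro", "value": 64.13, "name": "f1-macro"},
    ],
}]
for warning in audit_results(merged):
    print(warning)
```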