Commit
1006696
1 Parent(s): 0608844

Adding Evaluation Results (#7)


- Adding Evaluation Results (f1a2fe7e65b940ab11a4067bd8ce468f98ec95cf)


Co-authored-by: Open LLM Leaderboard PR Bot <[email protected]>

Files changed (1)
  1. README.md +17 -4
README.md CHANGED
@@ -1,14 +1,14 @@
  ---
+ license: cc0-1.0
  library_name: peft
  tags:
  - generated_from_trainer
+ datasets:
+ - Open-Orca/SlimOrca
  base_model: 152334H/miqu-1-70b-sf
  model-index:
  - name: Senku-70B-Full
  results: []
- license: cc0-1.0
- datasets:
- - Open-Orca/SlimOrca
  ---

  # ShinojiResearch/Senku-70B-Full
@@ -167,4 +167,17 @@ The following hyperparameters were used during training:
  - Transformers 4.38.0.dev0
  - Pytorch 2.1.2+cu118
  - Datasets 2.16.1
- - Tokenizers 0.15.0
+ - Tokenizers 0.15.0
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ShinojiResearch__Senku-70B-Full)
+
+ |             Metric              |Value|
+ |---------------------------------|----:|
+ |Avg.                             |75.44|
+ |AI2 Reasoning Challenge (25-Shot)|71.50|
+ |HellaSwag (10-Shot)              |87.88|
+ |MMLU (5-Shot)                    |75.20|
+ |TruthfulQA (0-shot)              |61.96|
+ |Winogrande (5-shot)              |84.77|
+ |GSM8k (5-shot)                   |71.34|
+
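
The `Avg.` row in the added table is consistent with an unweighted arithmetic mean of the six benchmark scores. The commit does not say how the leaderboard derives it, so the following is only a minimal sanity check under that assumption; the `scores` dictionary and its keys are illustrative names, with values copied from the table.

```python
# Benchmark scores copied from the table added in this commit.
# Assumption: "Avg." is the plain (unweighted) mean of these six values.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 71.50,
    "HellaSwag (10-Shot)": 87.88,
    "MMLU (5-Shot)": 75.20,
    "TruthfulQA (0-shot)": 61.96,
    "Winogrande (5-shot)": 84.77,
    "GSM8k (5-shot)": 71.34,
}

# Unweighted mean over all benchmarks.
average = sum(scores.values()) / len(scores)

# Format to two decimals, matching the precision used in the table.
formatted = f"{average:.2f}"
print(formatted)  # 75.44
```

The printed value matches the reported `Avg.` of 75.44, which supports the unweighted-mean reading but does not confirm it as the leaderboard's method.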