lemon07r committed on
Commit
e45b11f
1 Parent(s): 7325092
Files changed (3)
  1. .gitattributes +1 -0
  2. Gemma-2-Ataraxy-Remix-9B-Q8_0.gguf +3 -0
  3. README.md +193 -3
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Gemma-2-Ataraxy-Remix-9B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
Gemma-2-Ataraxy-Remix-9B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f5730195101682109b32da54f74e02a6aff696d64e881020b9e463c236087d9f
3
+ size 9827151648
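The three added lines are a Git LFS pointer, not the model weights themselves: a spec version, a SHA-256 object ID, and the size in bytes. A minimal sketch of reading those fields follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field is "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:f5730195101682109b32da54f74e02a6aff696d64e881020b9e463c236087d9f\n"
    "size 9827151648\n"
)
size_gib = pointer["size"] / 2**30  # roughly 9.15 GiB for the Q8_0 file
```

The size field says the GGUF is a little over 9 GiB, which is worth knowing before downloading it.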
README.md CHANGED
@@ -1,3 +1,193 @@
1
- ---
2
- license: gemma
3
- ---
1
+ ---
2
+ library_name: transformers
3
+ tags:
4
+ - mergekit
5
+ - merge
6
+ base_model:
7
+ - nbeerbower/gemma2-gutenberg-9B
8
+ - princeton-nlp/gemma-2-9b-it-SimPO
9
+ - jsgreenawalt/gemma-2-9B-it-advanced-v2.1
10
+ - UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
11
+ - recoilme/Gemma-2-Ataraxy-Gemmasutra-9B-slerp
12
+ - unsloth/gemma-2-9b-it
13
+ - lemon07r/Gemma-2-Ataraxy-v2-9B
14
+ - ifable/gemma-2-Ifable-9B
15
+ - grimjim/Gemma2-Nephilim-v3-9B
16
+ - lemon07r/Gemma-2-Ataraxy-v2a-9B
17
+ - wzhouad/gemma-2-9b-it-WPO-HB
18
+ - lemon07r/Gemma-2-Ataraxy-9B
19
+ model-index:
20
+ - name: Gemma-2-Ataraxy-Remix-9B
21
+   results:
22
+   - task:
23
+       type: text-generation
24
+       name: Text Generation
25
+     dataset:
26
+       name: IFEval (0-Shot)
27
+       type: HuggingFaceH4/ifeval
28
+       args:
29
+         num_few_shot: 0
30
+     metrics:
31
+     - type: inst_level_strict_acc and prompt_level_strict_acc
32
+       value: 70.83
33
+       name: strict accuracy
34
+     source:
35
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=lemon07r/Gemma-2-Ataraxy-Remix-9B
36
+       name: Open LLM Leaderboard
37
+   - task:
38
+       type: text-generation
39
+       name: Text Generation
40
+     dataset:
41
+       name: BBH (3-Shot)
42
+       type: BBH
43
+       args:
44
+         num_few_shot: 3
45
+     metrics:
46
+     - type: acc_norm
47
+       value: 41.59
48
+       name: normalized accuracy
49
+     source:
50
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=lemon07r/Gemma-2-Ataraxy-Remix-9B
51
+       name: Open LLM Leaderboard
52
+   - task:
53
+       type: text-generation
54
+       name: Text Generation
55
+     dataset:
56
+       name: MATH Lvl 5 (4-Shot)
57
+       type: hendrycks/competition_math
58
+       args:
59
+         num_few_shot: 4
60
+     metrics:
61
+     - type: exact_match
62
+       value: 1.28
63
+       name: exact match
64
+     source:
65
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=lemon07r/Gemma-2-Ataraxy-Remix-9B
66
+       name: Open LLM Leaderboard
67
+   - task:
68
+       type: text-generation
69
+       name: Text Generation
70
+     dataset:
71
+       name: GPQA (0-shot)
72
+       type: Idavidrein/gpqa
73
+       args:
74
+         num_few_shot: 0
75
+     metrics:
76
+     - type: acc_norm
77
+       value: 11.86
78
+       name: acc_norm
79
+     source:
80
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=lemon07r/Gemma-2-Ataraxy-Remix-9B
81
+       name: Open LLM Leaderboard
82
+   - task:
83
+       type: text-generation
84
+       name: Text Generation
85
+     dataset:
86
+       name: MuSR (0-shot)
87
+       type: TAUR-Lab/MuSR
88
+       args:
89
+         num_few_shot: 0
90
+     metrics:
91
+     - type: acc_norm
92
+       value: 13.72
93
+       name: acc_norm
94
+     source:
95
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=lemon07r/Gemma-2-Ataraxy-Remix-9B
96
+       name: Open LLM Leaderboard
97
+   - task:
98
+       type: text-generation
99
+       name: Text Generation
100
+     dataset:
101
+       name: MMLU-PRO (5-shot)
102
+       type: TIGER-Lab/MMLU-Pro
103
+       config: main
104
+       split: test
105
+       args:
106
+         num_few_shot: 5
107
+     metrics:
108
+     - type: acc
109
+       value: 35.99
110
+       name: accuracy
111
+     source:
112
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=lemon07r/Gemma-2-Ataraxy-Remix-9B
113
+       name: Open LLM Leaderboard
114
+ ---
115
+ # Gemma-2-Ataraxy-Remix-9B-Q8_0-GGUF
116
+
117
+ This is the Q8_0 GGUF for people to try while I evaluate my merges.
118
+
119
+ Another test model. Ignore this for now. It probably won't be good, but I am testing a lot of stuff.
120
+
121
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
122
+
123
+ ## Merge Details
124
+ ### Merge Method
125
+
126
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with [unsloth/gemma-2-9b-it](https://huggingface.co/unsloth/gemma-2-9b-it) as the base.
127
+
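For readers unfamiliar with Model Stock, here is a rough sketch of the interpolation rule as described in the linked paper, applied to flat lists of floats rather than real tensors. It is an illustration only, not mergekit's implementation; `model_stock_merge` and its arguments are hypothetical names. The idea is to average the fine-tuned weights, then pull that average back toward the base by a ratio `t` derived from how well the fine-tuned models agree (the cosine between their offsets from the base).

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Merge flat weight vectors using the Model Stock interpolation ratio."""
    n = len(finetuned)
    if n < 2:
        raise ValueError("need at least two fine-tuned models")

    # Offsets of each fine-tuned model from the base weights.
    deltas = [[w - b for w, b in zip(model, base)] for model in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        return dot / norm if norm else 0.0

    # Average pairwise cosine between the offsets.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio: t = N * cos / (1 + (N - 1) * cos).
    t = n * cos_theta / (1 + (n - 1) * cos_theta)

    # Blend the plain average of the fine-tuned weights with the base.
    average = [sum(column) / n for column in zip(*finetuned)]
    return [t * a + (1 - t) * b for a, b in zip(average, base)]


# Offsets that agree completely keep the average; orthogonal ones fall back to the base.
agree = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [1.0, 0.0]])
disagree = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
```

In practice the ratio is computed per weight tensor or layer rather than over one flat vector, so treat the sketch as a picture of the formula, not of the full procedure.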
128
+ ### Models Merged
129
+
130
+ The following models were included in the merge:
131
+ * [nbeerbower/gemma2-gutenberg-9B](https://huggingface.co/nbeerbower/gemma2-gutenberg-9B)
132
+ * [princeton-nlp/gemma-2-9b-it-SimPO](https://huggingface.co/princeton-nlp/gemma-2-9b-it-SimPO)
133
+ * [jsgreenawalt/gemma-2-9B-it-advanced-v2.1](https://huggingface.co/jsgreenawalt/gemma-2-9B-it-advanced-v2.1)
134
+ * [UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3)
135
+ * [recoilme/Gemma-2-Ataraxy-Gemmasutra-9B-slerp](https://huggingface.co/recoilme/Gemma-2-Ataraxy-Gemmasutra-9B-slerp)
136
+ * [lemon07r/Gemma-2-Ataraxy-v2-9B](https://huggingface.co/lemon07r/Gemma-2-Ataraxy-v2-9B)
137
+ * [ifable/gemma-2-Ifable-9B](https://huggingface.co/ifable/gemma-2-Ifable-9B)
138
+ * [grimjim/Gemma2-Nephilim-v3-9B](https://huggingface.co/grimjim/Gemma2-Nephilim-v3-9B)
139
+ * [lemon07r/Gemma-2-Ataraxy-v2a-9B](https://huggingface.co/lemon07r/Gemma-2-Ataraxy-v2a-9B)
140
+ * [wzhouad/gemma-2-9b-it-WPO-HB](https://huggingface.co/wzhouad/gemma-2-9b-it-WPO-HB)
141
+ * [lemon07r/Gemma-2-Ataraxy-9B](https://huggingface.co/lemon07r/Gemma-2-Ataraxy-9B)
142
+
143
+ ### Configuration
144
+
145
+ The following YAML configuration was used to produce this model:
146
+
147
+ ```yaml
148
+ base_model: unsloth/gemma-2-9b-it
149
+ dtype: bfloat16
150
+ merge_method: model_stock
151
+ slices:
152
+ - sources:
153
+   - layer_range: [0, 42]
154
+     model: lemon07r/Gemma-2-Ataraxy-9B
155
+   - layer_range: [0, 42]
156
+     model: lemon07r/Gemma-2-Ataraxy-v2-9B
157
+   - layer_range: [0, 42]
158
+     model: lemon07r/Gemma-2-Ataraxy-v2a-9B
159
+   - layer_range: [0, 42]
160
+     model: jsgreenawalt/gemma-2-9B-it-advanced-v2.1
161
+   - layer_range: [0, 42]
162
+     model: ifable/gemma-2-Ifable-9B
163
+   - layer_range: [0, 42]
164
+     model: UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
165
+   - layer_range: [0, 42]
166
+     model: princeton-nlp/gemma-2-9b-it-SimPO
167
+   - layer_range: [0, 42]
168
+     model: wzhouad/gemma-2-9b-it-WPO-HB
169
+   - layer_range: [0, 42]
170
+     model: nbeerbower/gemma2-gutenberg-9B
171
+   - layer_range: [0, 42]
172
+     model: grimjim/Gemma2-Nephilim-v3-9B
173
+   - layer_range: [0, 42]
174
+     model: recoilme/Gemma-2-Ataraxy-Gemmasutra-9B-slerp
175
+   - layer_range: [0, 42]
176
+     model: UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
177
+   - layer_range: [0, 42]
178
+     model: unsloth/gemma-2-9b-it
179
+ ```
180
+
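The source list in this config has 13 entries, and one model name appears twice; the README does not say whether that is intentional. A small sketch for spotting repeats in a config like this follows. It uses plain string handling instead of a YAML parser, and `duplicate_models` is a hypothetical helper; the model names are copied from the config above.

```python
from collections import Counter

# "model:" lines copied from the merge config, in order.
CONFIG_SOURCES = """
model: lemon07r/Gemma-2-Ataraxy-9B
model: lemon07r/Gemma-2-Ataraxy-v2-9B
model: lemon07r/Gemma-2-Ataraxy-v2a-9B
model: jsgreenawalt/gemma-2-9B-it-advanced-v2.1
model: ifable/gemma-2-Ifable-9B
model: UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
model: princeton-nlp/gemma-2-9b-it-SimPO
model: wzhouad/gemma-2-9b-it-WPO-HB
model: nbeerbower/gemma2-gutenberg-9B
model: grimjim/Gemma2-Nephilim-v3-9B
model: recoilme/Gemma-2-Ataraxy-Gemmasutra-9B-slerp
model: UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
model: unsloth/gemma-2-9b-it
"""


def list_models(config_text):
    """Collect the value of every line that starts with 'model:'."""
    return [
        line.split("model:", 1)[1].strip()
        for line in config_text.splitlines()
        if line.strip().startswith("model:")
    ]


def duplicate_models(config_text):
    """Return the model names that occur more than once, sorted."""
    counts = Counter(list_models(config_text))
    return sorted(name for name, count in counts.items() if count > 1)


models = list_models(CONFIG_SOURCES)
repeats = duplicate_models(CONFIG_SOURCES)
```

Here `repeats` contains only `UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3`. The last source, `unsloth/gemma-2-9b-it`, is also the base model.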
181
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
182
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_lemon07r__Gemma-2-Ataraxy-Remix-9B)
183
+
184
+ | Metric |Value|
185
+ |-------------------|----:|
186
+ |Avg. |29.21|
187
+ |IFEval (0-Shot) |70.83|
188
+ |BBH (3-Shot) |41.59|
189
+ |MATH Lvl 5 (4-Shot)| 1.28|
190
+ |GPQA (0-shot) |11.86|
191
+ |MuSR (0-shot) |13.72|
192
+ |MMLU-PRO (5-shot) |35.99|
193
+
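The `Avg.` row is consistent with a plain mean of the six benchmark scores in the table. A quick check, with the values copied from the table:

```python
# Benchmark scores copied from the results table.
scores = {
    "IFEval (0-Shot)": 70.83,
    "BBH (3-Shot)": 41.59,
    "MATH Lvl 5 (4-Shot)": 1.28,
    "GPQA (0-shot)": 11.86,
    "MuSR (0-shot)": 13.72,
    "MMLU-PRO (5-shot)": 35.99,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 29.21
```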