morriszms commited on
Commit
3f25080
1 Parent(s): 48c96f6

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Llama-3-8B-Instruct-v0.10-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ Llama-3-8B-Instruct-v0.10-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Llama-3-8B-Instruct-v0.10-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Llama-3-8B-Instruct-v0.10-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Llama-3-8B-Instruct-v0.10-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Llama-3-8B-Instruct-v0.10-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Llama-3-8B-Instruct-v0.10-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Llama-3-8B-Instruct-v0.10-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Llama-3-8B-Instruct-v0.10-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Llama-3-8B-Instruct-v0.10-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Llama-3-8B-Instruct-v0.10-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Llama-3-8B-Instruct-v0.10-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3-8B-Instruct-v0.10-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:129ad59b754706d7d4fe930ff2b489fabd7351c69fec616a4a4341ef3ff4a737
3
+ size 3179132288
Llama-3-8B-Instruct-v0.10-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e14434b5276eb479756eacecd780290b723c0e8b6b2a13b781e57538d9bf3317
3
+ size 4321957248
Llama-3-8B-Instruct-v0.10-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac61b3b972000de489585e4c0d9fce843ebd8c62c2c28c235bbeaa9c0afa9675
3
+ size 4018918784
Llama-3-8B-Instruct-v0.10-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:157e215ef43b1de1ccf23546fa7d45f04bbe76839c5553a03af467939e03406a
3
+ size 3664500096
Llama-3-8B-Instruct-v0.10-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a0d348424e6d04687b3d685f87e8fa7b6784288a2d3e3e3321cce6c4db0aea0
3
+ size 4661212544
Llama-3-8B-Instruct-v0.10-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:045207d8cab074151c791da8996ee5513d44125b6432fbe6b557aafb7ef4f464
3
+ size 4920735104
Llama-3-8B-Instruct-v0.10-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef364271488aeb234061e0d62fe44d7b469ab3a71969dafbb366c49558f90b4c
3
+ size 4692669824
Llama-3-8B-Instruct-v0.10-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:134b7d0ea06f9495636befe626ce5631542d3de86a092356a702f8dc4fd58baa
3
+ size 5599294848
Llama-3-8B-Instruct-v0.10-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6e7dbbdc104c5ca6d0729eb4f910f9bd2df38a26347b2823c8252b6999b20c5a
3
+ size 5732988288
Llama-3-8B-Instruct-v0.10-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dc831c82c64b5d9f350758bc95eed4c27d61bf6331b518640931ac6ea2f0d766
3
+ size 5599294848
Llama-3-8B-Instruct-v0.10-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1aa086b0e14850e701534169e0afd6755ba3b5b3fe550d4f0c2f590db2a8f110
3
+ size 6596007296
Llama-3-8B-Instruct-v0.10-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a68cb539bc811c08be523eb2c396de87ebd9297e9cc44fc6d2186073d0cba7ae
3
+ size 8540771712
README.md ADDED
@@ -0,0 +1,184 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: other
5
+ library_name: transformers
6
+ tags:
7
+ - axolotl
8
+ - finetune
9
+ - facebook
10
+ - meta
11
+ - pytorch
12
+ - llama
13
+ - llama-3
14
+ - TensorBlock
15
+ - GGUF
16
+ base_model: MaziyarPanahi/Llama-3-8B-Instruct-v0.10
17
+ pipeline_tag: text-generation
18
+ license_name: llama3
19
+ license_link: LICENSE
20
+ inference: false
21
+ model_creator: MaziyarPanahi
22
+ model-index:
23
+ - name: Llama-3-8B-Instruct-v0.10
24
+ results:
25
+ - task:
26
+ type: text-generation
27
+ name: Text Generation
28
+ dataset:
29
+ name: IFEval (0-Shot)
30
+ type: HuggingFaceH4/ifeval
31
+ args:
32
+ num_few_shot: 0
33
+ metrics:
34
+ - type: inst_level_strict_acc and prompt_level_strict_acc
35
+ value: 76.67
36
+ name: strict accuracy
37
+ source:
38
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/Llama-3-8B-Instruct-v0.10
39
+ name: Open LLM Leaderboard
40
+ - task:
41
+ type: text-generation
42
+ name: Text Generation
43
+ dataset:
44
+ name: BBH (3-Shot)
45
+ type: BBH
46
+ args:
47
+ num_few_shot: 3
48
+ metrics:
49
+ - type: acc_norm
50
+ value: 27.92
51
+ name: normalized accuracy
52
+ source:
53
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/Llama-3-8B-Instruct-v0.10
54
+ name: Open LLM Leaderboard
55
+ - task:
56
+ type: text-generation
57
+ name: Text Generation
58
+ dataset:
59
+ name: MATH Lvl 5 (4-Shot)
60
+ type: hendrycks/competition_math
61
+ args:
62
+ num_few_shot: 4
63
+ metrics:
64
+ - type: exact_match
65
+ value: 4.91
66
+ name: exact match
67
+ source:
68
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/Llama-3-8B-Instruct-v0.10
69
+ name: Open LLM Leaderboard
70
+ - task:
71
+ type: text-generation
72
+ name: Text Generation
73
+ dataset:
74
+ name: GPQA (0-shot)
75
+ type: Idavidrein/gpqa
76
+ args:
77
+ num_few_shot: 0
78
+ metrics:
79
+ - type: acc_norm
80
+ value: 7.83
81
+ name: acc_norm
82
+ source:
83
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/Llama-3-8B-Instruct-v0.10
84
+ name: Open LLM Leaderboard
85
+ - task:
86
+ type: text-generation
87
+ name: Text Generation
88
+ dataset:
89
+ name: MuSR (0-shot)
90
+ type: TAUR-Lab/MuSR
91
+ args:
92
+ num_few_shot: 0
93
+ metrics:
94
+ - type: acc_norm
95
+ value: 10.81
96
+ name: acc_norm
97
+ source:
98
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/Llama-3-8B-Instruct-v0.10
99
+ name: Open LLM Leaderboard
100
+ - task:
101
+ type: text-generation
102
+ name: Text Generation
103
+ dataset:
104
+ name: MMLU-PRO (5-shot)
105
+ type: TIGER-Lab/MMLU-Pro
106
+ config: main
107
+ split: test
108
+ args:
109
+ num_few_shot: 5
110
+ metrics:
111
+ - type: acc
112
+ value: 31.8
113
+ name: accuracy
114
+ source:
115
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/Llama-3-8B-Instruct-v0.10
116
+ name: Open LLM Leaderboard
117
+ ---
118
+
119
+ <div style="width: auto; margin-left: auto; margin-right: auto">
120
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
121
+ </div>
122
+ <div style="display: flex; justify-content: space-between; width: 100%;">
123
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
124
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
125
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
126
+ </p>
127
+ </div>
128
+ </div>
129
+
130
+ ## MaziyarPanahi/Llama-3-8B-Instruct-v0.10 - GGUF
131
+
132
+ This repo contains GGUF format model files for [MaziyarPanahi/Llama-3-8B-Instruct-v0.10](https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-v0.10).
133
+
134
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
135
+
136
+ ## Prompt template
137
+
138
+ ```
139
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
140
+
141
+ {system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
142
+
143
+ {prompt}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
144
+ ```
145
+
146
+ ## Model file specification
147
+
148
+ | Filename | Quant type | File Size | Description |
149
+ | -------- | ---------- | --------- | ----------- |
150
+ | [Llama-3-8B-Instruct-v0.10-Q2_K.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q2_K.gguf) | Q2_K | 2.961 GB | smallest, significant quality loss - not recommended for most purposes |
151
+ | [Llama-3-8B-Instruct-v0.10-Q3_K_S.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q3_K_S.gguf) | Q3_K_S | 3.413 GB | very small, high quality loss |
152
+ | [Llama-3-8B-Instruct-v0.10-Q3_K_M.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q3_K_M.gguf) | Q3_K_M | 3.743 GB | very small, high quality loss |
153
+ | [Llama-3-8B-Instruct-v0.10-Q3_K_L.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q3_K_L.gguf) | Q3_K_L | 4.025 GB | small, substantial quality loss |
154
+ | [Llama-3-8B-Instruct-v0.10-Q4_0.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q4_0.gguf) | Q4_0 | 4.341 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
155
+ | [Llama-3-8B-Instruct-v0.10-Q4_K_S.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q4_K_S.gguf) | Q4_K_S | 4.370 GB | small, greater quality loss |
156
+ | [Llama-3-8B-Instruct-v0.10-Q4_K_M.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q4_K_M.gguf) | Q4_K_M | 4.583 GB | medium, balanced quality - recommended |
157
+ | [Llama-3-8B-Instruct-v0.10-Q5_0.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q5_0.gguf) | Q5_0 | 5.215 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
158
+ | [Llama-3-8B-Instruct-v0.10-Q5_K_S.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q5_K_S.gguf) | Q5_K_S | 5.215 GB | large, low quality loss - recommended |
159
+ | [Llama-3-8B-Instruct-v0.10-Q5_K_M.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q5_K_M.gguf) | Q5_K_M | 5.339 GB | large, very low quality loss - recommended |
160
+ | [Llama-3-8B-Instruct-v0.10-Q6_K.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q6_K.gguf) | Q6_K | 6.143 GB | very large, extremely low quality loss |
161
+ | [Llama-3-8B-Instruct-v0.10-Q8_0.gguf](https://huggingface.co/tensorblock/Llama-3-8B-Instruct-v0.10-GGUF/tree/main/Llama-3-8B-Instruct-v0.10-Q8_0.gguf) | Q8_0 | 7.954 GB | very large, extremely low quality loss - not recommended |
162
+
163
+
164
+ ## Downloading instruction
165
+
166
+ ### Command line
167
+
168
+ Firstly, install Huggingface Client
169
+
170
+ ```shell
171
+ pip install -U "huggingface_hub[cli]"
172
+ ```
173
+
174
+ Then, downoad the individual model file the a local directory
175
+
176
+ ```shell
177
+ huggingface-cli download tensorblock/Llama-3-8B-Instruct-v0.10-GGUF --include "Llama-3-8B-Instruct-v0.10-Q2_K.gguf" --local-dir MY_LOCAL_DIR
178
+ ```
179
+
180
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
181
+
182
+ ```shell
183
+ huggingface-cli download tensorblock/Llama-3-8B-Instruct-v0.10-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
184
+ ```