Upload folder using huggingface_hub
- .gitattributes +16 -0
- README.md +2 -51
- model.shard-00001-of-00016.gguf +3 -0
- model.shard-00002-of-00016.gguf +3 -0
- model.shard-00003-of-00016.gguf +3 -0
- model.shard-00004-of-00016.gguf +3 -0
- model.shard-00005-of-00016.gguf +3 -0
- model.shard-00006-of-00016.gguf +3 -0
- model.shard-00007-of-00016.gguf +3 -0
- model.shard-00008-of-00016.gguf +3 -0
- model.shard-00009-of-00016.gguf +3 -0
- model.shard-00010-of-00016.gguf +3 -0
- model.shard-00011-of-00016.gguf +3 -0
- model.shard-00012-of-00016.gguf +3 -0
- model.shard-00013-of-00016.gguf +3 -0
- model.shard-00014-of-00016.gguf +3 -0
- model.shard-00015-of-00016.gguf +3 -0
- model.shard-00016-of-00016.gguf +3 -0
.gitattributes
CHANGED
@@ -35,3 +35,19 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
smollm2-1.7b-instruct-q5_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
38 |
+
model.shard-00001-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
model.shard-00002-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
model.shard-00003-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
model.shard-00004-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
model.shard-00005-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
model.shard-00006-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
model.shard-00007-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
model.shard-00008-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
46 |
+
model.shard-00009-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
+
model.shard-00010-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
48 |
+
model.shard-00011-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
49 |
+
model.shard-00012-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
50 |
+
model.shard-00013-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
51 |
+
model.shard-00014-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
52 |
+
model.shard-00015-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
53 |
+
model.shard-00016-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
|
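The sixteen entries added to `.gitattributes` above follow one pattern per shard file. As a rough sketch of how such lines could be generated (the helper names `shard_names` and `lfs_attribute_lines` are hypothetical and not part of `huggingface_hub` or Git LFS; the upload tooling normally writes these entries itself):

```python
def shard_names(total: int, prefix: str = "model.shard", ext: str = "gguf") -> list[str]:
    """Build shard file names in the zero-padded form seen in this commit."""
    return [f"{prefix}-{i:05d}-of-{total:05d}.{ext}" for i in range(1, total + 1)]


def lfs_attribute_lines(names: list[str]) -> list[str]:
    """Build one .gitattributes line per file so Git LFS tracks it."""
    return [f"{name} filter=lfs diff=lfs merge=lfs -text" for name in names]


if __name__ == "__main__":
    # Print the 16 lines corresponding to the shards added in this commit.
    for line in lfs_attribute_lines(shard_names(16)):
        print(line)
```

Each generated line marks a single file as LFS-tracked, matching the per-file (rather than wildcard) entries in the hunk above.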
README.md
CHANGED
@@ -1,54 +1,5 @@
|
|
1 |
---
|
2 |
-
|
3 |
-
license: apache-2.0
|
4 |
-
language:
|
5 |
-
- en
|
6 |
-
tags:
|
7 |
-
- llama-cpp
|
8 |
-
- gguf-my-repo
|
9 |
-
base_model: HuggingFaceTB/SmolLM2-1.7B-Instruct
|
10 |
---
|
11 |
|
12 |
-
|
13 |
-
This model was converted to GGUF format from [`HuggingFaceTB/SmolLM2-1.7B-Instruct`](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
14 |
-
Refer to the [original model card](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct) for more details on the model.
|
15 |
-
|
16 |
-
## Use with llama.cpp
|
17 |
-
Install llama.cpp through brew (works on Mac and Linux)
|
18 |
-
|
19 |
-
```bash
|
20 |
-
brew install llama.cpp
|
21 |
-
|
22 |
-
```
|
23 |
-
Invoke the llama.cpp server or the CLI.
|
24 |
-
|
25 |
-
### CLI:
|
26 |
-
```bash
|
27 |
-
llama-cli --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
|
28 |
-
```
|
29 |
-
|
30 |
-
### Server:
|
31 |
-
```bash
|
32 |
-
llama-server --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -c 2048
|
33 |
-
```
|
34 |
-
|
35 |
-
Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
|
36 |
-
|
37 |
-
Step 1: Clone llama.cpp from GitHub.
|
38 |
-
```
|
39 |
-
git clone https://github.com/ggerganov/llama.cpp
|
40 |
-
```
|
41 |
-
|
42 |
-
Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
|
43 |
-
```
|
44 |
-
cd llama.cpp && LLAMA_CURL=1 make
|
45 |
-
```
|
46 |
-
|
47 |
-
Step 3: Run inference through the main binary.
|
48 |
-
```
|
49 |
-
./llama-cli --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
|
50 |
-
```
|
51 |
-
or
|
52 |
-
```
|
53 |
-
./llama-server --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -c 2048
|
54 |
-
```
|
|
|
1 |
---
|
2 |
+
base_model: Felladrin/gguf-Q5_K_M-SmolLM2-1.7B-Instruct
|
3 |
---
|
4 |
|
5 |
+
Sharded GGUF version of [Felladrin/gguf-Q5_K_M-SmolLM2-1.7B-Instruct](https://huggingface.co/Felladrin/gguf-Q5_K_M-SmolLM2-1.7B-Instruct).
|
model.shard-00001-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0f39235a25b7137db097fcb94496099810eed6e56e39435e4bf44b56f7b2063c
|
3 |
+
size 84353600
|
model.shard-00002-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c6ec57e7c0b0ecfb4d14bb24be768e47f357b94163d366ece06caf3228b4b719
|
3 |
+
size 74236672
|
model.shard-00003-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:76f62d405dd1f925514904e43aa66cd61126f806598bcce944b1033b99280651
|
3 |
+
size 72582176
|
model.shard-00004-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f1dd50ddaa71e149c1af4471e829f685a7c19d805d42e19fb28a667856247420
|
3 |
+
size 83649472
|
model.shard-00005-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3204e1a02a2cfdea5cd34657f5c0bccee5335e0fe7461b3c630a6d3cd0b6ad08
|
3 |
+
size 80667648
|
model.shard-00006-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3f5cc6f89c14e5bb523ae6bd0a20ed6522aa5ccf81ff3df0c2e5eb917a9aa5b3
|
3 |
+
size 83002464
|
model.shard-00007-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8ee74635922ac4bf9c738d882e079d5b69830e20d7504881f712ca93391d843d
|
3 |
+
size 81331296
|
model.shard-00008-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:52221dbde679d5eb0bd6763e0cb47c54cb69fe26d0cbac8b2df04bb6992d486c
|
3 |
+
size 82994048
|
model.shard-00009-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a5141a07a2efd4674438093bf1819bb3a79bc0c80f0114614e14d0cea4dfea53
|
3 |
+
size 81323072
|
model.shard-00010-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0789b3e5e2f220849462aafe75fdbbe627bb67deb2497585e115043bab97698d
|
3 |
+
size 83559520
|
model.shard-00011-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:555c86235406b57fe350831fe645b21edba99e5732efe8bef902aa9dcd0b8482
|
3 |
+
size 80774240
|
model.shard-00012-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:af8403b614add4f6da2a77f3f095157ca675ca3b811135e3436d65980927bf9d
|
3 |
+
size 83551104
|
model.shard-00013-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eb3e20b575c82fd61d60fe872409b0ff2d9ec76ca61431bada5b6e6493095a7d
|
3 |
+
size 82994208
|
model.shard-00014-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:876745e3b186faa3c8124aca61315ec50f4fb16a1bed3726f905d953666bd1a8
|
3 |
+
size 72582176
|
model.shard-00015-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ddb5d743bd22a2d8845dd2262e534143f0064813ae08827a589097316a63ab1b
|
3 |
+
size 74236672
|
model.shard-00016-of-00016.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e889ac1b51697d78746a09cebb0813aaf264d30b4b02700df3992204a84332ae
|
3 |
+
size 23642624
|
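Each shard above is committed as a three-line Git LFS pointer (`version`, `oid sha256:...`, `size`) rather than as the binary itself. A minimal sketch of how a downloaded shard might be checked against its pointer, assuming the file is already on local disk; the function names `parse_lfs_pointer` and `matches_pointer` are illustrative and not part of any library:

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash agree with the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so large shards are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents for the last shard, copied from the diff above.
POINTER_SHARD_16 = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e889ac1b51697d78746a09cebb0813aaf264d30b4b02700df3992204a84332ae
size 23642624
"""
```

For example, `matches_pointer("model.shard-00016-of-00016.gguf", parse_lfs_pointer(POINTER_SHARD_16))` would be expected to return `True` for an intact download of that shard; the size check runs first so that a truncated file is rejected without hashing.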