Felladrin commited on
Commit
37b6029
1 Parent(s): aaa4446

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -35,3 +35,19 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  smollm2-1.7b-instruct-q5_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
37
  imatrix.dat filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  smollm2-1.7b-instruct-q5_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
37
  imatrix.dat filter=lfs diff=lfs merge=lfs -text
38
+ model.shard-00001-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
39
+ model.shard-00002-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
40
+ model.shard-00003-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
41
+ model.shard-00004-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
42
+ model.shard-00005-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
43
+ model.shard-00006-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
44
+ model.shard-00007-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
45
+ model.shard-00008-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
46
+ model.shard-00009-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
47
+ model.shard-00010-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
48
+ model.shard-00011-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
49
+ model.shard-00012-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
50
+ model.shard-00013-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
51
+ model.shard-00014-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
52
+ model.shard-00015-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
53
+ model.shard-00016-of-00016.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,54 +1,5 @@
1
  ---
2
- library_name: transformers
3
- license: apache-2.0
4
- language:
5
- - en
6
- tags:
7
- - llama-cpp
8
- - gguf-my-repo
9
- base_model: HuggingFaceTB/SmolLM2-1.7B-Instruct
10
  ---
11
 
12
- # Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF
13
- This model was converted to GGUF format from [`HuggingFaceTB/SmolLM2-1.7B-Instruct`](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
14
- Refer to the [original model card](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct) for more details on the model.
15
-
16
- ## Use with llama.cpp
17
- Install llama.cpp through brew (works on Mac and Linux)
18
-
19
- ```bash
20
- brew install llama.cpp
21
-
22
- ```
23
- Invoke the llama.cpp server or the CLI.
24
-
25
- ### CLI:
26
- ```bash
27
- llama-cli --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
28
- ```
29
-
30
- ### Server:
31
- ```bash
32
- llama-server --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -c 2048
33
- ```
34
-
35
- Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
36
-
37
- Step 1: Clone llama.cpp from GitHub.
38
- ```
39
- git clone https://github.com/ggerganov/llama.cpp
40
- ```
41
-
42
- Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
43
- ```
44
- cd llama.cpp && LLAMA_CURL=1 make
45
- ```
46
-
47
- Step 3: Run inference through the main binary.
48
- ```
49
- ./llama-cli --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
50
- ```
51
- or
52
- ```
53
- ./llama-server --hf-repo Felladrin/SmolLM2-1.7B-Instruct-Q5_K_M-GGUF --hf-file smollm2-1.7b-instruct-q5_k_m-imat.gguf -c 2048
54
- ```
 
1
  ---
2
+ base_model: Felladrin/gguf-Q5_K_M-SmolLM2-1.7B-Instruct
 
 
 
 
 
 
 
3
  ---
4
 
5
+ Sharded GGUF version of [Felladrin/gguf-Q5_K_M-SmolLM2-1.7B-Instruct](https://huggingface.co/Felladrin/gguf-Q5_K_M-SmolLM2-1.7B-Instruct).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
model.shard-00001-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f39235a25b7137db097fcb94496099810eed6e56e39435e4bf44b56f7b2063c
3
+ size 84353600
model.shard-00002-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c6ec57e7c0b0ecfb4d14bb24be768e47f357b94163d366ece06caf3228b4b719
3
+ size 74236672
model.shard-00003-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76f62d405dd1f925514904e43aa66cd61126f806598bcce944b1033b99280651
3
+ size 72582176
model.shard-00004-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1dd50ddaa71e149c1af4471e829f685a7c19d805d42e19fb28a667856247420
3
+ size 83649472
model.shard-00005-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3204e1a02a2cfdea5cd34657f5c0bccee5335e0fe7461b3c630a6d3cd0b6ad08
3
+ size 80667648
model.shard-00006-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3f5cc6f89c14e5bb523ae6bd0a20ed6522aa5ccf81ff3df0c2e5eb917a9aa5b3
3
+ size 83002464
model.shard-00007-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8ee74635922ac4bf9c738d882e079d5b69830e20d7504881f712ca93391d843d
3
+ size 81331296
model.shard-00008-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:52221dbde679d5eb0bd6763e0cb47c54cb69fe26d0cbac8b2df04bb6992d486c
3
+ size 82994048
model.shard-00009-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5141a07a2efd4674438093bf1819bb3a79bc0c80f0114614e14d0cea4dfea53
3
+ size 81323072
model.shard-00010-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0789b3e5e2f220849462aafe75fdbbe627bb67deb2497585e115043bab97698d
3
+ size 83559520
model.shard-00011-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:555c86235406b57fe350831fe645b21edba99e5732efe8bef902aa9dcd0b8482
3
+ size 80774240
model.shard-00012-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af8403b614add4f6da2a77f3f095157ca675ca3b811135e3436d65980927bf9d
3
+ size 83551104
model.shard-00013-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb3e20b575c82fd61d60fe872409b0ff2d9ec76ca61431bada5b6e6493095a7d
3
+ size 82994208
model.shard-00014-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:876745e3b186faa3c8124aca61315ec50f4fb16a1bed3726f905d953666bd1a8
3
+ size 72582176
model.shard-00015-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ddb5d743bd22a2d8845dd2262e534143f0064813ae08827a589097316a63ab1b
3
+ size 74236672
model.shard-00016-of-00016.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e889ac1b51697d78746a09cebb0813aaf264d30b4b02700df3992204a84332ae
3
+ size 23642624