GGUF
English
TensorBlock
GGUF
Inference Endpoints
conversational
morriszms commited on
Commit
2d15fbd
·
verified ·
1 Parent(s): 0798830

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ ReflectionCoder-DS-33B-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ ReflectionCoder-DS-33B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ ReflectionCoder-DS-33B-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ ReflectionCoder-DS-33B-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ ReflectionCoder-DS-33B-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ ReflectionCoder-DS-33B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ ReflectionCoder-DS-33B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ ReflectionCoder-DS-33B-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ ReflectionCoder-DS-33B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ ReflectionCoder-DS-33B-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ ReflectionCoder-DS-33B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ ReflectionCoder-DS-33B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,75 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - SenseLLM/ReflectionSeq-GPT
5
+ - SenseLLM/ReflectionSeq-DS
6
+ language:
7
+ - en
8
+ base_model: SenseLLM/ReflectionCoder-DS-33B
9
+ tags:
10
+ - TensorBlock
11
+ - GGUF
12
+ ---
13
+
14
+ <div style="width: auto; margin-left: auto; margin-right: auto">
15
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
16
+ </div>
17
+ <div style="display: flex; justify-content: space-between; width: 100%;">
18
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
19
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
20
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
21
+ </p>
22
+ </div>
23
+ </div>
24
+
25
+ ## SenseLLM/ReflectionCoder-DS-33B - GGUF
26
+
27
+ This repo contains GGUF format model files for [SenseLLM/ReflectionCoder-DS-33B](https://huggingface.co/SenseLLM/ReflectionCoder-DS-33B).
28
+
29
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
30
+
31
+ ## Prompt template
32
+
33
+ ```
34
+ <|system|><|text|>{system_prompt}<|endofblock|><|endofmessage|><|user|><|text|>{prompt}<|endofblock|><|endofmessage|><|assistant|>
35
+ ```
36
+
37
+ ## Model file specification
38
+
39
+ | Filename | Quant type | File Size | Description |
40
+ | -------- | ---------- | --------- | ----------- |
41
+ | [ReflectionCoder-DS-33B-Q2_K.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q2_K.gguf) | Q2_K | 11.505 GB | smallest, significant quality loss - not recommended for most purposes |
42
+ | [ReflectionCoder-DS-33B-Q3_K_S.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q3_K_S.gguf) | Q3_K_S | 13.429 GB | very small, high quality loss |
43
+ | [ReflectionCoder-DS-33B-Q3_K_M.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q3_K_M.gguf) | Q3_K_M | 14.985 GB | very small, high quality loss |
44
+ | [ReflectionCoder-DS-33B-Q3_K_L.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q3_K_L.gguf) | Q3_K_L | 16.352 GB | small, substantial quality loss |
45
+ | [ReflectionCoder-DS-33B-Q4_0.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q4_0.gguf) | Q4_0 | 17.525 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
46
+ | [ReflectionCoder-DS-33B-Q4_K_S.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q4_K_S.gguf) | Q4_K_S | 17.640 GB | small, greater quality loss |
47
+ | [ReflectionCoder-DS-33B-Q4_K_M.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q4_K_M.gguf) | Q4_K_M | 18.569 GB | medium, balanced quality - recommended |
48
+ | [ReflectionCoder-DS-33B-Q5_0.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q5_0.gguf) | Q5_0 | 21.379 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
49
+ | [ReflectionCoder-DS-33B-Q5_K_S.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q5_K_S.gguf) | Q5_K_S | 21.379 GB | large, low quality loss - recommended |
50
+ | [ReflectionCoder-DS-33B-Q5_K_M.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q5_K_M.gguf) | Q5_K_M | 21.917 GB | large, very low quality loss - recommended |
51
+ | [ReflectionCoder-DS-33B-Q6_K.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q6_K.gguf) | Q6_K | 25.475 GB | very large, extremely low quality loss |
52
+ | [ReflectionCoder-DS-33B-Q8_0.gguf](https://huggingface.co/tensorblock/ReflectionCoder-DS-33B-GGUF/tree/main/ReflectionCoder-DS-33B-Q8_0.gguf) | Q8_0 | 32.994 GB | very large, extremely low quality loss - not recommended |
53
+
54
+
55
+ ## Downloading instruction
56
+
57
+ ### Command line
58
+
59
+ Firstly, install Huggingface Client
60
+
61
+ ```shell
62
+ pip install -U "huggingface_hub[cli]"
63
+ ```
64
+
65
+ Then, downoad the individual model file the a local directory
66
+
67
+ ```shell
68
+ huggingface-cli download tensorblock/ReflectionCoder-DS-33B-GGUF --include "ReflectionCoder-DS-33B-Q2_K.gguf" --local-dir MY_LOCAL_DIR
69
+ ```
70
+
71
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
72
+
73
+ ```shell
74
+ huggingface-cli download tensorblock/ReflectionCoder-DS-33B-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
75
+ ```
ReflectionCoder-DS-33B-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b1d05627078fe2f935eaa4a1e6a5518046a22d40c016a643fd821ae88ec4da1
3
+ size 12353881152
ReflectionCoder-DS-33B-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a63e1ae643c46ea3dc74aebd42b98fe90a704dc31f8c4d39c988c6f9bd6d917d
3
+ size 17558200000
ReflectionCoder-DS-33B-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8d2bced7dde1d9cdbeca106ae595cfa8a2bb82792617d5793ece3b4fe9d49b15
3
+ size 16089964224
ReflectionCoder-DS-33B-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:71038cd078837bafa4d3c781edcdc65dddb89cdc79a9903d503e9fcbf8fc5c70
3
+ size 14419734208
ReflectionCoder-DS-33B-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:11fdfdb61f92f4eded032e3559564271e8726fd8abe645bcf72dfb09a5955175
3
+ size 18817057408
ReflectionCoder-DS-33B-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa7da2d55cea950550e526fe9159d10b0a9bfc4591d4e035a6ae8f70a7de426a
3
+ size 19938275968
ReflectionCoder-DS-33B-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4733d7e9fc46ccbaf1b1821d20eadc228f66e46c453dbac3649c3fbd4c40f6c3
3
+ size 18941149824
ReflectionCoder-DS-33B-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:825d6bef3dd3368cb8689662059128e6329e196fccdc803bf7d2ab58ef05d9da
3
+ size 22955714560
ReflectionCoder-DS-33B-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a899c8b64e0d6536c90db0c8ca7453a4013f6c7e39065d7e148d725615ad165a
3
+ size 23533312000
ReflectionCoder-DS-33B-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e0011ce3dcd9acfa3115578855589c42d53cf61e72cdf2ec6d05b12b8253ddf
3
+ size 22955714560
ReflectionCoder-DS-33B-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:485dc1aa3c9b804d8f0aeefcab330a160385b5b277e935f280ca367ab832a939
3
+ size 27353037792
ReflectionCoder-DS-33B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7eab227d1519d0dc7386ddf8fc4d9441da548ca7233679aae139f7e84eaa6d3c
3
+ size 35427288352