Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -1,8 +1,6 @@
|
|
1 |
---
|
2 |
quantized_by: bartowski
|
3 |
pipeline_tag: text-generation
|
4 |
-
license: apache-2.0
|
5 |
-
base_model: RekaAI/reka-flash-3
|
6 |
---
|
7 |
|
8 |
## Llamacpp imatrix Quantizations of reka-flash-3 by RekaAI
|
@@ -20,9 +18,13 @@ Run them directly with [llama.cpp](https://github.com/ggerganov/llama.cpp), or a
|
|
20 |
## Prompt format
|
21 |
|
22 |
```
|
23 |
-
human: {prompt} <sep> assistant:
|
24 |
```
|
25 |
|
|
|
|
|
|
|
|
|
26 |
## Download a file (not the whole branch) from below:
|
27 |
|
28 |
| Filename | Quant type | File Size | Split | Description |
|
@@ -52,6 +54,8 @@ human: {prompt} <sep> assistant:
|
|
52 |
| [reka-flash-3-Q2_K.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-Q2_K.gguf) | Q2_K | 8.63GB | false | Very low quality but surprisingly usable. |
|
53 |
| [reka-flash-3-IQ2_M.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_M.gguf) | IQ2_M | 8.51GB | false | Relatively low quality, uses SOTA techniques to be surprisingly usable. |
|
54 |
| [reka-flash-3-IQ2_S.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_S.gguf) | IQ2_S | 8.12GB | false | Low quality, uses SOTA techniques to be usable. |
|
|
|
|
|
55 |
|
56 |
## Embed/output weights
|
57 |
|
|
|
1 |
---
|
2 |
quantized_by: bartowski
|
3 |
pipeline_tag: text-generation
|
|
|
|
|
4 |
---
|
5 |
|
6 |
## Llamacpp imatrix Quantizations of reka-flash-3 by RekaAI
|
|
|
18 |
## Prompt format
|
19 |
|
20 |
```
|
21 |
+
human: {system_prompt} {prompt} <sep> assistant:
|
22 |
```
|
23 |
|
24 |
+
## What's new:
|
25 |
+
|
26 |
+
Fix chat template
|
27 |
+
|
28 |
## Download a file (not the whole branch) from below:
|
29 |
|
30 |
| Filename | Quant type | File Size | Split | Description |
|
|
|
54 |
| [reka-flash-3-Q2_K.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-Q2_K.gguf) | Q2_K | 8.63GB | false | Very low quality but surprisingly usable. |
|
55 |
| [reka-flash-3-IQ2_M.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_M.gguf) | IQ2_M | 8.51GB | false | Relatively low quality, uses SOTA techniques to be surprisingly usable. |
|
56 |
| [reka-flash-3-IQ2_S.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_S.gguf) | IQ2_S | 8.12GB | false | Low quality, uses SOTA techniques to be usable. |
|
57 |
+
| [reka-flash-3-IQ2_XS.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_XS.gguf) | IQ2_XS | 7.83GB | false | Low quality, uses SOTA techniques to be usable. |
|
58 |
+
| [reka-flash-3-IQ2_XXS.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_XXS.gguf) | IQ2_XXS | 7.39GB | false | Very low quality, uses SOTA techniques to be usable. |
|
59 |
|
60 |
## Embed/output weights
|
61 |
|