bartowski commited on
Commit
7465f51
·
verified ·
1 Parent(s): a619a1e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -1,8 +1,6 @@
1
  ---
2
  quantized_by: bartowski
3
  pipeline_tag: text-generation
4
- license: apache-2.0
5
- base_model: RekaAI/reka-flash-3
6
  ---
7
 
8
  ## Llamacpp imatrix Quantizations of reka-flash-3 by RekaAI
@@ -20,9 +18,13 @@ Run them directly with [llama.cpp](https://github.com/ggerganov/llama.cpp), or a
20
  ## Prompt format
21
 
22
  ```
23
- human: {prompt} <sep> assistant:
24
  ```
25
 
 
 
 
 
26
  ## Download a file (not the whole branch) from below:
27
 
28
  | Filename | Quant type | File Size | Split | Description |
@@ -52,6 +54,8 @@ human: {prompt} <sep> assistant:
52
  | [reka-flash-3-Q2_K.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-Q2_K.gguf) | Q2_K | 8.63GB | false | Very low quality but surprisingly usable. |
53
  | [reka-flash-3-IQ2_M.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_M.gguf) | IQ2_M | 8.51GB | false | Relatively low quality, uses SOTA techniques to be surprisingly usable. |
54
  | [reka-flash-3-IQ2_S.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_S.gguf) | IQ2_S | 8.12GB | false | Low quality, uses SOTA techniques to be usable. |
 
 
55
 
56
  ## Embed/output weights
57
 
 
1
  ---
2
  quantized_by: bartowski
3
  pipeline_tag: text-generation
 
 
4
  ---
5
 
6
  ## Llamacpp imatrix Quantizations of reka-flash-3 by RekaAI
 
18
  ## Prompt format
19
 
20
  ```
21
+ human: {system_prompt} {prompt} <sep> assistant:
22
  ```
23
 
24
+ ## What's new:
25
+
26
+ Fix chat template
27
+
28
  ## Download a file (not the whole branch) from below:
29
 
30
  | Filename | Quant type | File Size | Split | Description |
 
54
  | [reka-flash-3-Q2_K.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-Q2_K.gguf) | Q2_K | 8.63GB | false | Very low quality but surprisingly usable. |
55
  | [reka-flash-3-IQ2_M.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_M.gguf) | IQ2_M | 8.51GB | false | Relatively low quality, uses SOTA techniques to be surprisingly usable. |
56
  | [reka-flash-3-IQ2_S.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_S.gguf) | IQ2_S | 8.12GB | false | Low quality, uses SOTA techniques to be usable. |
57
+ | [reka-flash-3-IQ2_XS.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_XS.gguf) | IQ2_XS | 7.83GB | false | Low quality, uses SOTA techniques to be usable. |
58
+ | [reka-flash-3-IQ2_XXS.gguf](https://huggingface.co/bartowski/RekaAI_reka-flash-3-GGUF/blob/main/RekaAI_reka-flash-3-IQ2_XXS.gguf) | IQ2_XXS | 7.39GB | false | Very low quality, uses SOTA techniques to be usable. |
59
 
60
  ## Embed/output weights
61