Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,7 @@ Quantized using the default exllamav2 quantization script/dataset, with the foll
|
|
5 |
- Fewer rows, but ultimately, much more data was used.
|
6 |
- A few rows of an "extra" dataset, with some examples of long, coherent text and this model's chat tokens, were added to the dataset.
|
7 |
|
8 |
-
The goal is less
|
9 |
|
10 |
# DeepSeek-R1
|
11 |
<!-- markdownlint-disable first-line-h1 -->
|
|
|
5 |
- Fewer rows, but ultimately, much more data was used.
|
6 |
- A few rows of an "extra" dataset, with some examples of long, coherent text and this model's chat tokens, were added to the dataset.
|
7 |
|
8 |
+
The goal is less degradation from quantization at long context. But I tried to stay as close to default exl2 quantization parameters as possible, as straying too far from them only seems to degrade performance.
|
9 |
|
10 |
# DeepSeek-R1
|
11 |
<!-- markdownlint-disable first-line-h1 -->
|