Downtown-Case
/

deepseek-ai_DeepSeek-R1-Distill-Qwen-32B-exl2-4.5bpw-8K-Cal

Model card Files Files and versions Community

Downtown-Case commited on Jan 22

Commit

dab8cac

·

verified ·

1 Parent(s): a4a4afb

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ Quantized using the default exllamav2 quantization script/dataset, with the foll
 - Fewer rows, but ultimately, much more data was used.
 - A few rows of an "extra" dataset, with some examples of long, coherent text and this model's chat tokens, were added to the dataset.
-The goal is less degredation from quantization at long context. But I tried to stay as close to default exl2 quantization parameters as possible, as straying too far from them only seems to degrade performance.
 # DeepSeek-R1
 <!-- markdownlint-disable first-line-h1 -->

 - Fewer rows, but ultimately, much more data was used.
 - A few rows of an "extra" dataset, with some examples of long, coherent text and this model's chat tokens, were added to the dataset.
+The goal is less degradation from quantization at long context. But I tried to stay as close to default exl2 quantization parameters as possible, as straying too far from them only seems to degrade performance.
 # DeepSeek-R1
 <!-- markdownlint-disable first-line-h1 -->