8-bit quantization
#20 by ramkumarkoppu - opened
Hi @Unsloth team, thanks for the great work. Can you please provide instructions for quantizing to 8 bits locally on my Linux system, with the model weights downloaded from https://huggingface.co/deepseek-ai/DeepSeek-R1/tree/main, so that I can reproduce the quantized model files in the DeepSeek-R1-Q8_0 directory?
The R1 model is already 8-bit by default :)
Now I am more confused: the model weights in the repo https://huggingface.co/deepseek-ai/DeepSeek-R1/tree/main tell me otherwise.
The large matrices are fp8, specifically F8_e4m3.
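For context, F8_e4m3 packs each value into a single byte: 1 sign bit, 4 exponent bits (bias 7), and 3 mantissa bits, with a maximum magnitude of 448. A minimal decoder sketch (my own illustration of the FN variant used in ML checkpoints, not code from any DeepSeek or Unsloth repo):

```python
def decode_e4m3(byte: int) -> float:
    """Decode one FP8 E4M3 (FN variant) byte into a Python float."""
    sign = -1.0 if (byte >> 7) & 1 else 1.0
    exp = (byte >> 3) & 0xF   # 4 exponent bits, bias 7
    mant = byte & 0x7         # 3 mantissa bits
    if exp == 0xF and mant == 0x7:
        return float("nan")   # E4M3FN reserves only this bit pattern for NaN
    if exp == 0:
        return sign * (mant / 8) * 2.0 ** -6          # subnormal
    return sign * (1 + mant / 8) * 2.0 ** (exp - 7)   # normal

print(decode_e4m3(0x38))  # 1.0
print(decode_e4m3(0x7E))  # 448.0, the largest representable magnitude
```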
So what did the @unsloth team do to create https://huggingface.co/unsloth/DeepSeek-R1-GGUF/tree/main/DeepSeek-R1-Q8_0 from https://huggingface.co/deepseek-ai/DeepSeek-R1/tree/main? What are the reproduction steps?
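While waiting for an official answer: the usual way to produce a Q8_0 GGUF is with llama.cpp's converter and quantize tool. The sketch below is my guess at the general workflow, not confirmed as Unsloth's exact steps. One wrinkle: llama.cpp's HF converter expects bf16/fp16 safetensors, so the fp8 (F8_e4m3) checkpoint has to be upcast first; DeepSeek publish an `fp8_cast_bf16.py` helper in their DeepSeek-V3 repo for that (the flag names below are taken from its README and may change).

```shell
# Sketch only: paths are assumptions, adjust to your local setup.
# 1. Get llama.cpp and build the quantize tool.
git clone https://github.com/ggerganov/llama.cpp
cmake -B llama.cpp/build llama.cpp
cmake --build llama.cpp/build --target llama-quantize

# 2. Upcast the fp8 (F8_e4m3) checkpoint to bf16, since the GGUF
#    converter does not read fp8 safetensors directly.
python fp8_cast_bf16.py \
    --input-fp8-hf-path DeepSeek-R1 \
    --output-bf16-hf-path DeepSeek-R1-bf16

# 3. Convert the bf16 checkpoint to a single GGUF file.
python llama.cpp/convert_hf_to_gguf.py DeepSeek-R1-bf16 \
    --outfile DeepSeek-R1-BF16.gguf --outtype bf16

# 4. Quantize the GGUF to Q8_0.
llama.cpp/build/bin/llama-quantize DeepSeek-R1-BF16.gguf DeepSeek-R1-Q8_0.gguf Q8_0
```

Note that the intermediate bf16 file for a model of this size is well over a terabyte, so plan disk space accordingly.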