- **vLLM error: "Blockwise quantization only supports 16/32-bit floats, but got torch.uint8"** · 6 comments · #3 opened 9 days ago by ChloeHuang1
- **How to convert this model to GGUF?** · 2 comments · #2 opened 9 days ago by degot
- **The `tokenizer_config.json` is missing the `chat_template` jinja?** · 1 comment · #1 opened 20 days ago by ubergarm