I'm a newbie. How do I use this?
I'm a newbie. After I download the model weights, how can I use them for inference? Also, for the Q2_K_XS version, does it only need a CPU to run? Is no GPU required? What about the other versions? Thanks.
The model card page [https://huggingface.co/unsloth/DeepSeek-V3-GGUF] has links to various Google Colab notebooks that can be used for fine-tuning and inference (towards the end of the Colab sheet).
Files named Q*_* are typically quantized files (GGUF format); they can also run on a CPU, BUT performance degrades. This is possible thanks to the llama.cpp project, which runs these models on the CPU. Files with the .safetensors extension (not present in this repo, since this is a GGUF repo) are typically deployed on GPUs.
You can use Ollama or similar tools to run GGUF files easily.
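For example, here is a minimal sketch of running a downloaded GGUF file locally with llama.cpp or Ollama. The model file name below is an assumption; substitute whichever quantized file you actually downloaded.

```shell
# Run directly with llama.cpp's CLI (after building llama.cpp).
# The GGUF path is an assumption -- point it at your downloaded file.
./llama-cli -m ./DeepSeek-V3-Q2_K_XS.gguf -p "Hello, who are you?" -n 128

# Optionally offload some layers to a GPU with -ngl (--n-gpu-layers);
# without it, everything runs on the CPU.
./llama-cli -m ./DeepSeek-V3-Q2_K_XS.gguf -ngl 40 -p "Hello" -n 128

# Or with Ollama: create a Modelfile pointing at the GGUF, then run it.
echo 'FROM ./DeepSeek-V3-Q2_K_XS.gguf' > Modelfile
ollama create deepseek-v3-local -f Modelfile
ollama run deepseek-v3-local "Hello, who are you?"
```

Note that the CPU-only route works for any of the quantized versions, not just Q2_K_XS; the smaller quants simply need less RAM.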