EmbeddingGemma 300M ggml-org/embeddinggemma-300M-GGUF 0.3B • Updated Sep 4 • 5.12k • 10 ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 634 • 2 ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 722 • 2
ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 634 • 2
ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 722 • 2
Multimodal GGUFs Vision and audio models compatible with llama-server and llama-mtmd-cli Gemma 3 Collection 10 items • Updated Aug 27 • 19 Kimi-VL Collection 2 items • Updated Aug 20 ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 438 • 4 InternVL 3 and InternVL 2.5 Collection 10 items • Updated Aug 20
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 438 • 4
EmbeddingGemma 300M ggml-org/embeddinggemma-300M-GGUF 0.3B • Updated Sep 4 • 5.12k • 10 ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 634 • 2 ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 722 • 2
ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 634 • 2
ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated 21 days ago • 722 • 2
Multimodal GGUFs Vision and audio models compatible with llama-server and llama-mtmd-cli Gemma 3 Collection 10 items • Updated Aug 27 • 19 Kimi-VL Collection 2 items • Updated Aug 20 ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 438 • 4 InternVL 3 and InternVL 2.5 Collection 10 items • Updated Aug 20
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 438 • 4
ggml-org/Qwen3-30B-A3B-Instruct-2507-Q8_0-GGUF Text Generation • 31B • Updated 14 days ago • 215 • 1
ggml-org/Qwen3-30B-A3B-Thinking-2507-Q8_0-GGUF Text Generation • 31B • Updated 14 days ago • 129
ggml-org/Qwen3-4B-Thinking-2507-Q8_0-GGUF Text Generation • 4B • Updated 14 days ago • 131 • 1