Description

These are GGUF model format files for the rhysjones/Phi-3-mini-mango-1 Phi-3 4k model.

Conversion process

The useful conversion script GGUF-n-Go by thesven was used along with llama.cpp to generate the different quantized sizes for the model.

Downloads last month
99
GGUF
Model size
3.82B params
Architecture
phi3

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for rhysjones/Phi-3-mini-mango-1-GGUF

Quantized
(1)
this model