Description

These are GGUF model format files for the rhysjones/Phi-3-mini-mango-1 Phi-3 4k model.

Conversion process

The useful conversion script GGUF-n-Go by thesven was used along with llama.cpp to generate the different quantized sizes for the model.

GGUF

Model size

3.82B params

Architecture

phi3

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Base model

Quantized

(1)

this model