README.md exists but content is empty.
Downloads last month: 59
GGUF
Model size: 7.24B params
Architecture: llama
Quantizations: 5-bit, 6-bit
