README.md · dranger003/labradorite-13b-iMat.GGUF at 52afcbb3e01d027067d45b20480284845bce4fa8

metadata

license: apache-2.0
pipeline_tag: text-generation
library_name: gguf
base_model: ibm/labradorite-13b

NOTE: You will need a recent build of llama.cpp to run these quants (i.e. at least commit 494c870).

GGUF importance matrix (imatrix) quants for https://huggingface.co/ibm/labradorite-13b

The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a general purpose imatrix calibration dataset.
The imatrix is being used on the K-quants as well.

Layers	Context	Template
40	32768	<\|system\|> {sys_prompt} <\|user\|> {inputs} <\|assistant\|> {response}<\|endoftext\|>