Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Perplexity (ctx 8192)

BF16:

I1005 10:12:54.337877 967332 eval_ppl.py:66] wikitext2 perplexity: 6.49992036819458
I1005 10:17:19.718732 967332 eval_ppl.py:66] c4 perplexity: 8.022844314575195

4 Bit QTIP (this model):

I1005 10:13:22.272425 967335 eval_ppl.py:66] wikitext2 perplexity: 6.610454082489014
I1005 10:19:33.215519 967335 eval_ppl.py:66] c4 perplexity: 8.128130912780762

Zeroshot Results with lm_eval 0.3.0

BF16:

|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.5196|±  |0.0146|
|             |       |acc_norm|0.5512|±  |0.0145|
|arc_easy     |      0|acc     |0.8182|±  |0.0079|
|             |       |acc_norm|0.7980|±  |0.0082|
|boolq        |      1|acc     |0.8410|±  |0.0064|
|piqa         |      0|acc     |0.8003|±  |0.0093|
|             |       |acc_norm|0.8090|±  |0.0092|
|winogrande   |      0|acc     |0.7380|±  |0.0124|

4 Bit QTIP (this model):

|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.5179|±  |0.0146|
|             |       |acc_norm|0.5486|±  |0.0145|
|arc_easy     |      0|acc     |0.8157|±  |0.0080|
|             |       |acc_norm|0.7942|±  |0.0083|
|boolq        |      1|acc     |0.8425|±  |0.0064|
|piqa         |      0|acc     |0.8025|±  |0.0093|
|             |       |acc_norm|0.8079|±  |0.0092|
|winogrande   |      0|acc     |0.7419|±  |0.0123|
Downloads last month
4
Safetensors
Model size
2.8B params
Tensor type
BF16
·
F32
·
FP16
·
I16
·
Inference API
Unable to determine this model's library. Check the docs .

Collection including relaxml/Llama-3.1-8b-Instruct-QTIP-4Bit