metadata

license: apache-2.0

gguf versions of OpenLLaMa 3B

Version: 1T tokens final version
Project: OpenLLaMA: An Open Reproduction of LLaMA
Model: openlm-research/open_llama_3b
llama.cpp: build 1012 (6381d4e) or later
ggml version

Newer quantizations

There are now more quantization types in llama.cpp, some lower than 4 bits. Currently these are not supported, maybe because some weights have shapes that don't divide by 256.

Perplexity on wiki.test.406

Coming soon...