rozek committed
Commit 132eb31 · Parent: 282d944

Update README.md

Files changed (1):
  1. README.md +13 -0
README.md CHANGED
@@ -7,6 +7,12 @@ This repository contains the most relevant quantizations of Stability AI's
 in GGUF format - ready to be used with
 [llama.cpp](https://github.com/ggerganov/llama.cpp) and similar applications.
 
+> For this model, Stability AI claims: "_StableLM-3B-4E1T achieves
+> state-of-the-art performance (September 2023) at the 3B parameter scale
+> for open-source models and is competitive with many of the popular
+> contemporary 7B models, even outperforming our most recent 7B
+> StableLM-Base-Alpha-v2._"
+
 Right now, the following quantizations are available:
 
 * [stablelm-3b-4e1t-Q4_K_M](https://huggingface.co/rozek/StableLM-3B-4E1T_GGUF/blob/main/stablelm-3b-4e1t-Q4_K_M.bin)
@@ -17,6 +23,13 @@ Right now, the following quantizations are available:
 These files are presented here with the written permission of Stability AI (although
 access to the model itself is "gated").
 
+Any model details can be found on the
+[original model card](https://huggingface.co/stabilityai/stablelm-3b-4e1t) and in
+a paper on [StableLM-3B-4E1T](https://stability.wandb.io/stability-llm/stable-lm/reports/StableLM-3B-4E1T--VmlldzoyMjU4?accessToken=u3zujipenkx5g7rtcj9qojjgxpconyjktjkli2po09nffrffdhhchq045vp0wyfo).
+The most important ones are
+
+* context length is 4096
+
 The chosen license is the same as that
 of the original model.
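
The README lines above note that the files are "ready to be used with llama.cpp and similar applications" and that the context length is 4096. As a minimal sketch of what that usage could look like (not part of this commit), the following assumes the llama-cpp-python bindings, a locally downloaded Q4_K_M file, and an arbitrary completion prompt:

```python
# Minimal sketch using the llama-cpp-python bindings (an assumption; the
# commit itself only mentions llama.cpp). The file name comes from this
# repository, the context length (4096) from the README addition.
from llama_cpp import Llama

llm = Llama(
    model_path="./stablelm-3b-4e1t-Q4_K_M.bin",  # quantized GGUF file from this repo
    n_ctx=4096,                                  # context length per the README
)

# StableLM-3B-4E1T is a base model, so a plain completion prompt is used here.
output = llm("The GGUF file format is", max_tokens=64)
print(output["choices"][0]["text"])
```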