Crataco
/

RWKV-4-PilePlus-Series-GGML

Text Generation

Model card Files Files and versions Community

Merry commited on May 24, 2023

Commit

17eb111

·

1 Parent(s): 67d9bd9

Update README.md

Files changed (1) hide show

README.md +18 -1

README.md CHANGED Viewed

@@ -16,7 +16,24 @@ datasets:
 This is [BlinkDL/rwkv-4-pileplus](https://huggingface.co/BlinkDL/rwkv-4-pileplus) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
-Original model card is below.
 * * *

 This is [BlinkDL/rwkv-4-pileplus](https://huggingface.co/BlinkDL/rwkv-4-pileplus) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
+### RAM USAGE (KoboldCpp)
+Model | RAM usage (with OpenBLAS)
+:--:|:--:
+Unloaded | 41.3 MiB
+169M q4_0 | 249.0 MiB
+169M q5_0 | 254.2 MiB
+169M q5_1 | 259.6 MiB
+430M q4_0 | 443.7 MiB
+430M q5_0 | 463.2 MiB
+430M q5_1 | 482.8 MiB
+1.5B q4_0 | 1.2 GiB
+1.5B q5_0 | 1.3 GiB
+1.5B q5_1 | 1.4 GiB
+3B q4_0 | 2.1 GiB
+3B q5_0 | 2.3 GiB
+3B q5_1 | 2.5 GiB
+Original model card by BlinkDL is below.
 * * *