Crataco/RWKV-4-PilePlus-Series-GGML

Merry committed on Jun 16, 2023

Commit 817e951 • 1 Parent(s): 5a9ce2f

Alright I was way too tired lol

Files changed (1)

README.md CHANGED

@@ -14,12 +14,24 @@ datasets:
 **Last updated:** 2023-06-07
-This is [BlinkDL/rwkv-4-world](https://huggingface.co/BlinkDL/rwkv-4-world) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
 ### RAM USAGE (KoboldCpp)
 Model | RAM usage (with OpenBLAS)
 :--:|:--:
 Unloaded | 41.3 MiB
 Original model card by BlinkDL is below.

 **Last updated:** 2023-06-07
+This is [BlinkDL/rwkv-4-pileplus](https://huggingface.co/BlinkDL/rwkv-4-pileplus) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
 ### RAM USAGE (KoboldCpp)
 Model | RAM usage (with OpenBLAS)
 :--:|:--:
 Unloaded | 41.3 MiB
+169M q4_0 | 232.2 MiB
+169M q5_0 | 243.3 MiB
+169M q5_1 | 249.2 MiB
+430M q4_0 | 413.2 MiB
+430M q5_0 | 454.4 MiB
+430M q5_1 | 471.8 MiB
+1.5B q4_0 | 1.1 GiB
+1.5B q5_0 | 1.3 GiB
+1.5B q5_1 | 1.3 GiB
+3B q4_0 | 2.0 GiB
+3B q5_0 | 2.3 GiB
+3B q5_1 | 2.4 GiB
 Original model card by BlinkDL is below.