Merry commited on
Commit
17eb111
·
1 Parent(s): 67d9bd9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -16,7 +16,24 @@ datasets:
16
 
17
  This is [BlinkDL/rwkv-4-pileplus](https://huggingface.co/BlinkDL/rwkv-4-pileplus) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
18
 
19
- Original model card is below.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
 
21
  * * *
22
 
 
16
 
17
  This is [BlinkDL/rwkv-4-pileplus](https://huggingface.co/BlinkDL/rwkv-4-pileplus) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
18
 
19
+ ### RAM USAGE (KoboldCpp)
20
+ Model | RAM usage (with OpenBLAS)
21
+ :--:|:--:
22
+ Unloaded | 41.3 MiB
23
+ 169M q4_0 | 249.0 MiB
24
+ 169M q5_0 | 254.2 MiB
25
+ 169M q5_1 | 259.6 MiB
26
+ 430M q4_0 | 443.7 MiB
27
+ 430M q5_0 | 463.2 MiB
28
+ 430M q5_1 | 482.8 MiB
29
+ 1.5B q4_0 | 1.2 GiB
30
+ 1.5B q5_0 | 1.3 GiB
31
+ 1.5B q5_1 | 1.4 GiB
32
+ 3B q4_0 | 2.1 GiB
33
+ 3B q5_0 | 2.3 GiB
34
+ 3B q5_1 | 2.5 GiB
35
+
36
+ Original model card by BlinkDL is below.
37
 
38
  * * *
39