BlinkDL
/

rwkv-4-pileplus

Text Generation

Model card Files Files and versions Community

BlinkDL commited on May 21, 2023

Commit

e637e53

·

1 Parent(s): aec24d1

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -18,9 +18,11 @@ datasets:
 RWKV-4-pile models finetuning on [RedPajama + some of Pile v2 = 1.7T tokens]. Updated with 2020+2021+2022 data, and better at all European languages.
-These are intermedia checkpoints (XXXGtokens means finetuned for XXXG tokens). They are not instruct tuned yet, and they are recommended to replace vanilla Pile models.
-You can already use them because I am finetuning from Pile models (instead of retraining).
 See https://github.com/BlinkDL/RWKV-LM for details.

 RWKV-4-pile models finetuning on [RedPajama + some of Pile v2 = 1.7T tokens]. Updated with 2020+2021+2022 data, and better at all European languages.
+Although some of these are intermedia checkpoints (XXXGtokens means finetuned for XXXG tokens), you can already use them because I am finetuning from Pile models (instead of retraining).
+Note: not instruct tuned yet, and recommended to replace vanilla Pile models.
+7B and 14B coming soon.
 See https://github.com/BlinkDL/RWKV-LM for details.