fla-hub/rwkv7-168M-pile

Add paper link and library name

#2

by nielsr HF staff - opened 5 days ago

base: refs/heads/main ← from: refs/pr/2

README.md +6 -5

@@ -1,14 +1,15 @@
 ---
-license: apache-2.0
+base_model:
+- BlinkDL/rwkv-7-pile
 datasets:
 - EleutherAI/the_pile_deduplicated
 language:
 - en
+license: apache-2.0
 metrics:
 - accuracy
-base_model:
-- BlinkDL/rwkv-7-pile
 pipeline_tag: text-generation
+library_name: rwkv
 ---
 # rwkv7-168M-pile
@@ -37,7 +38,7 @@ This is RWKV-7 model under flash-linear attention format.
 <!-- Provide the basic links for the model. -->
 - **Repository:** https://github.com/fla-org/flash-linear-attention ; https://github.com/BlinkDL/RWKV-LM
-- **Paper:** https://arxiv.org/abs/2503.14456
+- **Paper:** https://huggingface.co/papers/2503.14456
 - **Weights:** Converted from https://modelscope.cn/models/RWKV/rwkv-7-pile/file/view/master?fileName=RWKV-x070-Pile-168M-20241120-ctx4096.pth
 ## Uses
@@ -81,4 +82,4 @@ This model is trained on the Pile with a total of 332 billion tokens.
 ## FAQ
 Q: safetensors metadata is none.
-A: upgrade transformers to >=4.48.0: `pip install 'transformers>=4.48.0'`
+A: upgrade transformers to >=4.48.0: `pip install 'transformers>=4.48.0'`
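
The FAQ answer above sets a minimum transformers version of 4.48.0. A minimal sketch of checking that requirement before loading the model could look like the following. The helper names (`release_tuple`, `meets_minimum`, `transformers_is_recent_enough`) are hypothetical and not part of this repository or of transformers; only the standard library is used, and pre-release suffixes such as `.dev0` are ignored as a simplification.

```python
import re
from importlib.metadata import PackageNotFoundError, version

MIN_TRANSFORMERS = "4.48.0"  # minimum version stated in the README FAQ


def release_tuple(v: str) -> tuple:
    """Return the leading numeric release components, e.g. '4.48.0.dev0' -> (4, 48, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", v)
    if match is None:
        raise ValueError(f"unrecognized version string: {v!r}")
    return tuple(int(part) for part in match.group().split("."))


def meets_minimum(installed: str, minimum: str = MIN_TRANSFORMERS) -> bool:
    """Compare versions numerically, padding the shorter one with zeros."""
    a, b = release_tuple(installed), release_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def transformers_is_recent_enough() -> bool:
    """Hypothetical helper: False if transformers is missing or older than the minimum."""
    try:
        return meets_minimum(version("transformers"))
    except PackageNotFoundError:
        return False


if __name__ == "__main__":
    if not transformers_is_recent_enough():
        print(f"Run: pip install 'transformers>={MIN_TRANSFORMERS}'")
```

Comparing integer tuples rather than raw strings avoids mis-ordering versions such as 4.9.0 and 4.48.0.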