Update README.md
README.md CHANGED
@@ -1,5 +1,14 @@
 ---
 license: apache-2.0
+datasets:
+- EleutherAI/the_pile_deduplicated
+language:
+- en
+metrics:
+- accuracy
+base_model:
+- BlinkDL/rwkv-7-pile
+pipeline_tag: text-generation
 ---
 
 # rwkv7-168m-pile
@@ -36,7 +45,7 @@ This is RWKV-7 model under flash-linear attention format.
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 Install flash-linear-attention before using this model:
 
-```
+```bash
 git clone https://github.com/fla-org/flash-linear-attention
 cd flash-linear-attention
 pip install -e .
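
A quick sanity check after the editable install (a minimal sketch; it assumes the repository installs under the module name `fla`, which is how fla-org ships it):

```bash
# If this import succeeds, the editable install of flash-linear-attention is on the path.
python -c "import fla; print('flash-linear-attention OK')"
```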
@@ -46,7 +55,7 @@ pip install -e .
 
 <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
 You can use this model like any other HuggingFace model:
-```
+```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 model = AutoModelForCausalLM.from_pretrained('fla-hub/rwkv7-168m-pile', trust_remote_code=True)
 tokenizer = AutoTokenizer.from_pretrained('fla-hub/rwkv7-168m-pile', trust_remote_code=True)
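
The snippet in the diff stops after loading the model and tokenizer. A minimal generation sketch built on the standard transformers `generate` API might look like the following; the prompt and sampling settings are illustrative rather than from the model card, and the fla kernels may require a CUDA GPU:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained('fla-hub/rwkv7-168m-pile', trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained('fla-hub/rwkv7-168m-pile', trust_remote_code=True)
model.eval()

# Encode an illustrative prompt and sample a short continuation.
inputs = tokenizer("The Pile is a large, diverse dataset", return_tensors="pt")
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=32, do_sample=True,
                             temperature=0.8, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```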
@@ -67,4 +76,4 @@ This model is trained on the Pile with a total of 332 billion tokens.
 #### Metrics
 
 `lambada_openai`: ppl 14.2 acc 45.6%
-`piqa`: acc 65.5%
+`piqa`: acc 65.5%
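
The card does not say how these numbers were produced. `lambada_openai` and `piqa` are task names from EleutherAI's lm-evaluation-harness, so one plausible (unverified) way to reproduce them is via its CLI; the batch size here is illustrative:

```bash
pip install lm-eval

# Evaluate the checkpoint on both reported tasks.
lm_eval --model hf \
  --model_args pretrained=fla-hub/rwkv7-168m-pile,trust_remote_code=True \
  --tasks lambada_openai,piqa \
  --batch_size 32
```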
|