IDEA-CCNL
/

Randeng-T5-77M

Text2Text Generation

Inference Endpoints

Model card Files Files and versions Community

roygan commited on Jun 8, 2022

Commit

c4a67c7

·

1 Parent(s): 16dde8a

Update README.md

Files changed (1) hide show

README.md +38 -0

README.md CHANGED Viewed

@@ -1,3 +1,41 @@
 ---
 license: apache-2.0
 ---

 ---
+language:
+  - zh
 license: apache-2.0
+tags:
+- T5
+- chinese
+- sentencepiece
+inference: true
+widget:
+- text: "北京有悠久的 <extra_id_0>和 <extra_id_1>。"
+- type: "text-generation"
 ---
+# Randeng-T5-77M, one model of [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM).
+Based on mt5-small, Randeng-T5-77M only retains the vocabulary and embedding corresponding to Chinese and English, and continues to train on the basis of 180G Chinese general pre-training corpus. The pretrain target is span corruption. We pretrain the model based on our [fengshen framework](https://github.com/IDEA-CCNL/Fengshenbang-LM/tree/main/fengshen), use 8 * A100 for 24 hours.
+## Usage
+```python
+from transformers import T5ForConditionalGeneration, AutoTokenizer
+import torch
+tokenizer=AutoTokenizer.from_pretrained('IDEA-CCNL/Randeng-T5-77M', use_fast=false)
+model=T5ForConditionalGeneration.from_pretrained('IDEA-CCNL/Randeng-T5-77M')
+```
+## Citation
+If you find the resource is useful, please cite the following website in your paper.
+```
+@misc{Fengshenbang-LM,
+  title={Fengshenbang-LM},
+  author={IDEA-CCNL},
+  year={2022},
+  howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
+}
+```