Update README.md
README.md CHANGED
@@ -9,9 +9,8 @@ tags:
 # **KoBigBird-RoBERTa-large**
 
 This is a large-sized Korean BigBird model introduced in our [paper]() (IJCNLP-AACL 2023).
-The model draws heavily from the parameters of [klue/roberta-large](https://huggingface.co/klue/roberta-large) to ensure high performance
-
-With the assistance of TAPER to extend position embeddings, the language model's extrapolation capabilities are enhanced.
+The model draws heavily from the parameters of [klue/roberta-large](https://huggingface.co/klue/roberta-large) to ensure high performance.
+By employing the BigBird architecture and incorporating the newly proposed TAPER, the language model accommodates even longer input lengths.
 
 ### How to Use
 
@@ -33,7 +32,7 @@ Measurement on validation sets of the KLUE benchmark datasets
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62ce3886a9be5c195564fd71/50jMYggkGVUM06n2v1Hxm.png)
 
 ### Limitations
-While our model achieves great results even without additional pretraining, direct pretraining can further refine positional representations.
+While our model achieves great results even without additional pretraining, direct pretraining can further refine positional representations.
 
 ## Citation Information
 
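The diff references a "How to Use" section without showing its contents, and the new wording claims the model accommodates longer inputs than a standard RoBERTa. Below is a minimal sketch of what usage might look like with Hugging Face Transformers; the Hub id `vaiv/kobigbird-roberta-large` and the 4096-token limit are assumptions inferred from the model family, not confirmed by this diff.

```python
# Minimal usage sketch; model id and max length are assumptions, check the model card.
from transformers import AutoModel, AutoTokenizer

model_id = "vaiv/kobigbird-roberta-large"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

# Encode an input longer than RoBERTa's usual 512-token window to exercise
# the extended position embeddings described in the README.
long_text = "한국어 문서 " * 1024
inputs = tokenizer(long_text, return_tensors="pt", truncation=True, max_length=4096)
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, seq_len, hidden_size)
```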