huawei-noah/EntityCS-39-MLM-xlmr-base


fenchri committed on Sep 13, 2023

Commit 45cd7b6 · 1 Parent(s): c1f1a7d

Update README.md

Files changed (1)

README.md +7 -5


@@ -49,7 +49,9 @@ language:
 - Paper: https://aclanthology.org/2022.findings-emnlp.499.pdf
 - Repository: https://github.com/huawei-noah/noah-research/tree/master/NLP/EntityCS
 - Point of Contact: [Fenia Christopoulou](mailto:[email protected]), [Chenxi Whitehouse](mailto:[email protected])
+## Model Description
 This model has been trained on the EntityCS corpus, an English corpus from Wikipedia with replaced entities in different languages.
 The corpus can be found in [https://huggingface.co/huawei-noah/entity_cs](https://huggingface.co/huawei-noah/entity_cs), check the link for more details.
 To train models on the corpus, we first employ the conventional 80-10-10 MLM objective, where 15% of sentence subwords are considered as masking candidates. From those, we replace subwords
@@ -105,7 +107,7 @@ In the paper, we focused on entity-related tasks, such as NER, Word Sense Disamb
 Alternatively, it can be used directly (no fine-tuning) for probing tasks, i.e. predict missing words, such as [X-FACTR](https://aclanthology.org/2020.emnlp-main.479/).
-For results on each downstream task, please refer to the paper.
+For results on each downstream task, please refer to the [paper](https://aclanthology.org/2022.findings-emnlp.499.pdf).
 ## How to Get Started with the Model
@@ -114,7 +116,7 @@ Use the code below to get started with the model: https://github.com/huawei-noah
 ## Citation
-**BibTeX:**
+**BibTeX**
 ```html
 @inproceedings{whitehouse-etal-2022-entitycs,
@@ -132,8 +134,8 @@ Use the code below to get started with the model: https://github.com/huawei-noah
 }
 ```
-**APA:**
+**APA**
 ```html
-Whitehouse, C., Christopoulou, F., & Iacobacci, I. (2022). EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching. In Findings of the Association for Computational Linguistics: EMNLP 2022 (pp. 6698–6714). Association for Computational Linguistics.
+Whitehouse, C., Christopoulou, F., & Iacobacci, I. (2022). EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching. In Findings of the Association for Computational Linguistics: EMNLP 2022.
 ```
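
The README text in this diff mentions the conventional 80-10-10 MLM objective with 15% of subwords as masking candidates, but the sentence is cut off before the split is spelled out. Below is a minimal sketch of that objective, assuming the standard BERT-style split (80% of candidates replaced by the mask token, 10% by a random subword, 10% left unchanged). The function name `mask_subwords`, the toy vocabulary, and the `<mask>` string are illustrative assumptions and are not taken from the EntityCS repository.

```python
import random


def mask_subwords(tokens, vocab, mask_token="<mask>", candidate_prob=0.15, rng=None):
    """Corrupt a list of subwords with an assumed 80-10-10 MLM scheme.

    Returns (corrupted, labels). labels[i] holds the original subword at
    positions chosen as masking candidates and None everywhere else.
    """
    rng = rng or random.Random()
    corrupted = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        # Each subword becomes a masking candidate with probability 0.15.
        if rng.random() >= candidate_prob:
            continue
        labels[i] = tok
        roll = rng.random()
        if roll < 0.8:
            # Assumed 80% case: replace with the mask token.
            corrupted[i] = mask_token
        elif roll < 0.9:
            # Assumed 10% case: replace with a random subword from the vocabulary.
            corrupted[i] = rng.choice(vocab)
        # Assumed remaining 10% case: keep the original subword unchanged.
    return corrupted, labels


if __name__ == "__main__":
    toy_vocab = ["▁the", "▁city", "▁of", "▁Athens", "▁river"]  # hypothetical vocabulary
    sentence = ["▁Athens", "▁is", "▁the", "▁capital", "▁of", "▁Greece"]
    print(mask_subwords(sentence, toy_vocab, rng=random.Random(0)))
```

The loss would be computed only at positions where `labels[i]` is not `None`. Any entity-specific masking used in the paper is not covered by this sketch.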