language: | |
- "zh" | |
tags: | |
- "chinese" | |
- "token-classification" | |
- "pos" | |
- "dependency-parsing" | |
base_model: KoichiYasuoka/roberta-base-chinese-upos | |
datasets: | |
- "universal_dependencies" | |
license: "cc-by-sa-4.0" | |
pipeline_tag: "token-classification" | |
# roberta-base-chinese-ud-goeswith | |
## Model Description | |
This is a RoBERTa model pre-trained on Chinese Wikipedia texts (both simplified and traditional) for POS-tagging and dependency-parsing (using `goeswith` for subwords), derived from [roberta-base-chinese-upos](https://huggingface.co/KoichiYasuoka/roberta-base-chinese-upos). | |
## How to Use | |
```py | |
from transformers import pipeline | |
nlp=pipeline("universal-dependencies","KoichiYasuoka/roberta-base-chinese-ud-goeswith",trust_remote_code=True,aggregation_strategy="simple") | |
print(nlp("我把这本书看完了")) | |
``` | |
## Reference | |
Koichi Yasuoka: [Sequence-Labeling RoBERTa Model for Dependency-Parsing in Classical Chinese and Its Application to Vietnamese and Thai](https://doi.org/10.1109/ICBIR57571.2023.10147628), ICBIR 2023: 8th International Conference on Business and Industrial Research (May 2023), pp.169-173. | |