Update README.md

@basilevc

@skirres

I have specified that by Chinese we meant simplified chinese as requested by

@ArianeCavet
here.

I have also reorder the language by the alphabetical order of the language codes,

@ArianeCavet
ok for you?

Just note that zs is not recognized by huggingface language tags.

Files changed (1) hide show

README.md +21 -21

README.md CHANGED Viewed

@@ -1,18 +1,18 @@
 ---
 pipeline_tag: sentence-similarity
 tags:
-  - feature-extraction
-  - sentence-similarity
 language:
-  - de
-  - en
-  - es
-  - fr
-  - it
-  - nl
-  - ja
-  - pt
-  - zh
 ---
 # Model Card for `vectorizer.raspberry`
@@ -27,15 +27,15 @@ Model name: `vectorizer.raspberry`
 The model was trained and tested in the following languages:
-- English
-- French
 - German
 - Spanish
 - Italian
 - Dutch
 - Japanese
 - Portuguese
-- Chinese
 Besides these languages, basic support can be expected for additional 91 languages that were used during the pretraining
 of the base model (see Appendix A of XLM-R paper).
@@ -115,10 +115,10 @@ We evaluated the model on the datasets of the [MIRACL benchmark](https://github.
 multilingual capacities. Note that not all training languages are part of the benchmark, so we only report the metrics
 for the existing languages.
-| Language | Recall@100 |
-|:---------|-----------:|
-| French   |      0.650 |
-| German   |      0.528 |
-| Spanish  |      0.602 |
-| Japanese |      0.614 |
-| Chinese  |      0.680 |

 ---
 pipeline_tag: sentence-similarity
 tags:
+- feature-extraction
+- sentence-similarity
 language:
+- de
+- en
+- es
+- fr
+- it
+- nl
+- ja
+- pt
+- zs
 ---
 # Model Card for `vectorizer.raspberry`
 The model was trained and tested in the following languages:
 - German
+- English
 - Spanish
+- French
 - Italian
 - Dutch
 - Japanese
 - Portuguese
+- Simplified Chinese
 Besides these languages, basic support can be expected for additional 91 languages that were used during the pretraining
 of the base model (see Appendix A of XLM-R paper).
 multilingual capacities. Note that not all training languages are part of the benchmark, so we only report the metrics
 for the existing languages.
+| Language            | Recall@100 |
+|:--------------------|-----------:|
+| German              |      0.528 |
+| Spanish             |      0.602 |
+| French              |      0.650 |
+| Japanese            |      0.614 |
+| Simplified Chinese  |      0.680 |