Update README.md
Browse files
README.md
CHANGED
@@ -88,7 +88,7 @@ For more details about how the model was trained, check out [our blogpost](https
|
|
88 |
|
89 |
### Evaluation
|
90 |
|
91 |
-
|
92 |
|
93 |
The m-ArenaHard dataset, used to evaluate Aya Expanse’s capabilities, is publicly available [here](https://huggingface.co/datasets/CohereForAI/m-ArenaHard).
|
94 |
|
@@ -104,4 +104,4 @@ For errors or additional questions about details in this model card, contact inf
|
|
104 |
|
105 |
### Terms of Use
|
106 |
|
107 |
-
|
|
|
88 |
|
89 |
### Evaluation
|
90 |
|
91 |
+
They evaluated Aya Expanse 8B against Gemma 2 9B, Llama 3.1 8B, Ministral 8B, and Qwen 2.5 7B using the `dolly_human_edited` subset from the [Aya Evaluation Suite dataset](https://huggingface.co/datasets/CohereForAI/aya_evaluation_suite) and m-ArenaHard, a dataset based on the [Arena-Hard-Auto dataset](https://huggingface.co/datasets/lmarena-ai/arena-hard-auto-v0.1) and translated to the 23 languages we support in Aya Expanse 8B. Win-rates were determined using gpt-4o-2024-08-06 as a judge. For a conservative benchmark, we report results from gpt-4o-2024-08-06, though gpt-4o-mini scores showed even stronger performance.
|
92 |
|
93 |
The m-ArenaHard dataset, used to evaluate Aya Expanse’s capabilities, is publicly available [here](https://huggingface.co/datasets/CohereForAI/m-ArenaHard).
|
94 |
|
|
|
104 |
|
105 |
### Terms of Use
|
106 |
|
107 |
+
Tehy hope that the release of this model will make community-based research efforts more accessible, by releasing the weights of a highly performant multilingual model to researchers all over the world. This model is governed by a [CC-BY-NC](https://cohere.com/c4ai-cc-by-nc-license) License with an acceptable use addendum, and also requires adhering to [C4AI's Acceptable Use Policy](https://docs.cohere.com/docs/c4ai-acceptable-use-policy).
|