Update README.md
README.md CHANGED
@@ -59,10 +59,11 @@ We achieve the following MT-Bench scores across 6 languages:
 | **German** 🇩🇪 | NaN | 7.26 | 6.99 | 7.68 |
 | **French** 🇫🇷 | NaN | 7.66 | 7.29 | 7.74 |
 | **Japanese** 🇯🇵 | NaN | 6.56 | 6.22 | 7.84 |
-| **Russian** 🇷🇺
+| **Russian** 🇷🇺 * | NaN | 8.19 | 8.28 | 7.94 |
 | **Chinese** 🇨🇳 | NaN | 7.11 | 6.97 | 7.55 |
 | **English** 🇺🇸 | 7.98 | 7.73 | 7.92 | 8.26 |
-
+
+\* (Note: the Russian scores exclude code, reasoning and math problems, as there are no translated reference answers for these questions.)
 
 We observe minimal degradation of Llama 3's English ability while achieving best-in-class multilingual abilities compared to the top-rated 7B model ([Nexusflow/Starling-LM-7B-beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)) on the [Chatbot Arena Leaderboard](https://chat.lmsys.org/?leaderboard).