Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ metrics:
|
|
18 |
<img src="SambaLingo_Logo.png" width="340" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
|
19 |
|
20 |
<!-- Provide a quick summary of what the model is/does. -->
|
21 |
-
SambaLingo-Thai-Base is a pretrained Bi-lingual Thai and English model that adapts [Llama 2](https://huggingface.co/meta-llama/Llama-2-7b-hf) to Thai by training on
|
22 |
|
23 |
## Model Description
|
24 |
<!-- Provide a longer summary of what this model is. -->
|
@@ -96,7 +96,7 @@ We would like to give a special thanks to the following groups:
|
|
96 |
@software{sambalingo,
|
97 |
title = {{SambaLingo: Language Experts Adapted From Llama}},
|
98 |
author = {SambaNova Systems},
|
99 |
-
url = {https://huggingface.co/sambanovasystems/SambaLingo
|
100 |
month = {2},
|
101 |
year = {2024},
|
102 |
version = {1.0},
|
|
|
18 |
<img src="SambaLingo_Logo.png" width="340" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
|
19 |
|
20 |
<!-- Provide a quick summary of what the model is/does. -->
|
21 |
+
SambaLingo-Thai-Base is a pretrained Bi-lingual Thai and English model that adapts [Llama 2](https://huggingface.co/meta-llama/Llama-2-7b-hf) to Thai by training on 38 billion tokens from the Thai split of the [Cultura-X](https://huggingface.co/datasets/uonlp/CulturaX) dataset. This model reports state of the art evaluation results in perplexity and FLORES-200 translation. For the chat version of this model, please see [sambanovasystems/SambaLingo-Thai-Chat](https://huggingface.co/sambanovasystems/SambaLingo-Thai-Chat).
|
22 |
|
23 |
## Model Description
|
24 |
<!-- Provide a longer summary of what this model is. -->
|
|
|
96 |
@software{sambalingo,
|
97 |
title = {{SambaLingo: Language Experts Adapted From Llama}},
|
98 |
author = {SambaNova Systems},
|
99 |
+
url = {https://huggingface.co/sambanovasystems/SambaLingo-Thai-Base}
|
100 |
month = {2},
|
101 |
year = {2024},
|
102 |
version = {1.0},
|