Update README.md
--- a/README.md
+++ b/README.md
@@ -21,7 +21,7 @@ base_model: meta-llama/Llama-3.1-8B-Instruct
 # Llama3.1 8B CPT SEA-LIONv3
 SEA-LION is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
 
-Llama3.1 8B CPT SEA-LIONv3 Base is a multilingual model which has undergone continued pre-training
+Llama3.1 8B CPT SEA-LIONv3 Base is a multilingual model which has undergone continued pre-training on approximately **200B** tokens across the 11 official Southeast Asian languages: English, Chinese, Vietnamese, Indonesian, Thai, Tamil, Filipino, Malay, Khmer, Lao, Burmese.
 
 SEA-LION stands for <i>Southeast Asian Languages In One Network</i>.
 
@@ -33,7 +33,7 @@ SEA-LION stands for <i>Southeast Asian Languages In One Network</i>.
 
 ## Model Details
 ### Model Description
-
+We performed continued pre-training in English and ASEAN languages on [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct), a decoder model using the Llama 3.1 architecture, to create Llama3.1 8B CPT SEA-LIONv3 Base.
 
 For tokenisation, the model employs the default tokenizer used in Llama3.1 8B Instruct.
 
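For context, here is a minimal usage sketch (not part of this diff) of how the tokenisation note in the updated card plays out with Hugging Face `transformers`: the checkpoint is loaded with the standard Auto classes, and the tokenizer resolved from the repository is the default Llama 3.1 8B Instruct tokenizer mentioned in the README. The repository id below is an assumption for illustration; substitute the actual model id.

```python
# Minimal sketch: load the base model and its tokenizer with transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "aisingapore/llama3.1-8b-cpt-sea-lionv3-base"  # assumed repo id, replace if different

# Per the model card, this resolves to the default Llama 3.1 8B Instruct tokenizer.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Base (non-chat) model: plain text completion rather than a chat template.
prompt = "Sebutkan lima makanan tradisional di Indonesia."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```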