Update README.md
README.md
CHANGED
@@ -23,7 +23,7 @@ https://www.linkedin.com/in/vishnu-prasad-j/
 # Model description
 The MalayaLLM models have been improved and customized to incorporate a comprehensive Malayalam vocabulary comprising approximately 18,000 tokens, expanding upon the groundwork laid by the original LLaMA-2.
 
-- **Model type:** A 7B LLaMA2 pretrained model on Malayalam .
+- **Model type:** A 7B LLaMA2 pretrained model on Malayalam tokens.
 - **Language(s):** Malayalam and English
 - **Datasets:** [ai4bharat](https://storage.googleapis.com/ai4bharat-public-indic-nlp-corpora/indiccorp/ml.tar.xz) ,
 [CulturaX](https://huggingface.co/datasets/uonlp/CulturaX/tree/main/ml)