# Model Card for Tamil-Mistral-7B-v0.1

The Tamil-Mistral-7B-v0.1 Large Language Model (LLM) is a pre-trained generative text model built on top of the 7-billion-parameter Mistral base model. It extends the base model's tokenization capability by adding 20k Tamil tokens to the tokenizer.

Additionally, it was pre-trained on 1.19 million Tamil documents sourced from the Tamil subset of [MADLAD-400 (Multilingual Audited Dataset: Low-resource And Document-level)](https://arxiv.org/abs/2309.04662).
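The tokenizer-extension step described above can be sketched as follows. This is a minimal illustration of the general technique (append new tokens after the existing vocabulary, then grow the embedding matrix to match), not the actual training code; the token strings, vocabulary, and dimensions below are made up for the example.

```python
import numpy as np

# Stand-in base vocabulary and new Tamil tokens (illustrative only;
# the real model adds ~20k tokens to the Mistral tokenizer).
base_vocab = {"<s>": 0, "</s>": 1, "hello": 2}
new_tamil_tokens = ["தமிழ்", "வணக்கம்"]

# 1) Append each new token with a fresh id after the existing vocabulary.
vocab = dict(base_vocab)
for tok in new_tamil_tokens:
    if tok not in vocab:
        vocab[tok] = len(vocab)

# 2) Grow the embedding matrix to the new vocabulary size, keeping the
#    trained rows and randomly initialising the added rows (this is what
#    resize_token_embeddings does in Hugging Face transformers).
hidden_size = 8
old_embeddings = np.random.randn(len(base_vocab), hidden_size)
new_rows = np.random.randn(len(vocab) - len(base_vocab), hidden_size) * 0.02
embeddings = np.vstack([old_embeddings, new_rows])

print(len(vocab), embeddings.shape)  # 5 (5, 8)
```

The newly added rows start from a small random initialisation, which is why continued pretraining on Tamil text (as described above) is needed before the new tokens carry useful representations.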

Pretraining time: 145 hours (NVIDIA RTX A6000 48 GB GPU)

## Mistral model details

For full details of this model, please read our [paper](https://arxiv.org/abs/2310.06825) and [release blog post](https://mistral.ai/news/announcing-mistral-7b/).