atsuki-yamaguchi committed on
Commit
d8cce44
1 Parent(s): 8e70ae0

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
+
+ ---
+ license: llama2
+ language:
+ - si
+ base_model: meta-llama/Llama-2-7b-hf
+ library_name: transformers
+ ---
+ # Llama2 7B for Sinhala: No vocabulary adaptation
+
+ This model is built on Llama2 7B and adapted to Sinhala using 30K target-language sentences sampled from CC-100.
+
+ ## Model Details
+
+ * **Vocabulary**: This model has no additional target vocabulary. It retains the original vocabulary of Llama2 7B.
+
+
+ ## Model Description
+
+ - **Language:** Sinhala
+ - **License:** Llama 2 Community License Agreement
+ - **Fine-tuned from model:** meta-llama/Llama-2-7b-hf
+
+
+ ## Model Sources
+
+ - **Repository:** https://github.com/gucci-j/lowres-cve
+ - **Paper:** https://arxiv.org/abs/2406.11477
+
+ ## How to Get Started with the Model
+ Use the code below to load the model and its tokenizer.
+ ```python
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+
+ # Load the adapted model weights from the Hugging Face Hub.
+ model = AutoModelForCausalLM.from_pretrained(
+     "atsuki-yamaguchi/Llama-2-7b-hf-si-30K-lapt"
+ )
+ # Load the matching tokenizer, which keeps the original Llama2 7B vocabulary.
+ tokenizer = AutoTokenizer.from_pretrained(
+     "atsuki-yamaguchi/Llama-2-7b-hf-si-30K-lapt"
+ )
+ ```
+
+
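The README's snippet stops after loading the model and tokenizer. As a minimal sketch of a next step (not part of the committed README), the helper below runs text generation with the standard `transformers` tokenize, `generate`, decode pattern. The function names `generate_text` and `load_and_generate` and the default `max_new_tokens=32` are assumptions made for illustration. The decoding strategy is whatever the checkpoint's generation config specifies.

```python
def generate_text(model, tokenizer, prompt, max_new_tokens=32):
    """Generate a continuation of `prompt` with a causal LM and its tokenizer."""
    # Tokenize the prompt into PyTorch tensors (input_ids and attention_mask).
    inputs = tokenizer(prompt, return_tensors="pt")
    # max_new_tokens caps the continuation length; other decoding settings
    # come from the model's generation config.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode the first (and only) sequence, dropping special tokens.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


def load_and_generate(prompt, repo_id="atsuki-yamaguchi/Llama-2-7b-hf-si-30K-lapt"):
    """Download the checkpoint (a 7B-parameter model) and generate from `prompt`."""
    # Imported here so that defining the helpers does not require transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model = AutoModelForCausalLM.from_pretrained(repo_id)
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    return generate_text(model, tokenizer, prompt)
```

Calling `load_and_generate` with a Sinhala prompt downloads the full checkpoint on first use, so it needs network access and enough memory for a 7B-parameter model. `generate_text` can be reused with an already loaded `model` and `tokenizer`.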