lordjia commited on
Commit
fe95a2e
1 Parent(s): 2f0a4dd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -1
README.md CHANGED
@@ -11,4 +11,53 @@ tags:
11
  - Cantonese
12
  - chat
13
  - Llama3
14
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  - Cantonese
12
  - chat
13
  - Llama3
14
+ ---
15
+
16
+ # Llama-3-Cantonese-8B-Instruct
17
+
18
+ ## Model Overview
19
+
20
+ Llama-3-Cantonese-8B-Instruct is a Cantonese language model based on Meta-Llama-3-8B-Instruct, fine-tuned using LoRA. It aims to enhance Cantonese text generation and comprehension capabilities, supporting various tasks such as dialogue generation, text summarization, and question-answering.
21
+
22
+ ## Model Features
23
+
24
+ - **Base Model**: Meta-Llama-3-8B-Instruct
25
+ - **Fine-tuning Method**: LoRA instruction tuning
26
+ - **Training Steps**: 4562 steps
27
+ - **Primary Language**: Cantonese
28
+ - **Datasets**:
29
+ - [jed351/cantonese-wikipedia](https://huggingface.co/datasets/jed351/cantonese-wikipedia)
30
+ - [lordjia/Cantonese_English_Translation](https://huggingface.co/datasets/lordjia/Cantonese_English_Translation)
31
+ - **Training Tools**: [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)
32
+
33
+ ## Usage
34
+
35
+ You can easily load and use this model with Hugging Face's Transformers library. Here is a simple example:
36
+
37
+ ```python
38
+ from transformers import AutoModelForCausalLM, AutoTokenizer
39
+
40
+ tokenizer = AutoTokenizer.from_pretrained("lordjia/Llama-3-Cantonese-8B-Instruct")
41
+ model = AutoModelForCausalLM.from_pretrained("lordjia/Llama-3-Cantonese-8B-Instruct")
42
+
43
+ input_text = "唔該你用廣東話講下你係邊個。"
44
+ inputs = tokenizer(input_text, return_tensors="pt")
45
+ outputs = model.generate(**inputs)
46
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
47
+ ```
48
+
49
+ ## Quantized Version
50
+
51
+ A 4-bit quantized version of this model is also available: [llama3-cantonese-8b-instruct-q4_0.gguf](https://huggingface.co/lordjia/Llama-3-Cantonese-8B-Instruct/blob/main/llama3-cantonese-8b-instruct-q4_0.gguf).
52
+
53
+ ## License
54
+
55
+ This model is licensed under the Llama 3 Community License. Please review the terms before use.
56
+
57
+ ## Contributors
58
+
59
+ - LordJia
60
+
61
+ ## Acknowledgements
62
+
63
+ Thanks to Hugging Face for providing the platform and tools, and to all the developers and researchers contributing to the open-source community.