AIJapanese committed: Update README.md
**Moriyasu_Qwen2_JP_7B** is a large language model trained by Moriyasu. Based on [Qwen/Qwen2-7B](https://huggingface.co/Qwen/Qwen2-7B), it has been enhanced for Japanese usage through additional pre-training and instruction tuning.
# Training Datasets
### Pre-training dataset
The model is continually pre-trained on Japanese data from the Qwen2-7B base model while maintaining its English ability (80% Japanese, 20% English). We use about 120 billion tokens sampled from Japanese and English Wikipedia articles, Japanese CC-100, Japanese C4, Japanese OSCAR, The Pile, Webfined, Japanese websites, book data, mathematics, and code, among other sources.
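As a rough illustration of the 80/20 language mix, the sketch below samples a corpus group for each training document with those weights. The corpus groupings, names, and weights are assumptions made for this example only and do not describe the actual Moriyasu pre-training pipeline.

```python
import random

# Illustrative 80% Japanese / 20% English sampling mix for continual
# pre-training. The corpus groupings and weights are assumptions for
# demonstration, not the actual Moriyasu data pipeline.
CORPUS_WEIGHTS = {
    "japanese_web_books_code": 0.8,  # e.g. Japanese Wikipedia, CC-100, C4, OSCAR, books
    "english_pile_wiki": 0.2,        # e.g. English Wikipedia, The Pile
}

def pick_corpus(rng: random.Random) -> str:
    """Choose which corpus group the next training document is drawn from."""
    names = list(CORPUS_WEIGHTS)
    weights = [CORPUS_WEIGHTS[name] for name in names]
    return rng.choices(names, weights=weights, k=1)[0]

if __name__ == "__main__":
    rng = random.Random(42)
    counts = {name: 0 for name in CORPUS_WEIGHTS}
    for _ in range(10_000):
        counts[pick_corpus(rng)] += 1
    print(counts)  # roughly 8,000 vs. 2,000 draws, i.e. the stated 80/20 split
```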
### Instruction Tuning
We generated about 1 million instruction examples through a variety of methods, including synthetically generated data, translated data, and data manually annotated by humans.
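For illustration only, an instruction example of the kind described above might be stored as a chat-style record like the one below; the schema, field names, and sample content are assumptions, since the actual data format is not published.

```python
# Hypothetical instruction-tuning record. The schema ("messages", "origin")
# and the sample content are illustrative assumptions, not the released data.
instruction_example = {
    "messages": [
        {"role": "user", "content": "日本で一番高い山は何ですか？"},
        {"role": "assistant", "content": "日本で一番高い山は富士山で、標高は約3,776メートルです。"},
    ],
    "origin": "human_annotated",  # other values might be "generated" or "translated"
}
print(instruction_example["messages"][0]["content"])
```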
# Model Performance
### JGLUE tasks
# Decode the generated token IDs into text and print the model's reply.
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
```
# Contact: