tokyo-electron-device-ai committed
Commit eece17e
1 Parent(s): 42b973c
Update README.md
README.md CHANGED
@@ -60,10 +60,10 @@ We follow the approach described in [Bilingual Adaptation of Monolingual Foundat
 ### Training data

 This model was continuously trained on 173B tokens, with the training data consisting of 20% English and 80% Japanese. The raw Japanese data was filtered using scripts from the [llm-jp-corpus repository](https://github.com/llm-jp/llm-jp-corpus). The following Japanese datasets were included in the training data mixture:

-
-
-
-
+* **[legacy-datasets/mc4](https://huggingface.co/datasets/legacy-datasets/mc4)**
+* **[range3/cc100-ja](https://huggingface.co/datasets/range3/cc100-ja)**
+* **[if001/oscar_2023_filtered](https://huggingface.co/datasets/if001/oscar_2023_filtered)**
+* **[dumps.wikimedia.org](https://dumps.wikimedia.org/)**
 * Note that this released model was trained exclusively on open-source datasets. We also trained models using proprietary domain-specific data, but there are no plans to release those models.

 ### Hyper-parameters
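
All four Japanese sources added in this commit can be streamed with the `datasets` library (the Wikipedia dumps via a Hub mirror). The sketch below shows one way to interleave an approximate 20% English / 80% Japanese mixture matching the ratio stated in the README; at 173B total tokens that ratio works out to roughly 34.6B English and 138.4B Japanese tokens. The English corpus (`allenai/c4`), the `wikimedia/wikipedia` config used in place of the raw dumps, the shared `text` column, and the seed are illustrative assumptions not taken from the commit, and the llm-jp-corpus filtering step is not reproduced here.

```python
from datasets import load_dataset, interleave_datasets

# Japanese corpora listed in this commit, streamed so nothing is downloaded up front.
# Older script-based datasets (e.g. legacy-datasets/mc4) may additionally require
# trust_remote_code=True depending on your `datasets` version.
ja_sources = [
    load_dataset("legacy-datasets/mc4", "ja", split="train", streaming=True),
    load_dataset("range3/cc100-ja", split="train", streaming=True),
    load_dataset("if001/oscar_2023_filtered", split="train", streaming=True),
    # Assumed Hub mirror of the dumps.wikimedia.org data referenced in the README.
    load_dataset("wikimedia/wikipedia", "20231101.ja", split="train", streaming=True),
]
# Keep only the text column so the sources can be interleaved (assumes each exposes "text").
ja_sources = [ds.select_columns(["text"]) for ds in ja_sources]

# Hypothetical English corpus -- the commit does not name the English data.
en_source = load_dataset("allenai/c4", "en", split="train", streaming=True).select_columns(["text"])

# Merge the Japanese corpora, then sample examples at roughly 20% English / 80% Japanese.
# This is an example-level ratio; the README's 20/80 split is presumably measured in tokens.
ja_mixture = interleave_datasets(ja_sources, seed=42)
train_mixture = interleave_datasets([en_source, ja_mixture], probabilities=[0.2, 0.8], seed=42)

for example in train_mixture.take(3):
    print(example["text"][:200])
```

In an actual continued pre-training run the mixture would still need to be tokenized and packed before reaching the trainer; the interleave step above only illustrates the stated sampling ratio.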