Phi3_French_Hypertokenizer_1HT / trainer_config.yaml
aindreias's picture
Upload 4 files
0b37518 verified
raw
history blame contribute delete
222 Bytes
cls: HF
base_tokenizer_path: microsoft/Phi-3-mini-128k-instruct
dataset:
path: allenai/c4
data_dir: fr
name: c4_fr
split: train
column: text
target_num_hyper_token: 1
batch_size: 1000
total_training_size: 100000