• Using opencsg/csg-wukong-2b-chinese-fineweb-edu as base model, we fine-tune it on smoltalk-chinese for 2 epoch
  • learning rate = 3e-4 ; global batch size = 32 ; lr scheduler=cosine
Downloads last month
15
Safetensors
Model size
2.17B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for opencsg/csg-wukong-2b-smoltalk-chinese

Finetuned
(1)
this model
Finetunes
2 models

Dataset used to train opencsg/csg-wukong-2b-smoltalk-chinese

Collection including opencsg/csg-wukong-2b-smoltalk-chinese