Commit 91cb5cb (verified), committed by b09501048
1 parent: f75f12d

End of training

Files changed (1): README.md (+13 -0)
@@ -18,6 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 # ADL_Gemma6

 This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.2804

 ## Model description

@@ -44,6 +46,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine_with_restarts
 - num_epochs: 5

+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.35          | 1.0   | 1250 | 1.2978          |
+| 1.3365        | 2.0   | 2500 | 1.2999          |
+| 1.3023        | 3.0   | 3750 | 1.2839          |
+| 1.3025        | 4.0   | 5000 | 1.2844          |
+| 1.2701        | 5.0   | 6250 | 1.2804          |
+
+
 ### Framework versions

 - PEFT 0.13.2
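
To make the numbers in the Training results table easier to interpret, here is a minimal Python sketch that picks the checkpoint with the lowest validation loss and converts each loss to a perplexity. It assumes the reported losses are mean per-token cross-entropy values in nats, which the model card does not state explicitly; the names `results` and `perplexity` are illustrative and not part of the commit.

```python
import math

# Rows copied from the Training results table:
# (epoch, step, training_loss, validation_loss)
results = [
    (1.0, 1250, 1.35, 1.2978),
    (2.0, 2500, 1.3365, 1.2999),
    (3.0, 3750, 1.3023, 1.2839),
    (4.0, 5000, 1.3025, 1.2844),
    (5.0, 6250, 1.2701, 1.2804),
]


def perplexity(loss: float) -> float:
    """Convert a loss to perplexity.

    Assumption: the loss is a mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


# The epoch with the lowest validation loss.
best = min(results, key=lambda row: row[3])

# Steps per epoch, derived from the first row of the table.
steps_per_epoch = int(results[0][1] / results[0][0])

for epoch, step, train_loss, val_loss in results:
    print(f"epoch {epoch:.0f} (step {step}): "
          f"val loss {val_loss:.4f}, perplexity {perplexity(val_loss):.3f}")

print(f"best epoch: {best[0]:.0f}, steps per epoch: {steps_per_epoch}")
```

Under that assumption, the final validation loss of 1.2804 corresponds to a perplexity of roughly 3.6, and the validation loss changes by less than 0.02 across the five epochs.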