jazzson commited on
Commit
7fd326f
1 Parent(s): 6994877

End of training

Browse files
Files changed (1) hide show
  1. README.md +6 -12
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 2.3204
20
 
21
  ## Model description
22
 
@@ -49,17 +49,11 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
- | 2.4837 | 0.3556 | 200 | 2.2732 |
53
- | 2.2305 | 0.7111 | 400 | 2.1807 |
54
- | 2.1064 | 1.0667 | 600 | 2.1585 |
55
- | 1.8998 | 1.4222 | 800 | 2.1417 |
56
- | 1.9278 | 1.7778 | 1000 | 2.1226 |
57
- | 1.7739 | 2.1333 | 1200 | 2.1940 |
58
- | 1.5948 | 2.4889 | 1400 | 2.1968 |
59
- | 1.6098 | 2.8444 | 1600 | 2.1953 |
60
- | 1.5028 | 3.2 | 1800 | 2.3265 |
61
- | 1.3777 | 3.5556 | 2000 | 2.3321 |
62
- | 1.3716 | 3.9111 | 2200 | 2.3204 |
63
 
64
 
65
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.5204
20
 
21
  ## Model description
22
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
+ | 2.4673 | 0.7105 | 200 | 2.3491 |
53
+ | 2.0993 | 1.4210 | 400 | 2.3077 |
54
+ | 1.8831 | 2.1314 | 600 | 2.3793 |
55
+ | 1.6486 | 2.8419 | 800 | 2.3754 |
56
+ | 1.4189 | 3.5524 | 1000 | 2.5204 |
 
 
 
 
 
 
57
 
58
 
59
  ### Framework versions