yosefw commited on
Commit
0955830
·
verified ·
1 Parent(s): 6bca670

End of training

Browse files
README.md CHANGED
@@ -16,13 +16,13 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - eval_loss: 4.3855
20
- - eval_model_preparation_time: 0.0013
21
- - eval_runtime: 14.0825
22
- - eval_samples_per_second: 743.05
23
- - eval_steps_per_second: 5.823
24
- - epoch: 1.1292
25
- - step: 9521
26
 
27
  ## Model description
28
 
@@ -42,8 +42,8 @@ More information needed
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 0.0003
45
- - train_batch_size: 128
46
- - eval_batch_size: 128
47
  - seed: 42
48
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
49
  - lr_scheduler_type: cosine
 
16
 
17
  This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - eval_loss: 3.9086
20
+ - eval_model_preparation_time: 0.0016
21
+ - eval_runtime: 16.8971
22
+ - eval_samples_per_second: 619.276
23
+ - eval_steps_per_second: 5.563
24
+ - epoch: 13.5067
25
+ - step: 130151
26
 
27
  ## Model description
28
 
 
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 0.0003
45
+ - train_batch_size: 112
46
+ - eval_batch_size: 112
47
  - seed: 42
48
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
49
  - lr_scheduler_type: cosine
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b0b4f47c6c19b7fcd15025ed45df083d19cafe45cd25f48b8908a63a3a92ad9
3
  size 826798720
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc9d06f25b7839d69de2d44fc4a6cfdb22d4028b359825727477246bb02836e4
3
  size 826798720
runs/Jan16_11-25-23_81d9b8b5d266/events.out.tfevents.1737026742.81d9b8b5d266.25767.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b93b6f9de348c9e9de91668dd7539eaca0edf4f2b8418a940126c78bbe2a2a6
3
- size 65564
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:83833ff100378b6807f73d36f032cc5d04186389790200c22ab7ee812b0a0483
3
+ size 65907