kanishka/smolm-autoreg-bpe-seed_888

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [models/smolm-autoreg-bpe-seed_888/config.json](https://huggingface.co/models/smolm-autoreg-bpe-seed_888/config.json) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.2878
-- Accuracy: 0.5417
+- Loss: 2.2831
+- Accuracy: 0.5426
 ## Model description
@@ -39,7 +39,7 @@ The following hyperparameters were used during training:
 - learning_rate: 0.003
 - train_batch_size: 64
 - eval_batch_size: 512
-- seed: 42
+- seed: 888
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 24000
@@ -49,16 +49,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 5.8796        | 1.0   | 826  | 3.1083          | 0.4611   |
-| 2.802         | 2.0   | 1652 | 2.7455          | 0.4965   |
-| 2.6268        | 3.0   | 2478 | 2.5734          | 0.5116   |
-| 2.4165        | 4.0   | 3304 | 2.4667          | 0.5211   |
-| 2.2892        | 5.0   | 4130 | 2.3949          | 0.5288   |
-| 2.2315        | 6.0   | 4956 | 2.3446          | 0.5338   |
-| 2.1587        | 7.0   | 5782 | 2.3208          | 0.5374   |
-| 2.1253        | 8.0   | 6608 | 2.3044          | 0.5394   |
-| 2.0858        | 9.0   | 7434 | 2.2940          | 0.5404   |
-| 2.0556        | 10.0  | 8260 | 2.2878          | 0.5417   |
+| 5.8023        | 1.0   | 826  | 3.1092          | 0.4621   |
+| 2.7942        | 2.0   | 1652 | 2.7389          | 0.4975   |
+| 2.625         | 3.0   | 2478 | 2.5701          | 0.5110   |
+| 2.412         | 4.0   | 3304 | 2.4619          | 0.5223   |
+| 2.2885        | 5.0   | 4130 | 2.3940          | 0.5288   |
+| 2.2294        | 6.0   | 4956 | 2.3464          | 0.5342   |
+| 2.16          | 7.0   | 5782 | 2.3206          | 0.5372   |
+| 2.1272        | 8.0   | 6608 | 2.3046          | 0.5395   |
+| 2.0865        | 9.0   | 7434 | 2.2912          | 0.5405   |
+| 2.0577        | 10.0  | 8260 | 2.2831          | 0.5426   |
 ### Framework versions
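
The hyperparameters in this diff list a linear scheduler with 24000 warmup steps, while the results table ends at step 8260 (826 steps per epoch over 10 epochs). The following is a minimal sketch of what learning rate that combination would imply at each logged step. It assumes the common linear-warmup-then-linear-decay rule and assumes training stops at the last logged step; neither is confirmed by the model card. The function name `linear_schedule_lr` is hypothetical.

```python
def linear_schedule_lr(step, base_lr=0.003, warmup_steps=24000, total_steps=8260):
    """Learning rate at `step` under linear warmup followed by linear decay.

    Assumption: warmup ramps from 0 to base_lr over `warmup_steps`, then the
    rate decays linearly to 0 at `total_steps`. Defaults are copied from the
    model card; `total_steps` is inferred from the last row of the table.
    """
    if step < warmup_steps:
        # Still warming up: scale base_lr by the fraction of warmup completed.
        return base_lr * step / max(1, warmup_steps)
    # Past warmup: decay linearly toward zero, never going negative.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the end of each epoch (826 steps per epoch in the table).
for epoch in range(1, 11):
    step = epoch * 826
    print(f"epoch {epoch:2d}  step {step:5d}  lr {linear_schedule_lr(step):.7f}")
```

Under these assumptions the run would end while still in warmup, so the configured peak of 0.003 would never be reached. Whether the actual run behaved this way depends on the trainer settings, which the diff does not show.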

pytorch_model.bin

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:debb02c658f2e0a53ae5965aa9a7b822d4ff985d152e679a328ce01acba3215f
+oid sha256:b77e680a6919e6aae3574cb09673f0afb708a17917d31ad9e517159566658d4a
 size 33843613
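
The pointer above records only a SHA-256 digest and a byte size, so a downloaded `pytorch_model.bin` can be checked against it locally. The sketch below is one way to do that with the Python standard library. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is an assumption about where the weights were saved.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the `key value` lines of a Git LFS pointer file into a dict."""
    fields = dict(
        line.strip().split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Hash in chunks so the whole file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New-side pointer from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b77e680a6919e6aae3574cb09673f0afb708a17917d31ad9e517159566658d4a
size 33843613
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])

# Assumed local path; adjust to wherever the weights were downloaded.
if os.path.exists("pytorch_model.bin"):
    print("matches pointer:", matches_pointer("pytorch_model.bin", pointer))
```

Since both sides of the diff report the same size (33843613 bytes), the size check alone cannot tell the old and new weights apart. Only the digest can.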