coinplusfire/coinplusfire_llm_2

Browse files

Files changed (3) hide show

README.md +22 -12
runs/Apr17_02-10-11_2cf8682266e1/events.out.tfevents.1713319812.2cf8682266e1.709.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3706
 ## Model description
@@ -44,23 +44,33 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.2985        | 0.99  | 51   | 1.8297          |
-| 1.6179        | 1.99  | 103  | 1.6461          |
-| 1.4812        | 3.0   | 155  | 1.5621          |
-| 1.3993        | 4.0   | 207  | 1.5078          |
-| 1.3646        | 4.99  | 258  | 1.4690          |
-| 1.2914        | 5.99  | 310  | 1.4270          |
-| 1.2542        | 7.0   | 362  | 1.4022          |
-| 1.2263        | 8.0   | 414  | 1.3832          |
-| 1.2279        | 8.99  | 465  | 1.3720          |
-| 1.1812        | 9.86  | 510  | 1.3706          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1450
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.2919        | 0.99  | 51   | 1.8319          |
+| 1.6082        | 1.99  | 103  | 1.6426          |
+| 1.4689        | 3.0   | 155  | 1.5522          |
+| 1.3821        | 4.0   | 207  | 1.4883          |
+| 1.3406        | 4.99  | 258  | 1.4421          |
+| 1.2592        | 5.99  | 310  | 1.3900          |
+| 1.2115        | 7.0   | 362  | 1.3508          |
+| 1.1705        | 8.0   | 414  | 1.3213          |
+| 1.1555        | 8.99  | 465  | 1.2913          |
+| 1.1031        | 9.99  | 517  | 1.2629          |
+| 1.0727        | 11.0  | 569  | 1.2418          |
+| 1.0481        | 12.0  | 621  | 1.2208          |
+| 1.0466        | 12.99 | 672  | 1.1971          |
+| 1.006         | 13.99 | 724  | 1.1864          |
+| 0.989         | 15.0  | 776  | 1.1732          |
+| 0.9719        | 16.0  | 828  | 1.1589          |
+| 0.979         | 16.99 | 879  | 1.1535          |
+| 0.9494        | 17.99 | 931  | 1.1469          |
+| 0.9401        | 19.0  | 983  | 1.1449          |
+| 0.9302        | 19.71 | 1020 | 1.1450          |
 ### Framework versions

runs/Apr17_02-10-11_2cf8682266e1/events.out.tfevents.1713319812.2cf8682266e1.709.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ff0e4e5deabc2788c0fc3e5e5c234c110cff73ec0dd9d510bb48e89afeea0e2f
+size 15207

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:449d1160346d35d092c056d4ce219dab2f49067dc87dce42020a43fa8d3bf98a
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:56c8bbbdaafc013ef0e09284808f7a446e06bfb5403b766b09d22327cfc345f2
 size 4920