DeepDream2045/ba808ad5-6897-453a-96e8-65d7ed43d9f8

DeepDream2045 committed 14 days ago

Commit 38147ca • 1 Parent(s): 0dfe125

End of training

Files changed (2)

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED

@@ -105,7 +105,7 @@ xformers_attention: true
 This model is a fine-tuned version of [VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct](https://huggingface.co/VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0385
+- Loss: 1.0388
 ## Model description
@@ -142,9 +142,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.345         | 0.0396 | 1    | 1.3288          |
-| 1.1298        | 0.9901 | 25   | 1.0567          |
-| 1.1334        | 1.9802 | 50   | 1.0385          |
+| 1.3446        | 0.0396 | 1    | 1.3286          |
+| 1.1299        | 0.9901 | 25   | 1.0569          |
+| 1.1338        | 1.9802 | 50   | 1.0388          |
 ### Framework versions

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fa42c00d4a5e58fa8763eece2c8f7580b2dccb0dc536ad427bbecc40495811e1
+oid sha256:0cccfb581f012f53193e22c43d90c0e3311a69322adb12d475b701fd335a6b36
 size 335706186
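
The `adapter_model.bin` change replaces only the Git LFS pointer: the size is unchanged and the SHA-256 object ID is new. Below is a minimal sketch of how one might check a locally downloaded copy of the file against that pointer. It assumes the file has already been fetched to disk. The helper names `parse_lfs_pointer` and `verify_lfs_object` are hypothetical and not part of Git LFS or any Hugging Face library.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the text of a Git LFS pointer file into version, oid, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return {
        "version": fields["version"],
        "oid": digest,
        "size": int(fields["size"]),
    }


def verify_lfs_object(path, pointer):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    sha = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in 1 MiB chunks so a ~335 MB adapter is never loaded whole.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["oid"]


# Pointer contents on the new side of this commit's diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:0cccfb581f012f53193e22c43d90c0e3311a69322adb12d475b701fd335a6b36
size 335706186
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

With a local copy in place, `verify_lfs_object("adapter_model.bin", pointer)` (the path is an assumption) would return `True` only if both the byte count and the digest match the new pointer.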