End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -116,7 +116,7 @@ xformers_attention: null
 This model is a fine-tuned version of [Maykeye/TinyLLama-v0](https://huggingface.co/Maykeye/TinyLLama-v0) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.4767
 ## Model description
@@ -153,8 +153,8 @@ The following hyperparameters were used during training:
 |:-------------:|:------:|:----:|:---------------:|
 | 10.7803       | 0.0055 | 1    | 10.8762         |
 | 10.5919       | 0.0166 | 3    | 10.8762         |
-| 10.8178       | 0.0332 | 6    | 10.7948         |
-| 10.8378       | 0.0499 | 9    | 10.4767         |
 ### Framework versions

 This model is a fine-tuned version of [Maykeye/TinyLLama-v0](https://huggingface.co/Maykeye/TinyLLama-v0) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 10.4596
 ## Model description
 |:-------------:|:------:|:----:|:---------------:|
 | 10.7803       | 0.0055 | 1    | 10.8762         |
 | 10.5919       | 0.0166 | 3    | 10.8762         |
+| 10.8141       | 0.0332 | 6    | 10.7893         |
+| 10.8235       | 0.0499 | 9    | 10.4596         |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -21,12 +21,12 @@
   "revision": null,
   "target_modules": [
     "gate_proj",
-    "o_proj",
-    "v_proj",
-    "down_proj",
     "q_proj",
     "k_proj",
-    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "revision": null,
   "target_modules": [
     "gate_proj",
+    "up_proj",
     "q_proj",
     "k_proj",
+    "o_proj",
+    "v_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d2aa8d36ef7ab6f0957f5faf3d1211df0a949fdac7e7e3b25d461b60f7a130a4
 size 793738

 version https://git-lfs.github.com/spec/v1
+oid sha256:77b841362d11686478355cf0ceeee738206902e0d668fb9873c2fa58b3f06178
 size 793738

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1178168b28c25756aa4281636b07615bd568deb155c023af2357b810c89b6bd8
 size 767856

 version https://git-lfs.github.com/spec/v1
+oid sha256:063267c323f826559daebddfb148a7f94fe2794e06e76b00be9adba90041236a
 size 767856

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dbdcc855d0f57473d8bd7ab766a229753c5c71be0fbef544d11bf2fb79cc11ff
 size 6712

 version https://git-lfs.github.com/spec/v1
+oid sha256:a9bb8d12ab7d3707b6887bbd0a3831cfac8994e15c07c849a86e5da758026626
 size 6712