AI-4-Health/HPP-FINETUNED-Meta-Llama-3-8B-Instruct

Files changed (5)

README.md CHANGED

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4403
+- Loss: 1.6596
 ## Model description
@@ -53,21 +53,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.8026        | 0.4162 | 50   | 1.7101          |
-| 1.5158        | 0.8325 | 100  | 1.5503          |
-| 1.437         | 1.2487 | 150  | 1.4967          |
-| 1.3731        | 1.6649 | 200  | 1.4697          |
-| 1.4355        | 2.0812 | 250  | 1.4455          |
-| 1.3037        | 2.4974 | 300  | 1.4299          |
-| 1.4995        | 2.9136 | 350  | 1.4190          |
-| 1.5689        | 3.3299 | 400  | 1.4199          |
-| 1.5008        | 3.7461 | 450  | 1.4122          |
-| 1.357         | 4.1623 | 500  | 1.4182          |
-| 1.3323        | 4.5786 | 550  | 1.4174          |
-| 1.1464        | 4.9948 | 600  | 1.4071          |
-| 1.3099        | 5.4110 | 650  | 1.4232          |
-| 1.2026        | 5.8273 | 700  | 1.4183          |
-| 1.2336        | 6.2435 | 750  | 1.4403          |
+| 1.7073        | 0.4162 | 50   | 1.5993          |
+| 1.4004        | 0.8325 | 100  | 1.4527          |
+| 1.3051        | 1.2487 | 150  | 1.4122          |
+| 1.2396        | 1.6649 | 200  | 1.3871          |
+| 1.2044        | 2.0812 | 250  | 1.3906          |
+| 1.1019        | 2.4974 | 300  | 1.3775          |
+| 1.2682        | 2.9136 | 350  | 1.3649          |
+| 1.1681        | 3.3299 | 400  | 1.4233          |
+| 1.1343        | 3.7461 | 450  | 1.4160          |
+| 0.7987        | 4.1623 | 500  | 1.4964          |
+| 0.8663        | 4.5786 | 550  | 1.5011          |
+| 0.7473        | 4.9948 | 600  | 1.4845          |
+| 0.7386        | 5.4110 | 650  | 1.5706          |
+| 0.61          | 5.8273 | 700  | 1.5695          |
+| 0.4689        | 6.2435 | 750  | 1.6596          |
 ### Framework versions
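
Reviewer note: the two results tables in this README diff can be compared by looking for the step with the lowest validation loss in each run. The following is a minimal sketch, assuming the rows are transcribed correctly from the removed (-) and added (+) tables; `best_checkpoint` and `overfit_gap` are hypothetical helper names and are not part of this repository.

```python
# Rows are (step, training_loss, validation_loss), copied from the diff above.
OLD_RUN = [  # removed (-) table
    (50, 1.8026, 1.7101), (100, 1.5158, 1.5503), (150, 1.437, 1.4967),
    (200, 1.3731, 1.4697), (250, 1.4355, 1.4455), (300, 1.3037, 1.4299),
    (350, 1.4995, 1.4190), (400, 1.5689, 1.4199), (450, 1.5008, 1.4122),
    (500, 1.357, 1.4182), (550, 1.3323, 1.4174), (600, 1.1464, 1.4071),
    (650, 1.3099, 1.4232), (700, 1.2026, 1.4183), (750, 1.2336, 1.4403),
]
NEW_RUN = [  # added (+) table
    (50, 1.7073, 1.5993), (100, 1.4004, 1.4527), (150, 1.3051, 1.4122),
    (200, 1.2396, 1.3871), (250, 1.2044, 1.3906), (300, 1.1019, 1.3775),
    (350, 1.2682, 1.3649), (400, 1.1681, 1.4233), (450, 1.1343, 1.4160),
    (500, 0.7987, 1.4964), (550, 0.8663, 1.5011), (600, 0.7473, 1.4845),
    (650, 0.7386, 1.5706), (700, 0.61, 1.5695), (750, 0.4689, 1.6596),
]


def best_checkpoint(rows):
    """Return the (step, train_loss, val_loss) row with the lowest validation loss."""
    return min(rows, key=lambda row: row[2])


def overfit_gap(rows):
    """Final validation loss minus the best validation loss seen during the run."""
    return rows[-1][2] - best_checkpoint(rows)[2]


if __name__ == "__main__":
    for name, rows in (("old", OLD_RUN), ("new", NEW_RUN)):
        step, _, val = best_checkpoint(rows)
        print(f"{name}: best step={step} val={val:.4f} "
              f"final val={rows[-1][2]:.4f} gap={overfit_gap(rows):.4f}")
```

On these numbers the new run reaches its lowest validation loss (1.3649) at step 350 and then degrades to 1.6596 at step 750 while training loss keeps falling, a pattern usually read as overfitting. The old run bottoms out at 1.4071 at step 600 and ends at 1.4403. The reported final loss therefore looks worse in the new run even though its best intermediate checkpoint is better.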

adapter_config.json CHANGED

@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
+    "down_proj",
+    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3d1e12a70fd36173ced19ee8461d02a1be7de1c5b43fdf0c7ec3594fc00da61
-size 27280152
+oid sha256:4b76654e3600fee393722e7c47549703279d66c347d34487b9ed9141ea1f4ada
+size 75514264
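
Reviewer note: the growth of the adapter from 27280152 to 75514264 bytes is consistent with the `target_modules` change from `q_proj`/`v_proj` to `up_proj`/`down_proj`. The sketch below estimates the LoRA weight size for both configurations. It rests on assumptions that this diff does not state: a LoRA rank of 16, float32 storage, and the published Meta-Llama-3-8B dimensions. `lora_bytes` is a hypothetical helper, not a PEFT function.

```python
# Assumptions (NOT taken from this diff): rank 16, float32 weights,
# and the Meta-Llama-3-8B layer dimensions listed here.
HIDDEN = 4096         # hidden_size
INTERMEDIATE = 14336  # intermediate_size (MLP width)
KV_DIM = 1024         # 8 key/value heads x 128 head_dim (grouped-query attention)
LAYERS = 32
RANK = 16
BYTES_PER_PARAM = 4   # float32

# module name -> (in_features, out_features)
SHAPES = {
    "q_proj": (HIDDEN, HIDDEN),
    "v_proj": (HIDDEN, KV_DIM),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_bytes(target_modules, rank=RANK):
    """Estimate the LoRA weight bytes: each adapted layer adds A (rank x in) and B (out x rank)."""
    per_layer = sum(rank * (SHAPES[m][0] + SHAPES[m][1]) for m in target_modules)
    return per_layer * LAYERS * BYTES_PER_PARAM


old_estimate = lora_bytes(["q_proj", "v_proj"])
new_estimate = lora_bytes(["up_proj", "down_proj"])

if __name__ == "__main__":
    # File sizes are the `size` fields of the Git LFS pointers in this diff.
    print("old:", old_estimate, "file:", 27280152, "remainder:", 27280152 - old_estimate)
    print("new:", new_estimate, "file:", 75514264, "remainder:", 75514264 - new_estimate)
```

Under these assumptions the estimates are 27262976 and 75497472 bytes, each about 17 KB below the recorded file size. A remainder of that size would be plausible for the safetensors header, but that is an inference and not something the diff confirms.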

runs/Jun14_06-28-00_jupiter/events.out.tfevents.1718317693.jupiter.387952.2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b46605443e75b2b0cfa162c6544f2ec81a76c275b0a4f6eb31943f7314085b7
+size 167788

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:51c6f0012d8dbcf6c404cdd940bb60a5816ad0c5830ce0b969d6ae893eb88541
+oid sha256:b60f91acef8675e753e5ce52be173bf501a019901152f7c03ea1acd58546f651
 size 4859
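
Reviewer note: the binary files in this commit appear only as Git LFS pointers, each with a `version` line, an `oid sha256:<digest>` line, and a `size` line. For `training_args.bin` the digest changed while the size stayed at 4859 bytes, so only the hash shows that the content differs. A minimal sketch of checking a downloaded file against its pointer follows, using only the standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of Git LFS or the Hugging Face tooling.

```python
import hashlib

# New pointer for training_args.bin, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b60f91acef8675e753e5ce52be173bf501a019901152f7c03ea1acd58546f651
size 4859
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and hash recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER)
    print(pointer["algo"], pointer["size"])
    # With the real file on disk, one could check it like this:
    # with open("training_args.bin", "rb") as fh:
    #     print(matches_pointer(fh.read(), pointer))
```

Checking the size first is a cheap early exit, but as this commit shows, an equal size does not imply equal content, so the hash comparison is the part that matters.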