Jlonge4/rag-rel

Files changed (5)

README.md +16 -16
adapter_config.json +3 -3
adapter_model.safetensors +1 -1
runs/Oct13_18-11-38_48c91bda07e0/events.out.tfevents.1728843099.48c91bda07e0.7389.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -16,12 +16,12 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/josh-longenecker1-groundedai/grounded-ai-rag-relevance/runs/c4r1ici4)
 # grounded-ai-rag-3
 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5609
 - Rouge1: 1.0
 - Rouge2: 0.0
 - Rougel: 1.0
@@ -59,20 +59,20 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
-| 0.8701        | 5.0   | 5    | 2.0856          | 0.0    | 0.0    | 0.0    | 0.0       |
-| 0.7972        | 10.0  | 10   | 1.8223          | 0.0    | 0.0    | 0.0    | 0.0       |
-| 0.7269        | 15.0  | 15   | 1.5054          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.6436        | 20.0  | 20   | 1.2845          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.5553        | 25.0  | 25   | 1.0245          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.43          | 30.0  | 30   | 0.6756          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.3252        | 35.0  | 35   | 0.4455          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.2419        | 40.0  | 40   | 0.3618          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.183         | 45.0  | 45   | 0.3606          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.0954        | 50.0  | 50   | 0.4148          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.0811        | 55.0  | 55   | 0.4816          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.0429        | 60.0  | 60   | 0.5086          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.0154        | 65.0  | 65   | 0.5523          | 1.0    | 0.0    | 1.0    | 1.0       |
-| 0.0059        | 70.0  | 70   | 0.5609          | 1.0    | 0.0    | 1.0    | 1.0       |
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/josh-longenecker1-groundedai/grounded-ai-rag-relevance/runs/bnvts1hx)
 # grounded-ai-rag-3
 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4248
 - Rouge1: 1.0
 - Rouge2: 0.0
 - Rougel: 1.0
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
+| 1.0354        | 5.0   | 5    | 1.7161          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.9393        | 10.0  | 10   | 1.5598          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.8098        | 15.0  | 15   | 1.3913          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.6939        | 20.0  | 20   | 1.2470          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.5662        | 25.0  | 25   | 1.0859          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.4027        | 30.0  | 30   | 0.9390          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.2595        | 35.0  | 35   | 0.8088          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.159         | 40.0  | 40   | 0.8187          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.1225        | 45.0  | 45   | 0.9043          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.0808        | 50.0  | 50   | 0.9985          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.0334        | 55.0  | 55   | 1.1227          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.0121        | 60.0  | 60   | 1.2501          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.0052        | 65.0  | 65   | 1.3779          | 1.0    | 0.0    | 1.0    | 1.0       |
+| 0.0043        | 70.0  | 70   | 1.4248          | 1.0    | 0.0    | 1.0    | 1.0       |
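
One way to read the updated table: validation loss reaches its minimum partway through training and then climbs while training loss keeps falling, which is the usual signature of overfitting. The sketch below only re-derives that from the numbers in the new rows. The helper name `best_checkpoint` is hypothetical and not part of this repository or of the Trainer.

```python
# (step, training_loss, validation_loss) copied from the "+" rows of the table above.
rows = [
    (5, 1.0354, 1.7161),
    (10, 0.9393, 1.5598),
    (15, 0.8098, 1.3913),
    (20, 0.6939, 1.2470),
    (25, 0.5662, 1.0859),
    (30, 0.4027, 0.9390),
    (35, 0.2595, 0.8088),
    (40, 0.159, 0.8187),
    (45, 0.1225, 0.9043),
    (50, 0.0808, 0.9985),
    (55, 0.0334, 1.1227),
    (60, 0.0121, 1.2501),
    (65, 0.0052, 1.3779),
    (70, 0.0043, 1.4248),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[2])


best_step, _, best_val = best_checkpoint(rows)
final_step, _, final_val = rows[-1]

print(f"lowest validation loss {best_val} at step {best_step}")
print(f"final validation loss {final_val} at step {final_step}")
```

The `Loss: 1.4248` reported in the summary matches the final step (70), not the lowest validation loss in the table.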
 ### Framework versions

adapter_config.json CHANGED

@@ -20,12 +20,12 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "gate_proj",
     "down_proj",
-    "o_proj",
-    "k_proj",
     "v_proj",
     "q_proj",
     "up_proj"
   ],
   "task_type": "CAUSAL_LM",

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "down_proj",
     "v_proj",
+    "gate_proj",
     "q_proj",
+    "k_proj",
+    "o_proj",
     "up_proj"
   ],
   "task_type": "CAUSAL_LM",
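
The `+3 -3` in this file appears to be a reordering only: the same seven module names occur on both sides. A minimal check, with both lists copied from the two sides of the hunk above (the variable names are illustrative):

```python
# target_modules as listed on the old ("-") side of the diff.
old_modules = [
    "gate_proj", "down_proj", "o_proj", "k_proj",
    "v_proj", "q_proj", "up_proj",
]

# target_modules as listed on the new ("+") side of the diff.
new_modules = [
    "down_proj", "v_proj", "gate_proj", "q_proj",
    "k_proj", "o_proj", "up_proj",
]

# Compare as sets so that ordering is ignored.
same_set = set(old_modules) == set(new_modules)
print("same modules, different order:", same_set and old_modules != new_modules)
```

The unchanged `adapter_model.safetensors` size (35668592 bytes) is consistent with this. A likely cause of the reordering is that the list is serialized from an unordered collection, though the diff itself does not confirm that.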

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46d81f1704af5191a00035d1ebb307355be532e4957979a99541bae09b431c3f
 size 35668592

 version https://git-lfs.github.com/spec/v1
+oid sha256:eee14e572e234fef48bd05e6751732a7be5fcc36a58cb029f509a042420f7e66
 size 35668592

runs/Oct13_18-11-38_48c91bda07e0/events.out.tfevents.1728843099.48c91bda07e0.7389.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2c0c29c146a3cd6bbc02901bd47f237a3bd213d826cef4c8365083d795d8380b
+size 29481

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ed5b5532d552effab3b24b02b0fc55c160197e04e517a48beff7c066ba691726
 size 5432

 version https://git-lfs.github.com/spec/v1
+oid sha256:0aaa190a3cb68a15b368829d3dff9cf308f2721e76ebc2b27b8b0ec9bc845f20
 size 5432
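
The binary files in this commit are shown as Git LFS pointer files: three `key value` lines giving the spec version, the object id, and the size in bytes. A minimal sketch of reading one, using the new `training_args.bin` pointer from above. `parse_lfs_pointer` is a hypothetical helper, not a Git LFS or Hugging Face API.

```python
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:0aaa190a3cb68a15b368829d3dff9cf308f2721e76ebc2b27b8b0ec9bc845f20
size 5432
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["digest"][:12], pointer["size"])
```

Because only the `oid` line differs between the old and new pointers while `size` stays at 5432, the diff shows that the file contents changed but not its length.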