End of training

Files changed (9) hide show

README.md CHANGED Viewed

@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # deepseek-coder-6.7b-instruct-finetuned-manimation
-This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1691
 ## Model description
@@ -48,14 +48,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.98  | 35   | 1.1884          |
-| No log        | 1.99  | 71   | 1.1747          |
-| No log        | 2.95  | 105  | 1.1691          |
 ### Framework versions
 - Transformers 4.35.2
 - Pytorch 2.1.0+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.1

 # deepseek-coder-6.7b-instruct-finetuned-manimation
+This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1297
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.98  | 35   | 1.1468          |
+| No log        | 1.99  | 71   | 1.1349          |
+| No log        | 2.95  | 105  | 1.1297          |
 ### Framework versions
 - Transformers 4.35.2
 - Pytorch 2.1.0+cu121
+- Datasets 2.17.0
+- Tokenizers 0.15.2

adapter_config.json CHANGED Viewed

@@ -19,8 +19,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "k_proj"
   ],
-  "task_type": "CAUSAL_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "k_proj",
+    "q_proj"
   ],
+  "task_type": "CAUSAL_LM",
+  "use_rslora": false
 }

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f6661a3f98141ed889a1fa9254da1b68cb90bc466957e0913d510170ff8a87fa
 size 134235048

 version https://git-lfs.github.com/spec/v1
+oid sha256:233ab9cc1ac9b72bab96838c6bb287d8b5867eb608faa83d38c0e6a2a9c9506e
 size 134235048

runs/Feb17_02-09-24_1e0c7eb2a20a/events.out.tfevents.1708135770.1e0c7eb2a20a.5539.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:952a94fc1833870c8b10e7a6b7713495f4a382ed96a43a1100037199665499b8
+size 4184

runs/Feb17_02-12-21_1e0c7eb2a20a/events.out.tfevents.1708135942.1e0c7eb2a20a.7728.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e5ce71a4a45b6f5223388b567ce54706b39d43ff4d415316a5ced306929702f7
+size 4812

runs/Feb17_02-16-17_1e0c7eb2a20a/events.out.tfevents.1708136177.1e0c7eb2a20a.9226.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:fe7ddbe8eb783462f5900cf6caf340f468710fd8fa0264dcdd2a0f68895008be
+size 8280

runs/Feb17_02-19-31_1e0c7eb2a20a/events.out.tfevents.1708136372.1e0c7eb2a20a.10253.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5611aee1d6433dbdcf5652c31f7d4b8cf7f95391705be5f8a6b0e3234ec0ea02
+size 5958

runs/Feb17_02-19-31_1e0c7eb2a20a/events.out.tfevents.1708137548.1e0c7eb2a20a.10253.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d4bdcd3dbdea40601abdd4cd6a2feec67f57404c79411ce5c67ace5d629a2f73
+size 354

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e8330024ee1e6c183e7fc2272bb1c5bc2bbfa2ce4010863fb8d6a356dc0b429
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:3b3ee3ae9949d9e254e8070f9f9f729db02d673f1ba91151de64b60837e50739
 size 4664