pravsels commited on
Commit
d0b0e0c
1 Parent(s): c090fcd

End of training

Browse files
README.md CHANGED
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # deepseek-coder-6.7b-instruct-finetuned-manimation
15
 
16
- This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.1691
19
 
20
  ## Model description
21
 
@@ -48,14 +48,14 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | No log | 0.98 | 35 | 1.1884 |
52
- | No log | 1.99 | 71 | 1.1747 |
53
- | No log | 2.95 | 105 | 1.1691 |
54
 
55
 
56
  ### Framework versions
57
 
58
  - Transformers 4.35.2
59
  - Pytorch 2.1.0+cu121
60
- - Datasets 2.16.1
61
- - Tokenizers 0.15.1
 
13
 
14
  # deepseek-coder-6.7b-instruct-finetuned-manimation
15
 
16
+ This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.1297
19
 
20
  ## Model description
21
 
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | No log | 0.98 | 35 | 1.1468 |
52
+ | No log | 1.99 | 71 | 1.1349 |
53
+ | No log | 2.95 | 105 | 1.1297 |
54
 
55
 
56
  ### Framework versions
57
 
58
  - Transformers 4.35.2
59
  - Pytorch 2.1.0+cu121
60
+ - Datasets 2.17.0
61
+ - Tokenizers 0.15.2
adapter_config.json CHANGED
@@ -19,8 +19,9 @@
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
- "q_proj",
23
- "k_proj"
24
  ],
25
- "task_type": "CAUSAL_LM"
 
26
  }
 
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
+ "k_proj",
23
+ "q_proj"
24
  ],
25
+ "task_type": "CAUSAL_LM",
26
+ "use_rslora": false
27
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f6661a3f98141ed889a1fa9254da1b68cb90bc466957e0913d510170ff8a87fa
3
  size 134235048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:233ab9cc1ac9b72bab96838c6bb287d8b5867eb608faa83d38c0e6a2a9c9506e
3
  size 134235048
runs/Feb17_02-09-24_1e0c7eb2a20a/events.out.tfevents.1708135770.1e0c7eb2a20a.5539.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:952a94fc1833870c8b10e7a6b7713495f4a382ed96a43a1100037199665499b8
3
+ size 4184
runs/Feb17_02-12-21_1e0c7eb2a20a/events.out.tfevents.1708135942.1e0c7eb2a20a.7728.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e5ce71a4a45b6f5223388b567ce54706b39d43ff4d415316a5ced306929702f7
3
+ size 4812
runs/Feb17_02-16-17_1e0c7eb2a20a/events.out.tfevents.1708136177.1e0c7eb2a20a.9226.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fe7ddbe8eb783462f5900cf6caf340f468710fd8fa0264dcdd2a0f68895008be
3
+ size 8280
runs/Feb17_02-19-31_1e0c7eb2a20a/events.out.tfevents.1708136372.1e0c7eb2a20a.10253.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5611aee1d6433dbdcf5652c31f7d4b8cf7f95391705be5f8a6b0e3234ec0ea02
3
+ size 5958
runs/Feb17_02-19-31_1e0c7eb2a20a/events.out.tfevents.1708137548.1e0c7eb2a20a.10253.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4bdcd3dbdea40601abdd4cd6a2feec67f57404c79411ce5c67ace5d629a2f73
3
+ size 354
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3e8330024ee1e6c183e7fc2272bb1c5bc2bbfa2ce4010863fb8d6a356dc0b429
3
  size 4664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3b3ee3ae9949d9e254e8070f9f9f729db02d673f1ba91151de64b60837e50739
3
  size 4664