pkarypis commited on
Commit
369edcb
·
1 Parent(s): 1699804

Model save

Browse files
README.md CHANGED
@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the generator dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.9543
23
 
24
  ## Model description
25
 
@@ -54,7 +54,7 @@ The following hyperparameters were used during training:
54
 
55
  | Training Loss | Epoch | Step | Validation Loss |
56
  |:-------------:|:-----:|:----:|:---------------:|
57
- | 1.1405 | 1.0 | 193 | 0.9543 |
58
 
59
 
60
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the generator dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.9542
23
 
24
  ## Model description
25
 
 
54
 
55
  | Training Loss | Epoch | Step | Validation Loss |
56
  |:-------------:|:-----:|:----:|:---------------:|
57
+ | 1.1405 | 1.0 | 193 | 0.9542 |
58
 
59
 
60
  ### Framework versions
all_results.json CHANGED
@@ -1,13 +1,13 @@
1
  {
2
  "epoch": 1.0,
3
- "eval_loss": 0.9543360471725464,
4
- "eval_runtime": 85.7357,
5
  "eval_samples": 23110,
6
- "eval_samples_per_second": 179.983,
7
- "eval_steps_per_second": 0.362,
8
- "train_loss": 1.0067575539949645,
9
- "train_runtime": 2177.9249,
10
  "train_samples": 155898,
11
- "train_samples_per_second": 45.206,
12
  "train_steps_per_second": 0.089
13
  }
 
1
  {
2
  "epoch": 1.0,
3
+ "eval_loss": 0.9541811347007751,
4
+ "eval_runtime": 85.9067,
5
  "eval_samples": 23110,
6
+ "eval_samples_per_second": 179.625,
7
+ "eval_steps_per_second": 0.361,
8
+ "train_loss": 1.006550097712581,
9
+ "train_runtime": 2174.1063,
10
  "train_samples": 155898,
11
+ "train_samples_per_second": 45.285,
12
  "train_steps_per_second": 0.089
13
  }
eval_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 1.0,
3
- "eval_loss": 0.9543360471725464,
4
- "eval_runtime": 85.7357,
5
  "eval_samples": 23110,
6
- "eval_samples_per_second": 179.983,
7
- "eval_steps_per_second": 0.362
8
  }
 
1
  {
2
  "epoch": 1.0,
3
+ "eval_loss": 0.9541811347007751,
4
+ "eval_runtime": 85.9067,
5
  "eval_samples": 23110,
6
+ "eval_samples_per_second": 179.625,
7
+ "eval_steps_per_second": 0.361
8
  }
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:350e4a318109d37074d40112c538df57b144e6f65383c189cf2dab8aaccf1fd4
3
  size 4943162336
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4593d8d7034a7dcf2ddb88980cc8d8197643605e29f4fff2e8d3bf057151df89
3
  size 4943162336
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5a8e67d16506823430ce78caacb5fb2448a1076fc663f6278c0215536b4b00d6
3
  size 4999819336
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a87a203ff5688371b5948e44cf861d46e84e8d5f2b4e26c68acef59e7a94dd0e
3
  size 4999819336
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e80cf9f2a771359d9c7222f890c7c79dc694f9943c94e88dc4d3ca344f22aba9
3
  size 4540516344
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:969e722e1313c32065d00aad68022c57de010c9825cd6363d458510e3ed74907
3
  size 4540516344
runs/Dec29_19-23-52_aga39/events.out.tfevents.1703899471.aga39.210729.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9f652ca6b40b7f48eddaff070204c073af5b35680aa22d069a0b6ceb12b38864
3
+ size 5036
runs/Dec29_19-23-52_aga39/events.out.tfevents.1703901731.aga39.210729.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f9ad0755cdb2d6a9a6310a7bc0b88e6aad6c3dd213e442d97005bcf1a1f4b467
3
+ size 359
train_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 1.0,
3
- "train_loss": 1.0067575539949645,
4
- "train_runtime": 2177.9249,
5
  "train_samples": 155898,
6
- "train_samples_per_second": 45.206,
7
  "train_steps_per_second": 0.089
8
  }
 
1
  {
2
  "epoch": 1.0,
3
+ "train_loss": 1.006550097712581,
4
+ "train_runtime": 2174.1063,
5
  "train_samples": 155898,
6
+ "train_samples_per_second": 45.285,
7
  "train_steps_per_second": 0.089
8
  }
trainer_state.json CHANGED
@@ -16,19 +16,19 @@
16
  },
17
  {
18
  "epoch": 1.0,
19
- "eval_loss": 0.9543360471725464,
20
- "eval_runtime": 87.6817,
21
- "eval_samples_per_second": 175.989,
22
- "eval_steps_per_second": 0.354,
23
  "step": 193
24
  },
25
  {
26
  "epoch": 1.0,
27
  "step": 193,
28
  "total_flos": 323282188369920.0,
29
- "train_loss": 1.0067575539949645,
30
- "train_runtime": 2177.9249,
31
- "train_samples_per_second": 45.206,
32
  "train_steps_per_second": 0.089
33
  }
34
  ],
 
16
  },
17
  {
18
  "epoch": 1.0,
19
+ "eval_loss": 0.9541811347007751,
20
+ "eval_runtime": 86.189,
21
+ "eval_samples_per_second": 179.037,
22
+ "eval_steps_per_second": 0.36,
23
  "step": 193
24
  },
25
  {
26
  "epoch": 1.0,
27
  "step": 193,
28
  "total_flos": 323282188369920.0,
29
+ "train_loss": 1.006550097712581,
30
+ "train_runtime": 2174.1063,
31
+ "train_samples_per_second": 45.285,
32
  "train_steps_per_second": 0.089
33
  }
34
  ],
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:17e491fd8a409290c3bd2bb4ca6078c439537e15854619dac636ef22bde10b2c
3
  size 5179
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d148946a22e9dea233ad66b2853eb57f064cfefbbea352308a432572a635b247
3
  size 5179