sanchit-gandhi HF staff committed on
Commit
0354e7d
1 Parent(s): b703d56

Model save

README.md ADDED
@@ -0,0 +1,75 @@
1
+ ---
2
+ base_model: sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat
3
+ tags:
4
+ - trl
5
+ - dpo
6
+ - generated_from_trainer
7
+ model-index:
8
+ - name: sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat
16
+
17
+ This model is a fine-tuned version of [sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat](https://huggingface.co/sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 0.6412
20
+ - Rewards/chosen: -0.1044
21
+ - Rewards/rejected: -0.2494
22
+ - Rewards/accuracies: 0.6445
23
+ - Rewards/margins: 0.1450
24
+ - Logps/rejected: -429.4582
25
+ - Logps/chosen: -433.6304
26
+ - Logits/rejected: -3.2047
27
+ - Logits/chosen: -3.2544
28
+
29
+ ## Model description
30
+
31
+ More information needed
32
+
33
+ ## Intended uses & limitations
34
+
35
+ More information needed
36
+
37
+ ## Training and evaluation data
38
+
39
+ More information needed
40
+
41
+ ## Training procedure
42
+
43
+ ### Training hyperparameters
44
+
45
+ The following hyperparameters were used during training:
46
+ - learning_rate: 5e-07
47
+ - train_batch_size: 8
48
+ - eval_batch_size: 8
49
+ - seed: 42
50
+ - distributed_type: multi-GPU
51
+ - num_devices: 8
52
+ - gradient_accumulation_steps: 2
53
+ - total_train_batch_size: 128
54
+ - total_eval_batch_size: 64
55
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
56
+ - lr_scheduler_type: cosine
57
+ - lr_scheduler_warmup_ratio: 0.1
58
+ - num_epochs: 1
59
+
60
+ ### Training results
61
+
62
+ | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
63
+ |:-------------:|:------:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
64
+ | 0.6795 | 0.2092 | 100 | 0.6759 | 0.0017 | -0.0328 | 0.6289 | 0.0345 | -407.8037 | -423.0197 | -3.2565 | -3.3136 |
65
+ | 0.6584 | 0.4184 | 200 | 0.6534 | -0.0666 | -0.1617 | 0.6445 | 0.0951 | -420.6952 | -429.8561 | -3.2240 | -3.2768 |
66
+ | 0.6494 | 0.6276 | 300 | 0.6438 | -0.1077 | -0.2410 | 0.6211 | 0.1333 | -428.6237 | -433.9640 | -3.2050 | -3.2553 |
67
+ | 0.6428 | 0.8368 | 400 | 0.6415 | -0.1001 | -0.2437 | 0.6211 | 0.1436 | -428.8884 | -433.2000 | -3.2046 | -3.2543 |
68
+
69
+
70
+ ### Framework versions
71
+
72
+ - Transformers 4.40.1
73
+ - PyTorch 2.2.2+cu121
74
+ - Datasets 2.19.0
75
+ - Tokenizers 0.19.1
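The batch-size entries in the hyperparameter list of README.md are consistent with one another. The following is a minimal sketch of the arithmetic, assuming the common Trainer convention that the total training batch is the per-device batch times the number of devices times the gradient accumulation steps. The variable names are illustrative and do not come from the training script.

```python
# Values copied from the "Training hyperparameters" list in README.md.
train_batch_size = 8             # per-device training batch size
eval_batch_size = 8              # per-device evaluation batch size
num_devices = 8
gradient_accumulation_steps = 2

# Assumed convention: one optimizer step consumes this many samples.
total_train_batch_size = train_batch_size * num_devices * gradient_accumulation_steps
# Evaluation has no gradient accumulation, so only the device count multiplies.
total_eval_batch_size = eval_batch_size * num_devices

print(total_train_batch_size)  # 128
print(total_eval_batch_size)   # 64
```

Both results match the `total_train_batch_size: 128` and `total_eval_batch_size: 64` entries reported in the model card.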
all_results.json ADDED
@@ -0,0 +1,22 @@
1
+ {
2
+ "epoch": 1.0,
3
+ "eval_logits/chosen": -3.25435733795166,
4
+ "eval_logits/rejected": -3.204742431640625,
5
+ "eval_logps/chosen": -433.63043212890625,
6
+ "eval_logps/rejected": -429.45819091796875,
7
+ "eval_loss": 0.6412035226821899,
8
+ "eval_rewards/accuracies": 0.64453125,
9
+ "eval_rewards/chosen": -0.10438449680805206,
10
+ "eval_rewards/margins": 0.14497718214988708,
11
+ "eval_rewards/rejected": -0.24936166405677795,
12
+ "eval_runtime": 8.2917,
13
+ "eval_samples": 2000,
14
+ "eval_samples_per_second": 241.204,
15
+ "eval_steps_per_second": 3.859,
16
+ "total_flos": 0.0,
17
+ "train_loss": 0.6571792745689967,
18
+ "train_runtime": 784.6622,
19
+ "train_samples": 61135,
20
+ "train_samples_per_second": 77.913,
21
+ "train_steps_per_second": 0.609
22
+ }
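The step count and throughput figures in all_results.json can be cross-checked against each other. The sketch below assumes that one optimizer step consumes 128 samples (the `total_train_batch_size` from README.md) and that throughput is simply samples or steps divided by wall-clock runtime. The Trainer's internal bookkeeping may differ in detail, and the reported rates were presumably computed from an unrounded runtime, so only approximate agreement is expected.

```python
import math

# Values copied from all_results.json, README.md and trainer_state.json.
train_samples = 61135
train_runtime = 784.6622        # seconds
total_train_batch_size = 128
global_step = 478

eval_samples = 2000
eval_runtime = 8.2917           # seconds
total_eval_batch_size = 64

# One epoch needs this many optimizer steps (the last batch is partial).
steps_per_epoch = math.ceil(train_samples / total_train_batch_size)
print(steps_per_epoch)  # 478

# Throughput recomputed from the rounded runtimes.
train_samples_per_second = train_samples / train_runtime
train_steps_per_second = global_step / train_runtime
eval_samples_per_second = eval_samples / eval_runtime
eval_steps_per_second = math.ceil(eval_samples / total_eval_batch_size) / eval_runtime

# The "epoch" logged at step 100 is the fraction of steps completed.
epoch_at_step_100 = 100 / global_step
```

The recomputed rates land within rounding distance of the logged `train_samples_per_second` (77.913), `train_steps_per_second` (0.609), `eval_samples_per_second` (241.204) and `eval_steps_per_second` (3.859).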
eval_results.json ADDED
@@ -0,0 +1,16 @@
1
+ {
2
+ "epoch": 1.0,
3
+ "eval_logits/chosen": -3.25435733795166,
4
+ "eval_logits/rejected": -3.204742431640625,
5
+ "eval_logps/chosen": -433.63043212890625,
6
+ "eval_logps/rejected": -429.45819091796875,
7
+ "eval_loss": 0.6412035226821899,
8
+ "eval_rewards/accuracies": 0.64453125,
9
+ "eval_rewards/chosen": -0.10438449680805206,
10
+ "eval_rewards/margins": 0.14497718214988708,
11
+ "eval_rewards/rejected": -0.24936166405677795,
12
+ "eval_runtime": 8.2917,
13
+ "eval_samples": 2000,
14
+ "eval_samples_per_second": 241.204,
15
+ "eval_steps_per_second": 3.859
16
+ }
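The reward metrics in eval_results.json are related by simple arithmetic: the margin is the chosen reward minus the rejected reward. The sketch below checks that relation and adds two sanity checks on the loss. It assumes the run used the standard sigmoid DPO loss without label smoothing, which the diff does not confirm, and `neg_log_sigmoid` is an illustrative helper rather than a library function.

```python
import math

# Values copied from eval_results.json.
rewards_chosen = -0.10438449680805206
rewards_rejected = -0.24936166405677795
logged_margin = 0.14497718214988708
logged_loss = 0.6412035226821899

# Mean margin = mean chosen reward - mean rejected reward.
margin = rewards_chosen - rewards_rejected


def neg_log_sigmoid(x: float) -> float:
    """Assumed per-sample DPO loss for a reward margin x: -log(sigmoid(x))."""
    return math.log1p(math.exp(-x))


# With a zero margin (policy equal to the reference model) the loss is ln 2,
# which matches the 0.6931 logged at step 1 in trainer_state.json.
loss_at_zero = neg_log_sigmoid(0.0)

# -log(sigmoid(x)) is convex, so by Jensen's inequality the mean per-sample
# loss cannot fall below the loss evaluated at the mean margin.
loss_lower_bound = neg_log_sigmoid(margin)

print(abs(margin - logged_margin) < 1e-6)  # True
print(loss_lower_bound < logged_loss)      # True
```

Under that assumption, the logged evaluation loss of 0.6412 sits above the bound of roughly 0.623, as expected when per-sample margins vary around their mean.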
generation_config.json ADDED
@@ -0,0 +1,6 @@
1
+ {
2
+ "_from_model_config": true,
3
+ "bos_token_id": 1,
4
+ "eos_token_id": 2,
5
+ "transformers_version": "4.40.1"
6
+ }
model-00001-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6d1c7b079c6fe16b8306f4ad14c51991d2f4687c5913f4c69aaca2a0348cc737
3
+ size 4987196936
model-00002-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9629c04e2da5a942f500f775442430cc7cbc755c6e95af6b26a3c7de9af646e9
3
+ size 1296089984
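The two `.safetensors` entries above are Git LFS pointer files rather than the weights themselves: each holds three `key value` lines giving the spec version, the content hash, and the size in bytes. A minimal sketch of reading such a pointer follows. `parse_lfs_pointer` is a hypothetical helper written for illustration and is not part of Git LFS or any Hugging Face library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the text of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from model-00001-of-00002.safetensors in this commit.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:6d1c7b079c6fe16b8306f4ad14c51991d2f4687c5913f4c69aaca2a0348cc737
size 4987196936
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algorithm"], pointer["size"])  # sha256 4987196936
```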
model.safetensors.index.json ADDED
@@ -0,0 +1,64 @@
1
+ {
2
+ "metadata": {
3
+ "total_size": 6283280384
4
+ },
5
+ "weight_map": {
6
+ "lm_head.weight": "model-00002-of-00002.safetensors",
7
+ "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
8
+ "model.layers.0.input_layernorm.weight": "model-00001-of-00002.safetensors",
9
+ "model.layers.0.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
10
+ "model.layers.0.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
11
+ "model.layers.0.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
12
+ "model.layers.0.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
13
+ "model.layers.0.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
14
+ "model.layers.0.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
15
+ "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
16
+ "model.layers.0.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
17
+ "model.layers.1.input_layernorm.weight": "model-00001-of-00002.safetensors",
18
+ "model.layers.1.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
19
+ "model.layers.1.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
20
+ "model.layers.1.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
21
+ "model.layers.1.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
22
+ "model.layers.1.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
23
+ "model.layers.1.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
24
+ "model.layers.1.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
25
+ "model.layers.1.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
26
+ "model.layers.2.input_layernorm.weight": "model-00001-of-00002.safetensors",
27
+ "model.layers.2.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
28
+ "model.layers.2.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
29
+ "model.layers.2.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
30
+ "model.layers.2.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
31
+ "model.layers.2.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
32
+ "model.layers.2.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
33
+ "model.layers.2.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
34
+ "model.layers.2.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
35
+ "model.layers.3.input_layernorm.weight": "model-00001-of-00002.safetensors",
36
+ "model.layers.3.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
37
+ "model.layers.3.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
38
+ "model.layers.3.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
39
+ "model.layers.3.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
40
+ "model.layers.3.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
41
+ "model.layers.3.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
42
+ "model.layers.3.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
43
+ "model.layers.3.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
44
+ "model.layers.4.input_layernorm.weight": "model-00001-of-00002.safetensors",
45
+ "model.layers.4.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
46
+ "model.layers.4.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
47
+ "model.layers.4.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
48
+ "model.layers.4.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
49
+ "model.layers.4.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
50
+ "model.layers.4.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
51
+ "model.layers.4.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
52
+ "model.layers.4.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
53
+ "model.layers.5.input_layernorm.weight": "model-00002-of-00002.safetensors",
54
+ "model.layers.5.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
55
+ "model.layers.5.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
56
+ "model.layers.5.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
57
+ "model.layers.5.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
58
+ "model.layers.5.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
59
+ "model.layers.5.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
60
+ "model.layers.5.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
61
+ "model.layers.5.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
62
+ "model.norm.weight": "model-00002-of-00002.safetensors"
63
+ }
64
+ }
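The `total_size` in model.safetensors.index.json can be compared with the shard sizes recorded in the two Git LFS pointers. The sketch below does that and derives a rough parameter count. The 4-bytes-per-parameter figure is an assumption (float32 storage), since the diff does not state the dtype, and reading the small size difference as per-shard header overhead is likewise an interpretation, not something the source confirms.

```python
# Sizes in bytes, copied from the two LFS pointer files in this commit.
shard_sizes = {
    "model-00001-of-00002.safetensors": 4987196936,
    "model-00002-of-00002.safetensors": 1296089984,
}
# "metadata.total_size" from model.safetensors.index.json.
total_size = 6283280384

files_total = sum(shard_sizes.values())
# Bytes on disk that total_size does not account for; plausibly the
# per-shard safetensors headers (an interpretation, see the lead-in).
overhead = files_total - total_size

# Assumption: every parameter is stored as float32 (4 bytes).
bytes_per_param = 4
approx_params = total_size // bytes_per_param

print(files_total)    # 6283286920
print(overhead)       # 6536
print(approx_params)  # 1570820096
```

Under the float32 assumption the estimate is about 1.57 billion parameters, which fits the "1.5b" in the model name. With 2-byte storage the same `total_size` would imply twice as many parameters.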
runs/Apr26_15-48-43_ip-26-0-160-225/events.out.tfevents.1714147034.ip-26-0-160-225.711598.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3a1fc27bf5185d63cd4e2dfcd363e9724966b1098397dd6e5f8a59a1a161d08c
3
- size 19346
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f5835ae8637ba91ca29d3afbfe85b2fdbacaa6b26c7ca7683878a0ac5a110e8
3
+ size 21764
runs/Apr26_15-48-43_ip-26-0-160-225/events.out.tfevents.1714147827.ip-26-0-160-225.711598.1 ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b158da38885dfc714a392467a3ae1215e3432959ad5762c3ae7cf3464549eeac
3
+ size 828
train_results.json ADDED
@@ -0,0 +1,9 @@
1
+ {
2
+ "epoch": 1.0,
3
+ "total_flos": 0.0,
4
+ "train_loss": 0.6571792745689967,
5
+ "train_runtime": 784.6622,
6
+ "train_samples": 61135,
7
+ "train_samples_per_second": 77.913,
8
+ "train_steps_per_second": 0.609
9
+ }
trainer_state.json ADDED
@@ -0,0 +1,394 @@
1
+ {
2
+ "best_metric": null,
3
+ "best_model_checkpoint": null,
4
+ "epoch": 1.0,
5
+ "eval_steps": 100,
6
+ "global_step": 478,
7
+ "is_hyper_param_search": false,
8
+ "is_local_process_zero": true,
9
+ "is_world_process_zero": true,
10
+ "log_history": [
11
+ {
12
+ "epoch": 0.0020920502092050207,
13
+ "grad_norm": 9.710838317871094,
14
+ "learning_rate": 1.0416666666666666e-08,
15
+ "logits/chosen": -3.161454916000366,
16
+ "logits/rejected": -3.0680501461029053,
17
+ "logps/chosen": -437.20782470703125,
18
+ "logps/rejected": -343.8380432128906,
19
+ "loss": 0.6931,
20
+ "rewards/accuracies": 0.0,
21
+ "rewards/chosen": 0.0,
22
+ "rewards/margins": 0.0,
23
+ "rewards/rejected": 0.0,
24
+ "step": 1
25
+ },
26
+ {
27
+ "epoch": 0.05230125523012552,
28
+ "grad_norm": 8.52570629119873,
29
+ "learning_rate": 2.604166666666667e-07,
30
+ "logits/chosen": -3.30265212059021,
31
+ "logits/rejected": -3.2195777893066406,
32
+ "logps/chosen": -432.7400207519531,
33
+ "logps/rejected": -391.2707824707031,
34
+ "loss": 0.6931,
35
+ "rewards/accuracies": 0.5234375,
36
+ "rewards/chosen": 0.0006945470231585205,
37
+ "rewards/margins": 0.0007254942320287228,
38
+ "rewards/rejected": -3.094731437158771e-05,
39
+ "step": 25
40
+ },
41
+ {
42
+ "epoch": 0.10460251046025104,
43
+ "grad_norm": 9.973247528076172,
44
+ "learning_rate": 4.999733114418725e-07,
45
+ "logits/chosen": -3.299149751663208,
46
+ "logits/rejected": -3.2438504695892334,
47
+ "logps/chosen": -446.8968811035156,
48
+ "logps/rejected": -404.587158203125,
49
+ "loss": 0.6914,
50
+ "rewards/accuracies": 0.5799999833106995,
51
+ "rewards/chosen": 0.005268932785838842,
52
+ "rewards/margins": 0.002896952675655484,
53
+ "rewards/rejected": 0.002371980343014002,
54
+ "step": 50
55
+ },
56
+ {
57
+ "epoch": 0.15690376569037656,
58
+ "grad_norm": 9.357086181640625,
59
+ "learning_rate": 4.951516761176343e-07,
60
+ "logits/chosen": -3.302896738052368,
61
+ "logits/rejected": -3.225714683532715,
62
+ "logps/chosen": -480.6836853027344,
63
+ "logps/rejected": -436.4853515625,
64
+ "loss": 0.6871,
65
+ "rewards/accuracies": 0.5975000262260437,
66
+ "rewards/chosen": 0.013074235059320927,
67
+ "rewards/margins": 0.012715624645352364,
68
+ "rewards/rejected": 0.0003586093371268362,
69
+ "step": 75
70
+ },
71
+ {
72
+ "epoch": 0.20920502092050208,
73
+ "grad_norm": 8.75236988067627,
74
+ "learning_rate": 4.821741763807186e-07,
75
+ "logits/chosen": -3.2598507404327393,
76
+ "logits/rejected": -3.2092175483703613,
77
+ "logps/chosen": -426.1629638671875,
78
+ "logps/rejected": -389.50421142578125,
79
+ "loss": 0.6795,
80
+ "rewards/accuracies": 0.612500011920929,
81
+ "rewards/chosen": 0.003723819274455309,
82
+ "rewards/margins": 0.02804265171289444,
83
+ "rewards/rejected": -0.02431883104145527,
84
+ "step": 100
85
+ },
86
+ {
87
+ "epoch": 0.20920502092050208,
88
+ "eval_logits/chosen": -3.313567638397217,
89
+ "eval_logits/rejected": -3.2565112113952637,
90
+ "eval_logps/chosen": -423.0196533203125,
91
+ "eval_logps/rejected": -407.8036804199219,
92
+ "eval_loss": 0.6759119629859924,
93
+ "eval_rewards/accuracies": 0.62890625,
94
+ "eval_rewards/chosen": 0.0017230990342795849,
95
+ "eval_rewards/margins": 0.0345395989716053,
96
+ "eval_rewards/rejected": -0.03281649947166443,
97
+ "eval_runtime": 8.2733,
98
+ "eval_samples_per_second": 241.742,
99
+ "eval_steps_per_second": 3.868,
100
+ "step": 100
101
+ },
102
+ {
103
+ "epoch": 0.2615062761506276,
104
+ "grad_norm": 8.889654159545898,
105
+ "learning_rate": 4.614725560802639e-07,
106
+ "logits/chosen": -3.2867064476013184,
107
+ "logits/rejected": -3.2023117542266846,
108
+ "logps/chosen": -430.924560546875,
109
+ "logps/rejected": -382.1640319824219,
110
+ "loss": 0.6731,
111
+ "rewards/accuracies": 0.6474999785423279,
112
+ "rewards/chosen": -0.01004675030708313,
113
+ "rewards/margins": 0.04363078624010086,
114
+ "rewards/rejected": -0.05367753654718399,
115
+ "step": 125
116
+ },
117
+ {
118
+ "epoch": 0.3138075313807531,
119
+ "grad_norm": 9.882610321044922,
120
+ "learning_rate": 4.337355301007335e-07,
121
+ "logits/chosen": -3.222916841506958,
122
+ "logits/rejected": -3.1899585723876953,
123
+ "logps/chosen": -443.3583679199219,
124
+ "logps/rejected": -414.344970703125,
125
+ "loss": 0.667,
126
+ "rewards/accuracies": 0.637499988079071,
127
+ "rewards/chosen": -0.017106110230088234,
128
+ "rewards/margins": 0.06052257865667343,
129
+ "rewards/rejected": -0.07762870192527771,
130
+ "step": 150
131
+ },
132
+ {
133
+ "epoch": 0.36610878661087864,
134
+ "grad_norm": 9.968969345092773,
135
+ "learning_rate": 3.9988587174999306e-07,
136
+ "logits/chosen": -3.2201168537139893,
137
+ "logits/rejected": -3.131910800933838,
138
+ "logps/chosen": -477.7035217285156,
139
+ "logps/rejected": -405.7104797363281,
140
+ "loss": 0.6574,
141
+ "rewards/accuracies": 0.6399999856948853,
142
+ "rewards/chosen": -0.03587143123149872,
143
+ "rewards/margins": 0.09045815467834473,
144
+ "rewards/rejected": -0.12632958590984344,
145
+ "step": 175
146
+ },
147
+ {
148
+ "epoch": 0.41841004184100417,
149
+ "grad_norm": 9.230700492858887,
150
+ "learning_rate": 3.610497133404795e-07,
151
+ "logits/chosen": -3.23995304107666,
152
+ "logits/rejected": -3.1550867557525635,
153
+ "logps/chosen": -431.6617126464844,
154
+ "logps/rejected": -396.62774658203125,
155
+ "loss": 0.6584,
156
+ "rewards/accuracies": 0.6349999904632568,
157
+ "rewards/chosen": -0.06991340965032578,
158
+ "rewards/margins": 0.08443903923034668,
159
+ "rewards/rejected": -0.15435244143009186,
160
+ "step": 200
161
+ },
162
+ {
163
+ "epoch": 0.41841004184100417,
164
+ "eval_logits/chosen": -3.2767982482910156,
165
+ "eval_logits/rejected": -3.2240023612976074,
166
+ "eval_logps/chosen": -429.85614013671875,
167
+ "eval_logps/rejected": -420.6952209472656,
168
+ "eval_loss": 0.6533502340316772,
169
+ "eval_rewards/accuracies": 0.64453125,
170
+ "eval_rewards/chosen": -0.06664139777421951,
171
+ "eval_rewards/margins": 0.09509073942899704,
172
+ "eval_rewards/rejected": -0.16173213720321655,
173
+ "eval_runtime": 8.2763,
174
+ "eval_samples_per_second": 241.653,
175
+ "eval_steps_per_second": 3.866,
176
+ "step": 200
177
+ },
178
+ {
179
+ "epoch": 0.4707112970711297,
180
+ "grad_norm": 10.79430103302002,
181
+ "learning_rate": 3.185190812915646e-07,
182
+ "logits/chosen": -3.1671783924102783,
183
+ "logits/rejected": -3.118861436843872,
184
+ "logps/chosen": -446.6968994140625,
185
+ "logps/rejected": -410.1864013671875,
186
+ "loss": 0.6518,
187
+ "rewards/accuracies": 0.6449999809265137,
188
+ "rewards/chosen": -0.08074235171079636,
189
+ "rewards/margins": 0.10100732743740082,
190
+ "rewards/rejected": -0.18174967169761658,
191
+ "step": 225
192
+ },
193
+ {
194
+ "epoch": 0.5230125523012552,
195
+ "grad_norm": 10.086767196655273,
196
+ "learning_rate": 2.7370891215954565e-07,
197
+ "logits/chosen": -3.1980080604553223,
198
+ "logits/rejected": -3.1623446941375732,
199
+ "logps/chosen": -440.4261474609375,
200
+ "logps/rejected": -439.9627685546875,
201
+ "loss": 0.6476,
202
+ "rewards/accuracies": 0.637499988079071,
203
+ "rewards/chosen": -0.07635506987571716,
204
+ "rewards/margins": 0.11709018051624298,
205
+ "rewards/rejected": -0.19344525039196014,
206
+ "step": 250
207
+ },
208
+ {
209
+ "epoch": 0.5753138075313807,
210
+ "grad_norm": 9.709197044372559,
211
+ "learning_rate": 2.2810997961375938e-07,
212
+ "logits/chosen": -3.216128349304199,
213
+ "logits/rejected": -3.1448960304260254,
214
+ "logps/chosen": -425.5062561035156,
215
+ "logps/rejected": -391.5084228515625,
216
+ "loss": 0.6461,
217
+ "rewards/accuracies": 0.6800000071525574,
218
+ "rewards/chosen": -0.072402223944664,
219
+ "rewards/margins": 0.1306913197040558,
220
+ "rewards/rejected": -0.2030935436487198,
221
+ "step": 275
222
+ },
223
+ {
224
+ "epoch": 0.6276150627615062,
225
+ "grad_norm": 10.980072975158691,
226
+ "learning_rate": 1.8323929841460178e-07,
227
+ "logits/chosen": -3.2006824016571045,
228
+ "logits/rejected": -3.1383109092712402,
229
+ "logps/chosen": -443.0799865722656,
230
+ "logps/rejected": -436.2774658203125,
231
+ "loss": 0.6494,
232
+ "rewards/accuracies": 0.625,
233
+ "rewards/chosen": -0.1245601624250412,
234
+ "rewards/margins": 0.10581608861684799,
235
+ "rewards/rejected": -0.2303762584924698,
236
+ "step": 300
237
+ },
238
+ {
239
+ "epoch": 0.6276150627615062,
240
+ "eval_logits/chosen": -3.2553329467773438,
241
+ "eval_logits/rejected": -3.2049574851989746,
242
+ "eval_logps/chosen": -433.9639892578125,
243
+ "eval_logps/rejected": -428.6236572265625,
244
+ "eval_loss": 0.6438009142875671,
245
+ "eval_rewards/accuracies": 0.62109375,
246
+ "eval_rewards/chosen": -0.10771973431110382,
247
+ "eval_rewards/margins": 0.13329659402370453,
248
+ "eval_rewards/rejected": -0.24101632833480835,
249
+ "eval_runtime": 8.3559,
250
+ "eval_samples_per_second": 239.351,
251
+ "eval_steps_per_second": 3.83,
252
+ "step": 300
253
+ },
254
+ {
255
+ "epoch": 0.6799163179916318,
256
+ "grad_norm": 9.95361614227295,
257
+ "learning_rate": 1.4058965538597032e-07,
258
+ "logits/chosen": -3.2326276302337646,
259
+ "logits/rejected": -3.198971748352051,
260
+ "logps/chosen": -443.37371826171875,
261
+ "logps/rejected": -433.8561706542969,
262
+ "loss": 0.6392,
263
+ "rewards/accuracies": 0.6399999856948853,
264
+ "rewards/chosen": -0.11981040984392166,
265
+ "rewards/margins": 0.12150833755731583,
266
+ "rewards/rejected": -0.2413187474012375,
267
+ "step": 325
268
+ },
269
+ {
270
+ "epoch": 0.7322175732217573,
271
+ "grad_norm": 10.580938339233398,
272
+ "learning_rate": 1.0157994641835734e-07,
273
+ "logits/chosen": -3.1840951442718506,
274
+ "logits/rejected": -3.0981223583221436,
275
+ "logps/chosen": -430.7939453125,
276
+ "logps/rejected": -403.5060119628906,
277
+ "loss": 0.6447,
278
+ "rewards/accuracies": 0.6225000023841858,
279
+ "rewards/chosen": -0.15974169969558716,
280
+ "rewards/margins": 0.09658970683813095,
281
+ "rewards/rejected": -0.2563314139842987,
282
+ "step": 350
283
+ },
284
+ {
285
+ "epoch": 0.7845188284518828,
286
+ "grad_norm": 8.326279640197754,
287
+ "learning_rate": 6.75079717232744e-08,
288
+ "logits/chosen": -3.2011187076568604,
289
+ "logits/rejected": -3.0969460010528564,
290
+ "logps/chosen": -465.22344970703125,
291
+ "logps/rejected": -392.5793762207031,
292
+ "loss": 0.6411,
293
+ "rewards/accuracies": 0.6549999713897705,
294
+ "rewards/chosen": -0.11683624982833862,
295
+ "rewards/margins": 0.14286838471889496,
296
+ "rewards/rejected": -0.2597046196460724,
297
+ "step": 375
298
+ },
299
+ {
300
+ "epoch": 0.8368200836820083,
301
+ "grad_norm": 8.851872444152832,
302
+ "learning_rate": 3.9507259776993954e-08,
303
+ "logits/chosen": -3.221069097518921,
304
+ "logits/rejected": -3.1744155883789062,
305
+ "logps/chosen": -449.8016052246094,
306
+ "logps/rejected": -409.0178527832031,
307
+ "loss": 0.6428,
308
+ "rewards/accuracies": 0.6524999737739563,
309
+ "rewards/chosen": -0.08884063363075256,
310
+ "rewards/margins": 0.13940051198005676,
311
+ "rewards/rejected": -0.22824116051197052,
312
+ "step": 400
313
+ },
314
+ {
315
+ "epoch": 0.8368200836820083,
316
+ "eval_logits/chosen": -3.254263401031494,
317
+ "eval_logits/rejected": -3.204622507095337,
318
+ "eval_logps/chosen": -433.20001220703125,
319
+ "eval_logps/rejected": -428.88836669921875,
320
+ "eval_loss": 0.6415477395057678,
321
+ "eval_rewards/accuracies": 0.62109375,
322
+ "eval_rewards/chosen": -0.10007989406585693,
323
+ "eval_rewards/margins": 0.14358317852020264,
324
+ "eval_rewards/rejected": -0.24366310238838196,
325
+ "eval_runtime": 8.7287,
326
+ "eval_samples_per_second": 229.13,
327
+ "eval_steps_per_second": 3.666,
328
+ "step": 400
329
+ },
330
+ {
331
+ "epoch": 0.8891213389121339,
332
+ "grad_norm": 10.138018608093262,
333
+ "learning_rate": 1.850935636255496e-08,
334
+ "logits/chosen": -3.1941397190093994,
335
+ "logits/rejected": -3.1335413455963135,
336
+ "logps/chosen": -463.3765869140625,
337
+ "logps/rejected": -418.0171203613281,
338
+ "loss": 0.6393,
339
+ "rewards/accuracies": 0.6399999856948853,
340
+ "rewards/chosen": -0.11999661475419998,
341
+ "rewards/margins": 0.13007262349128723,
342
+ "rewards/rejected": -0.2500692307949066,
343
+ "step": 425
344
+ },
345
+ {
346
+ "epoch": 0.9414225941422594,
347
+ "grad_norm": 9.687002182006836,
348
+ "learning_rate": 5.212833302556258e-09,
349
+ "logits/chosen": -3.202920913696289,
350
+ "logits/rejected": -3.14504337310791,
351
+ "logps/chosen": -442.259765625,
352
+ "logps/rejected": -406.7275085449219,
353
+ "loss": 0.6419,
354
+ "rewards/accuracies": 0.6050000190734863,
355
+ "rewards/chosen": -0.13094988465309143,
356
+ "rewards/margins": 0.1017264574766159,
357
+ "rewards/rejected": -0.23267632722854614,
358
+ "step": 450
359
+ },
360
+ {
361
+ "epoch": 0.9937238493723849,
362
+ "grad_norm": 8.862220764160156,
363
+ "learning_rate": 6.004792024680294e-11,
364
+ "logits/chosen": -3.1527392864227295,
365
+ "logits/rejected": -3.1062843799591064,
366
+ "logps/chosen": -445.8639831542969,
367
+ "logps/rejected": -418.4268493652344,
368
+ "loss": 0.6377,
369
+ "rewards/accuracies": 0.6575000286102295,
370
+ "rewards/chosen": -0.11145105212926865,
371
+ "rewards/margins": 0.15232053399085999,
372
+ "rewards/rejected": -0.26377159357070923,
373
+ "step": 475
374
+ },
375
+ {
376
+ "epoch": 1.0,
377
+ "step": 478,
378
+ "total_flos": 0.0,
379
+ "train_loss": 0.6571792745689967,
380
+ "train_runtime": 784.6622,
381
+ "train_samples_per_second": 77.913,
382
+ "train_steps_per_second": 0.609
383
+ }
384
+ ],
385
+ "logging_steps": 25,
386
+ "max_steps": 478,
387
+ "num_input_tokens_seen": 0,
388
+ "num_train_epochs": 1,
389
+ "save_steps": 100,
390
+ "total_flos": 0.0,
391
+ "train_batch_size": 8,
392
+ "trial_name": null,
393
+ "trial_params": null
394
+ }
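The `learning_rate` values in the `log_history` of trainer_state.json follow the shape implied by the README hyperparameters: linear warmup over the first 10% of steps, then cosine decay. The following is a minimal sketch that reproduces the logged values. It assumes the warmup length is the ceiling of `max_steps * warmup_ratio` and that the decay is a half cosine down to zero. `lr_at_step` is an illustrative function, not the scheduler code used in the run.

```python
import math


def lr_at_step(step: int, base_lr: float = 5e-07, max_steps: int = 478,
               warmup_ratio: float = 0.1) -> float:
    """Linear warmup followed by half-cosine decay to zero (assumed shape)."""
    warmup_steps = math.ceil(max_steps * warmup_ratio)  # 48 for this run
    if step < warmup_steps:
        # Warmup: ramp linearly from 0 up to base_lr.
        return base_lr * step / warmup_steps
    # Decay: progress runs from 0 (end of warmup) to 1 (last step).
    progress = (step - warmup_steps) / (max_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


# Compare against a few entries of log_history.
logged = {
    1: 1.0416666666666666e-08,
    25: 2.604166666666667e-07,
    50: 4.999733114418725e-07,
    100: 4.821741763807186e-07,
    475: 6.004792024680294e-11,
}
for step, expected in logged.items():
    print(step, lr_at_step(step), expected)
```

The agreement at steps 1 and 25 indicates a 48-step warmup, and the later entries trace the cosine decay towards zero by step 478.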
wandb/debug-internal.log CHANGED
@@ -747,3 +747,208 @@
747
  2024-04-26 16:08:23,064 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
748
  2024-04-26 16:08:24,187 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
749
  2024-04-26 16:08:29,188 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
750
+ 2024-04-26 16:08:33,026 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
751
+ 2024-04-26 16:08:34,871 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
752
+ 2024-04-26 16:08:37,030 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
753
+ 2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
754
+ 2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
755
+ 2024-04-26 16:08:37,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
756
+ 2024-04-26 16:08:39,032 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
757
+ 2024-04-26 16:08:39,996 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
758
+ 2024-04-26 16:08:41,034 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
759
+ 2024-04-26 16:08:43,038 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
760
+ 2024-04-26 16:08:45,040 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
761
+ 2024-04-26 16:08:45,420 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
762
+ 2024-04-26 16:08:46,337 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
763
+ 2024-04-26 16:08:47,042 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
764
+ 2024-04-26 16:08:49,044 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
765
+ 2024-04-26 16:08:50,801 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
766
+ 2024-04-26 16:08:51,045 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
767
+ 2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
768
+ 2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
769
+ 2024-04-26 16:08:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
770
+ 2024-04-26 16:08:53,051 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
771
+ 2024-04-26 16:08:55,053 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
772
+ 2024-04-26 16:08:56,161 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
773
+ 2024-04-26 16:08:57,055 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
774
+ 2024-04-26 16:08:59,057 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
775
+ 2024-04-26 16:09:01,059 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
776
+ 2024-04-26 16:09:01,568 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
777
+ 2024-04-26 16:09:03,060 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
778
+ 2024-04-26 16:09:05,062 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
779
+ 2024-04-26 16:09:06,912 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
780
+ 2024-04-26 16:09:07,057 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
781
+ 2024-04-26 16:09:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
782
+ 2024-04-26 16:09:07,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
783
+ 2024-04-26 16:09:07,064 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
784
+ 2024-04-26 16:09:07,332 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
785
+ 2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send():379] send: history
786
+ 2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
787
+ 2024-04-26 16:09:07,335 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
788
+ 2024-04-26 16:09:08,066 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
789
+ 2024-04-26 16:09:09,067 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
790
+ 2024-04-26 16:09:11,069 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
791
+ 2024-04-26 16:09:12,304 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
792
+ 2024-04-26 16:09:13,071 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
793
+ 2024-04-26 16:09:15,073 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
794
+ 2024-04-26 16:09:16,339 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
795
+ 2024-04-26 16:09:17,076 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
796
+ 2024-04-26 16:09:17,760 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
797
+ 2024-04-26 16:09:19,077 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
798
+ 2024-04-26 16:09:21,079 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
799
+ 2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
800
+ 2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
801
+ 2024-04-26 16:09:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
802
+ 2024-04-26 16:09:23,081 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
803
+ 2024-04-26 16:09:23,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
804
+ 2024-04-26 16:09:25,083 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
805
+ 2024-04-26 16:09:27,085 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
806
+ 2024-04-26 16:09:28,623 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
807
+ 2024-04-26 16:09:29,087 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
808
+ 2024-04-26 16:09:31,089 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
809
+ 2024-04-26 16:09:33,091 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
810
+ 2024-04-26 16:09:34,019 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
811
+ 2024-04-26 16:09:35,093 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
812
+ 2024-04-26 16:09:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
813
+ 2024-04-26 16:09:37,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
814
+ 2024-04-26 16:09:37,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
815
+ 2024-04-26 16:09:37,095 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
816
+ 2024-04-26 16:09:39,097 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
817
+ 2024-04-26 16:09:39,390 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
818
+ 2024-04-26 16:09:41,099 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
819
+ 2024-04-26 16:09:41,173 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
820
+ 2024-04-26 16:09:41,175 DEBUG SenderThread:713738 [sender.py:send():379] send: history
821
+ 2024-04-26 16:09:41,176 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
822
+ 2024-04-26 16:09:41,177 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
823
+ 2024-04-26 16:09:42,101 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
824
+ 2024-04-26 16:09:43,102 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
825
+ 2024-04-26 16:09:44,810 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
826
+ 2024-04-26 16:09:45,104 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
827
+ 2024-04-26 16:09:46,340 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
828
+ 2024-04-26 16:09:47,106 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
829
+ 2024-04-26 16:09:49,109 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
830
+ 2024-04-26 16:09:50,192 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
831
+ 2024-04-26 16:09:51,111 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
832
+ 2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
833
+ 2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
834
+ 2024-04-26 16:09:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
835
+ 2024-04-26 16:09:53,112 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
836
+ 2024-04-26 16:09:55,115 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
837
+ 2024-04-26 16:09:55,629 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
838
+ 2024-04-26 16:09:57,117 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
839
+ 2024-04-26 16:09:59,119 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
840
+ 2024-04-26 16:10:01,005 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
841
+ 2024-04-26 16:10:01,121 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
842
+ 2024-04-26 16:10:03,123 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
843
+ 2024-04-26 16:10:05,125 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
844
+ 2024-04-26 16:10:06,489 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
845
+ 2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
846
+ 2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
847
+ 2024-04-26 16:10:07,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
848
+ 2024-04-26 16:10:07,127 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
849
+ 2024-04-26 16:10:09,129 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
850
+ 2024-04-26 16:10:11,132 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
851
+ 2024-04-26 16:10:11,931 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
852
+ 2024-04-26 16:10:13,134 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
853
+ 2024-04-26 16:10:15,038 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
854
+ 2024-04-26 16:10:15,039 DEBUG SenderThread:713738 [sender.py:send():379] send: history
855
+ 2024-04-26 16:10:15,040 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
856
+ 2024-04-26 16:10:15,041 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
857
+ 2024-04-26 16:10:15,136 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
858
+ 2024-04-26 16:10:15,137 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
859
+ 2024-04-26 16:10:16,342 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
860
+ 2024-04-26 16:10:17,138 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
861
+ 2024-04-26 16:10:17,356 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
862
+ 2024-04-26 16:10:19,091 DEBUG SenderThread:713738 [sender.py:send():379] send: telemetry
863
+ 2024-04-26 16:10:19,093 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
864
+ 2024-04-26 16:10:19,094 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
865
+ 2024-04-26 16:10:19,097 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
866
+ 2024-04-26 16:10:19,098 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
867
+ 2024-04-26 16:10:19,099 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
868
+ 2024-04-26 16:10:19,099 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
869
+ 2024-04-26 16:10:19,101 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
870
+ 2024-04-26 16:10:19,101 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
871
+ 2024-04-26 16:10:19,102 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
872
+ 2024-04-26 16:10:19,103 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
873
+ 2024-04-26 16:10:19,104 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
874
+ 2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send():379] send: history
875
+ 2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
876
+ 2024-04-26 16:10:19,106 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
877
+ 2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
878
+ 2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
879
+ 2024-04-26 16:10:21,143 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
880
+ 2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
881
+ 2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
882
+ 2024-04-26 16:10:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
883
+ 2024-04-26 16:10:22,467 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
884
+ 2024-04-26 16:10:23,145 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
885
+ 2024-04-26 16:10:25,147 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
886
+ 2024-04-26 16:10:27,149 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
887
+ 2024-04-26 16:10:27,423 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
888
+ 2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send():379] send: history
889
+ 2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
890
+ 2024-04-26 16:10:27,429 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
891
+ 2024-04-26 16:10:28,151 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
892
+ 2024-04-26 16:10:28,437 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
893
+ 2024-04-26 16:10:29,153 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
894
+ 2024-04-26 16:10:29,154 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/config.yaml
895
+ 2024-04-26 16:10:33,569 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
896
+ 2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
897
+ 2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
898
+ 2024-04-26 16:10:37,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
899
+ 2024-04-26 16:10:39,164 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
900
+ 2024-04-26 16:10:39,263 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
901
+ 2024-04-26 16:10:44,264 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
902
+ 2024-04-26 16:10:46,345 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
903
+ 2024-04-26 16:10:49,174 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
904
+ 2024-04-26 16:10:50,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
905
+ 2024-04-26 16:10:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
906
+ 2024-04-26 16:10:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
907
+ 2024-04-26 16:10:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
908
+ 2024-04-26 16:10:55,181 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
909
+ 2024-04-26 16:10:55,370 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
910
+ 2024-04-26 16:10:57,183 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
911
+ 2024-04-26 16:10:59,185 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
912
+ 2024-04-26 16:11:00,468 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
913
+ 2024-04-26 16:11:01,187 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
914
+ 2024-04-26 16:11:03,189 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
915
+ 2024-04-26 16:11:05,191 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
916
+ 2024-04-26 16:11:05,523 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
917
+ 2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
918
+ 2024-04-26 16:11:07,120 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
919
+ 2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
920
+ 2024-04-26 16:11:07,194 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
921
+ 2024-04-26 16:11:09,196 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
922
+ 2024-04-26 16:11:10,641 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
923
+ 2024-04-26 16:11:13,200 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
924
+ 2024-04-26 16:11:15,202 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
925
+ 2024-04-26 16:11:15,699 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
926
+ 2024-04-26 16:11:16,347 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
927
+ 2024-04-26 16:11:17,204 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
928
+ 2024-04-26 16:11:19,207 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
929
+ 2024-04-26 16:11:21,307 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
930
+ 2024-04-26 16:11:22,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
931
+ 2024-04-26 16:11:22,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
932
+ 2024-04-26 16:11:22,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
933
+ 2024-04-26 16:11:27,229 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
934
+ 2024-04-26 16:11:32,323 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
935
+ 2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
936
+ 2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
937
+ 2024-04-26 16:11:37,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
938
+ 2024-04-26 16:11:37,584 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
939
+ 2024-04-26 16:11:42,686 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
940
+ 2024-04-26 16:11:46,350 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
941
+ 2024-04-26 16:11:47,930 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
942
+ 2024-04-26 16:11:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
943
+ 2024-04-26 16:11:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
944
+ 2024-04-26 16:11:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
945
+ 2024-04-26 16:11:53,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
946
+ 2024-04-26 16:11:58,201 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
947
+ 2024-04-26 16:12:03,288 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
948
+ 2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
949
+ 2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
950
+ 2024-04-26 16:12:07,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
951
+ 2024-04-26 16:12:08,297 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
952
+ 2024-04-26 16:12:13,359 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
953
+ 2024-04-26 16:12:16,352 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
954
+ 2024-04-26 16:12:18,516 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
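The debug log entries above share one layout: timestamp, level, thread name, process id, a bracketed `[file:function():line]` source tag, then the message. As a rough sketch (the regular expression and the `tally` helper are illustrative assumptions inferred from these entries, not something wandb provides), the entries could be summarized per level and function like this:

```python
import re
from collections import Counter

# Assumed layout, inferred from the entries above; not a wandb format spec.
LINE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+"
    r"(?P<thread>\S+)\s*:(?P<pid>\d+)\s+"
    r"\[(?P<file>[\w.]+):(?P<func>\w+)\(\):(?P<lineno>\d+)\]\s+"
    r"(?P<msg>.*)$"
)


def tally(lines):
    """Count log entries by (level, function); skip lines that do not match."""
    counts = Counter()
    for raw in lines:
        # Drop the leading "+" that the diff view puts in front of added lines.
        line = raw.strip().lstrip("+").strip()
        match = LINE_RE.match(line)
        if match:
            counts[(match["level"], match["func"])] += 1
    return counts


sample = [
    "+ 2024-04-26 16:12:16,352 DEBUG SenderThread:713738 [sender.py:send():379] send: stats",
    "+ 2024-04-26 16:12:18,516 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report",
    "954",  # diff line number, ignored because it does not match the layout
]
for key, count in sorted(tally(sample).items()):
    print(key, count)
```

Grouping by function rather than by full message keeps the `dir_watcher.py` entries, whose messages embed a file path, in a single bucket.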
wandb/run-20240426_155715-vljs8xzo/files/config.yaml CHANGED
@@ -32,6 +32,7 @@ _wandb:
32
  3:
33
  - 7
34
  - 23
35
+ - 62
36
  4: 3.11.9
37
  5: 0.16.6
38
  6: 4.40.1
wandb/run-20240426_155715-vljs8xzo/files/output.log CHANGED
@@ -326,3 +326,127 @@
326
  {'eval_loss': 0.6415477395057678, 'eval_runtime': 8.7287, 'eval_samples_per_second': 229.13, 'eval_steps_per_second': 3.666, 'eval_rewards/chosen': -0.10007989406585693, 'eval_rewards/rejected': -0.24366310238838196, 'eval_rewards/accuracies': 0.62109375, 'eval_rewards/margins': 0.14358317852020264, 'eval_logps/rejected': -428.88836669921875, 'eval_logps/chosen': -433.20001220703125, 'eval_logits/rejected': -3.204622507095337, 'eval_logits/chosen': -3.254263401031494, 'epoch': 0.84}
327
  [INFO|modeling_utils.py:2598] 2024-04-26 16:08:13,074 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./checkpoint-400/model.safetensors.index.json.
328
  [INFO|tokenization_utils_base.py:2488] 2024-04-26 16:08:13,077 >> tokenizer config file saved in ./checkpoint-400/tokenizer_config.json
329
+ [INFO|tokenization_utils_base.py:2497] 2024-04-26 16:08:13,079 >> Special tokens file saved in ./checkpoint-400/special_tokens_map.json
330
+ [INFO|tokenization_utils_base.py:2488] 2024-04-26 16:08:31,839 >> tokenizer config file saved in ./tokenizer_config.json
331
+ [INFO|tokenization_utils_base.py:2497] 2024-04-26 16:08:31,841 >> Special tokens file saved in ./special_tokens_map.json
332
+ [INFO|trainer.py:3397] 2024-04-26 16:08:31,870 >> Deleting older checkpoint [checkpoint-300] due to args.save_total_limit
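The deletion message above reflects checkpoint rotation: once more checkpoints exist than `save_total_limit` allows, the oldest are removed. The following is a simplified sketch of that idea, not the Trainer's actual implementation; the helper name `checkpoints_to_delete` is hypothetical, and the limit of 1 is an assumption because the log does not show the configured value.

```python
import re


def checkpoints_to_delete(names, save_total_limit):
    """Return the oldest checkpoint names beyond the limit, ordered by step."""
    if save_total_limit is None or save_total_limit <= 0:
        return []
    steps = []
    for name in names:
        match = re.fullmatch(r"checkpoint-(\d+)", name)
        if match:
            steps.append((int(match.group(1)), name))
    steps.sort()  # oldest (lowest step) first
    excess = max(len(steps) - save_total_limit, 0)
    return [name for _, name in steps[:excess]]


# Hypothetical limit of 1; the log does not show the actual setting.
print(checkpoints_to_delete(["checkpoint-300", "checkpoint-400"], save_total_limit=1))
```

Sorting by the numeric step rather than by name matters once step counts differ in length, for example `checkpoint-900` versus `checkpoint-1000`.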
333
+
334
+
335
+
336
+
337
+
338
+
339
+
340
+
341
+
342
+
343
+
344
+
345
+
346
+
347
+
348
+
349
+ 89%|████████▉ | 425/478 [11:46<01:11, 1.35s/it]
350
+
351
+
352
+
353
+
354
+
355
+
356
+
357
+
358
+
359
+
360
+
361
+
362
+
363
+
364
+
365
+
366
+
367
+ 94%|█████████▍| 450/478 [12:20<00:37, 1.35s/it]
368
+
369
+
370
+
371
+
372
+
373
+
374
+
375
+
376
+
377
+
378
+
379
+
380
+
381
+
382
+
383
+
384
+
385
+ 99%|█████████▉| 475/478 [12:54<00:04, 1.35s/it]
386
+
387
+ 100%|██████████| 478/478 [12:58<00:00, 1.35s/it][INFO|trainer.py:2316] 2024-04-26 16:10:19,036 >>
388
+ Training completed. Do not forget to share your model on huggingface.co/models =)
389
+ 100%|██████████| 478/478 [12:58<00:00, 1.63s/it]
390
+ [INFO|trainer.py:3614] 2024-04-26 16:10:19,102 >> ***** Running Evaluation *****
391
+ [INFO|trainer.py:3616] 2024-04-26 16:10:19,102 >> Num examples = 2000
392
+ [INFO|trainer.py:3619] 2024-04-26 16:10:19,102 >> Batch size = 8
393
+ 12%|█▎ | 4/32 [00:00<00:05, 5.08it/s]
394
+ {'train_runtime': 784.6622, 'train_samples_per_second': 77.913, 'train_steps_per_second': 0.609, 'train_loss': 0.6571792745689967, 'epoch': 1.0}
395
+ ***** train metrics *****
396
+ epoch = 1.0
397
+ total_flos = 0GF
398
+ train_loss = 0.6572
399
+ train_runtime = 0:13:04.66
400
+ train_samples = 61135
401
+ train_samples_per_second = 77.913
402
+ train_steps_per_second = 0.609
403
+ 2024-04-26 16:10:19 - INFO - __main__ - *** Training complete ***
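The train metrics above are mutually consistent, which can be checked with a little arithmetic. The sketch below uses only values from the log plus `total_train_batch_size: 128` from the model card (8 per device x 8 devices x 2 accumulation steps); the assumption that the step count is the sample count divided by the total batch size, rounded up, is a simplification of how the Trainer computes it.

```python
import math

# Values copied from the train metrics above.
train_runtime_s = 784.6622
train_samples = 61135
total_steps = 478

# From the model card hyperparameters: 8 per device x 8 devices x 2 accumulation.
total_train_batch_size = 128

samples_per_second = train_samples / train_runtime_s
steps_per_second = total_steps / train_runtime_s
# Simplifying assumption: one optimizer step per full (or final partial) batch.
expected_steps = math.ceil(train_samples / total_train_batch_size)
minutes, seconds = divmod(train_runtime_s, 60)

print(f"samples/s       = {samples_per_second:.3f}")
print(f"steps/s         = {steps_per_second:.3f}")
print(f"expected steps  = {expected_steps}")
print(f"runtime (mm:ss) = {int(minutes)}:{seconds:05.2f}")
```

The results should line up with the logged `train_samples_per_second`, `train_steps_per_second`, the 478-step progress bar, and the `0:13:04.66` runtime.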
404
+
405
+
406
+
407
+ 100%|██████████| 32/32 [00:08<00:00, 3.97it/s]
408
+ [INFO|trainer.py:3305] 2024-04-26 16:10:27,430 >> Saving model checkpoint to ./
409
+ [INFO|configuration_utils.py:471] 2024-04-26 16:10:27,432 >> Configuration saved in ./config.json
410
+ [INFO|configuration_utils.py:697] 2024-04-26 16:10:27,434 >> Configuration saved in ./generation_config.json
411
+ ***** eval metrics *****
412
+ epoch = 1.0
413
+ eval_logits/chosen = -3.2544
414
+ eval_logits/rejected = -3.2047
415
+ eval_logps/chosen = -433.6304
416
+ eval_logps/rejected = -429.4582
417
+ eval_loss = 0.6412
418
+ eval_rewards/accuracies = 0.6445
419
+ eval_rewards/chosen = -0.1044
420
+ eval_rewards/margins = 0.145
421
+ eval_rewards/rejected = -0.2494
422
+ eval_runtime = 0:00:08.29
423
+ eval_samples = 2000
424
+ eval_samples_per_second = 241.204
425
+ eval_steps_per_second = 3.859
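The eval metrics above can be cross-checked the same way. In DPO logging the reward margin is the chosen reward minus the rejected reward, so the averaged values should satisfy the same relation up to rounding. This sketch uses the logged values plus `total_eval_batch_size: 64` from the model card; treating the number of eval steps as the sample count divided by the total batch size, rounded up, is an assumption.

```python
import math

# Values copied from the eval metrics above.
rewards_chosen = -0.1044
rewards_rejected = -0.2494
eval_samples = 2000
eval_steps_per_second = 3.859

# From the model card hyperparameters: 8 per device x 8 devices.
total_eval_batch_size = 64

# Margin is chosen minus rejected; averages obey the same identity.
margin = rewards_chosen - rewards_rejected
# Assumption: the last partial batch still counts as one step.
eval_steps = math.ceil(eval_samples / total_eval_batch_size)
approx_runtime_s = eval_steps / eval_steps_per_second

print(f"margin         = {margin:.4f}")
print(f"eval steps     = {eval_steps}")
print(f"approx runtime = {approx_runtime_s:.2f} s")
```

A positive margin together with an accuracy above 0.5 indicates that the model ranks chosen responses above rejected ones more often than not, even though both average rewards are negative.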
426
+ 2024-04-26 16:10:27 - INFO - __main__ - *** Save model ***
427
+ [INFO|modeling_utils.py:2598] 2024-04-26 16:10:37,122 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./model.safetensors.index.json.
428
+ [INFO|tokenization_utils_base.py:2488] 2024-04-26 16:10:37,133 >> tokenizer config file saved in ./tokenizer_config.json
429
+ [INFO|tokenization_utils_base.py:2497] 2024-04-26 16:10:37,135 >> Special tokens file saved in ./special_tokens_map.json
430
+ [INFO|trainer.py:3305] 2024-04-26 16:10:37,190 >> Saving model checkpoint to ./
431
+ [INFO|configuration_utils.py:471] 2024-04-26 16:10:37,192 >> Configuration saved in ./config.json
432
+ [INFO|configuration_utils.py:697] 2024-04-26 16:10:37,194 >> Configuration saved in ./generation_config.json
433
+ [INFO|modeling_utils.py:2598] 2024-04-26 16:10:48,100 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./model.safetensors.index.json.
434
+ [INFO|tokenization_utils_base.py:2488] 2024-04-26 16:10:48,103 >> tokenizer config file saved in ./tokenizer_config.json
435
+ [INFO|tokenization_utils_base.py:2497] 2024-04-26 16:10:48,105 >> Special tokens file saved in ./special_tokens_map.json
436
+ [INFO|modelcard.py:450] 2024-04-26 16:10:48,202 >> Dropping the following result as it does not have all the necessary fields:
437
+ {'task': {'name': 'Causal Language Modeling', 'type': 'text-generation'}}
438
+ events.out.tfevents.1714147827.ip-26-0-160-225.711598.1: 100%|██████████| 828/828 [00:00<00:00, 5.29kB/s]
439
+ events.out.tfevents.1714147034.ip-26-0-160-225.711598.0: 100%|██████████| 21.8k/21.8k [00:00<00:00, 108kB/s]
440
+ model-00001-of-00002.safetensors: 1%| | 32.0M/4.99G [00:00<02:33, 32.3MB/s]
441
+ events.out.tfevents.1714147034.ip-26-0-160-225.711598.0: 0%| | 0.00/21.8k [00:00<?, ?B/s]
442
+
443
+
444
+
445
+
446
+
447
+
448
+
449
+
450
+
451
+
452
+
wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json CHANGED
@@ -1 +1 @@
1
- {"train/loss": 0.6428, "train/grad_norm": 8.851872444152832, "train/learning_rate": 3.9507259776993954e-08, "train/rewards/chosen": -0.08884063363075256, "train/rewards/rejected": -0.22824116051197052, "train/rewards/accuracies": 0.6524999737739563, "train/rewards/margins": 0.13940051198005676, "train/logps/rejected": -409.0178527832031, "train/logps/chosen": -449.8016052246094, "train/logits/rejected": -3.1744155883789062, "train/logits/chosen": -3.221069097518921, "train/epoch": 0.8368200836820083, "train/global_step": 400, "_timestamp": 1714147683.4777088, "_runtime": 647.7768397331238, "_step": 20, "eval/loss": 0.6415477395057678, "eval/runtime": 8.7287, "eval/samples_per_second": 229.13, "eval/steps_per_second": 3.666, "eval/rewards/chosen": -0.10007989406585693, "eval/rewards/rejected": -0.24366310238838196, "eval/rewards/accuracies": 0.62109375, "eval/rewards/margins": 0.14358317852020264, "eval/logps/rejected": -428.88836669921875, "eval/logps/chosen": -433.20001220703125, "eval/logits/rejected": -3.204622507095337, "eval/logits/chosen": -3.254263401031494}
 
1
+ {"train/loss": 0.6377, "train/grad_norm": 8.862220764160156, "train/learning_rate": 6.004792024680294e-11, "train/rewards/chosen": -0.11145105212926865, "train/rewards/rejected": -0.26377159357070923, "train/rewards/accuracies": 0.6575000286102295, "train/rewards/margins": 0.15232053399085999, "train/logps/rejected": -418.4268493652344, "train/logps/chosen": -445.8639831542969, "train/logits/rejected": -3.1062843799591064, "train/logits/chosen": -3.1527392864227295, "train/epoch": 1.0, "train/global_step": 478, "_timestamp": 1714147827.4227393, "_runtime": 791.7218701839447, "_step": 25, "eval/loss": 0.6412035226821899, "eval/runtime": 8.2917, "eval/samples_per_second": 241.204, "eval/steps_per_second": 3.859, "eval/rewards/chosen": -0.10438449680805206, "eval/rewards/rejected": -0.24936166405677795, "eval/rewards/accuracies": 0.64453125, "eval/rewards/margins": 0.14497718214988708, "eval/logps/rejected": -429.45819091796875, "eval/logps/chosen": -433.63043212890625, "eval/logits/rejected": -3.204742431640625, "eval/logits/chosen": -3.25435733795166, "train_runtime": 784.6622, "train_samples_per_second": 77.913, "train_steps_per_second": 0.609, "total_flos": 0.0, "train_loss": 0.6571792745689967}
wandb/run-20240426_155715-vljs8xzo/logs/debug-internal.log CHANGED
@@ -747,3 +747,208 @@
747
  2024-04-26 16:08:23,064 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
748
  2024-04-26 16:08:24,187 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
749
  2024-04-26 16:08:29,188 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
750
+ 2024-04-26 16:08:33,026 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
751
+ 2024-04-26 16:08:34,871 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
752
+ 2024-04-26 16:08:37,030 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
753
+ 2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
754
+ 2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
755
+ 2024-04-26 16:08:37,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
756
+ 2024-04-26 16:08:39,032 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
757
+ 2024-04-26 16:08:39,996 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
758
+ 2024-04-26 16:08:41,034 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
759
+ 2024-04-26 16:08:43,038 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
760
+ 2024-04-26 16:08:45,040 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
761
+ 2024-04-26 16:08:45,420 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
762
+ 2024-04-26 16:08:46,337 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
763
+ 2024-04-26 16:08:47,042 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
764
+ 2024-04-26 16:08:49,044 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
765
+ 2024-04-26 16:08:50,801 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
766
+ 2024-04-26 16:08:51,045 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
767
+ 2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
768
+ 2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
769
+ 2024-04-26 16:08:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
770
+ 2024-04-26 16:08:53,051 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
771
+ 2024-04-26 16:08:55,053 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
772
+ 2024-04-26 16:08:56,161 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
773
+ 2024-04-26 16:08:57,055 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
774
+ 2024-04-26 16:08:59,057 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
775
+ 2024-04-26 16:09:01,059 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
776
+ 2024-04-26 16:09:01,568 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
777
+ 2024-04-26 16:09:03,060 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
778
+ 2024-04-26 16:09:05,062 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
779
+ 2024-04-26 16:09:06,912 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
780
+ 2024-04-26 16:09:07,057 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
781
+ 2024-04-26 16:09:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
782
+ 2024-04-26 16:09:07,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
783
+ 2024-04-26 16:09:07,064 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
784
+ 2024-04-26 16:09:07,332 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
785
+ 2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send():379] send: history
786
+ 2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
787
+ 2024-04-26 16:09:07,335 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
788
+ 2024-04-26 16:09:08,066 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
789
+ 2024-04-26 16:09:09,067 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
790
+ 2024-04-26 16:09:11,069 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
791
+ 2024-04-26 16:09:12,304 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
792
+ 2024-04-26 16:09:13,071 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
793
+ 2024-04-26 16:09:15,073 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
794
+ 2024-04-26 16:09:16,339 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
795
+ 2024-04-26 16:09:17,076 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
796
+ 2024-04-26 16:09:17,760 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
797
+ 2024-04-26 16:09:19,077 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
798
+ 2024-04-26 16:09:21,079 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
799
+ 2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
800
+ 2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
801
+ 2024-04-26 16:09:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
802
+ 2024-04-26 16:09:23,081 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
803
+ 2024-04-26 16:09:23,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
804
+ 2024-04-26 16:09:25,083 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
805
+ 2024-04-26 16:09:27,085 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
806
+ 2024-04-26 16:09:28,623 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
807
+ 2024-04-26 16:09:29,087 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
808
+ 2024-04-26 16:09:31,089 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
809
+ 2024-04-26 16:09:33,091 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
810
+ 2024-04-26 16:09:34,019 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
811
+ 2024-04-26 16:09:35,093 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
812
+ 2024-04-26 16:09:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
813
+ 2024-04-26 16:09:37,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
814
+ 2024-04-26 16:09:37,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
815
+ 2024-04-26 16:09:37,095 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
816
+ 2024-04-26 16:09:39,097 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
817
+ 2024-04-26 16:09:39,390 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
818
+ 2024-04-26 16:09:41,099 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
819
+ 2024-04-26 16:09:41,173 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
820
+ 2024-04-26 16:09:41,175 DEBUG SenderThread:713738 [sender.py:send():379] send: history
821
+ 2024-04-26 16:09:41,176 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
822
+ 2024-04-26 16:09:41,177 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
823
+ 2024-04-26 16:09:42,101 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
824
+ 2024-04-26 16:09:43,102 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
825
+ 2024-04-26 16:09:44,810 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
826
+ 2024-04-26 16:09:45,104 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
827
+ 2024-04-26 16:09:46,340 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
828
+ 2024-04-26 16:09:47,106 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
829
+ 2024-04-26 16:09:49,109 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
830
+ 2024-04-26 16:09:50,192 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
831
+ 2024-04-26 16:09:51,111 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
832
+ 2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
833
+ 2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
834
+ 2024-04-26 16:09:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
835
+ 2024-04-26 16:09:53,112 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
836
+ 2024-04-26 16:09:55,115 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
837
+ 2024-04-26 16:09:55,629 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
838
+ 2024-04-26 16:09:57,117 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
839
+ 2024-04-26 16:09:59,119 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
840
+ 2024-04-26 16:10:01,005 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
841
+ 2024-04-26 16:10:01,121 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
842
+ 2024-04-26 16:10:03,123 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
843
+ 2024-04-26 16:10:05,125 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
844
+ 2024-04-26 16:10:06,489 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
845
+ 2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
846
+ 2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
847
+ 2024-04-26 16:10:07,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
848
+ 2024-04-26 16:10:07,127 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
849
+ 2024-04-26 16:10:09,129 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
850
+ 2024-04-26 16:10:11,132 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
851
+ 2024-04-26 16:10:11,931 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
852
+ 2024-04-26 16:10:13,134 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
853
+ 2024-04-26 16:10:15,038 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
854
+ 2024-04-26 16:10:15,039 DEBUG SenderThread:713738 [sender.py:send():379] send: history
855
+ 2024-04-26 16:10:15,040 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
856
+ 2024-04-26 16:10:15,041 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
857
+ 2024-04-26 16:10:15,136 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
858
+ 2024-04-26 16:10:15,137 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
859
+ 2024-04-26 16:10:16,342 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
860
+ 2024-04-26 16:10:17,138 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
861
+ 2024-04-26 16:10:17,356 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
862
+ 2024-04-26 16:10:19,091 DEBUG SenderThread:713738 [sender.py:send():379] send: telemetry
863
+ 2024-04-26 16:10:19,093 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
864
+ 2024-04-26 16:10:19,094 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
865
+ 2024-04-26 16:10:19,097 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
866
+ 2024-04-26 16:10:19,098 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
867
+ 2024-04-26 16:10:19,099 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
868
+ 2024-04-26 16:10:19,099 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
869
+ 2024-04-26 16:10:19,101 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
870
+ 2024-04-26 16:10:19,101 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
871
+ 2024-04-26 16:10:19,102 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
872
+ 2024-04-26 16:10:19,103 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
873
+ 2024-04-26 16:10:19,104 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
874
+ 2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send():379] send: history
875
+ 2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
876
+ 2024-04-26 16:10:19,106 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
877
+ 2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
878
+ 2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
879
+ 2024-04-26 16:10:21,143 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
880
+ 2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
881
+ 2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
882
+ 2024-04-26 16:10:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
883
+ 2024-04-26 16:10:22,467 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
884
+ 2024-04-26 16:10:23,145 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
885
+ 2024-04-26 16:10:25,147 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
886
+ 2024-04-26 16:10:27,149 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
887
+ 2024-04-26 16:10:27,423 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
888
+ 2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send():379] send: history
889
+ 2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
890
+ 2024-04-26 16:10:27,429 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
891
+ 2024-04-26 16:10:28,151 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
892
+ 2024-04-26 16:10:28,437 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
893
+ 2024-04-26 16:10:29,153 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
894
+ 2024-04-26 16:10:29,154 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/config.yaml
895
+ 2024-04-26 16:10:33,569 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
896
+ 2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
897
+ 2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
898
+ 2024-04-26 16:10:37,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
899
+ 2024-04-26 16:10:39,164 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
900
+ 2024-04-26 16:10:39,263 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
901
+ 2024-04-26 16:10:44,264 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
902
+ 2024-04-26 16:10:46,345 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
903
+ 2024-04-26 16:10:49,174 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
904
+ 2024-04-26 16:10:50,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
905
+ 2024-04-26 16:10:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
906
+ 2024-04-26 16:10:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
907
+ 2024-04-26 16:10:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
908
+ 2024-04-26 16:10:55,181 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
909
+ 2024-04-26 16:10:55,370 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
910
+ 2024-04-26 16:10:57,183 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
911
+ 2024-04-26 16:10:59,185 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
912
+ 2024-04-26 16:11:00,468 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
913
+ 2024-04-26 16:11:01,187 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
914
+ 2024-04-26 16:11:03,189 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:05,191 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:05,523 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
+ 2024-04-26 16:11:07,120 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
+ 2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
+ 2024-04-26 16:11:07,194 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:09,196 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:10,641 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:13,200 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:15,202 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:15,699 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:16,347 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
+ 2024-04-26 16:11:17,204 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:19,207 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
+ 2024-04-26 16:11:21,307 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:22,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
+ 2024-04-26 16:11:22,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
+ 2024-04-26 16:11:22,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
+ 2024-04-26 16:11:27,229 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:32,323 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
+ 2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
+ 2024-04-26 16:11:37,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
+ 2024-04-26 16:11:37,584 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:42,686 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:46,350 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
+ 2024-04-26 16:11:47,930 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
+ 2024-04-26 16:11:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
+ 2024-04-26 16:11:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
+ 2024-04-26 16:11:53,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:11:58,201 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:12:03,288 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
+ 2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
+ 2024-04-26 16:12:07,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
+ 2024-04-26 16:12:08,297 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:12:13,359 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
+ 2024-04-26 16:12:16,352 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
+ 2024-04-26 16:12:18,516 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb CHANGED
Binary files a/wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb and b/wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb differ