Commit
•
0354e7d
1
Parent(s):
b703d56
Model save
Browse files- README.md +75 -0
- all_results.json +22 -0
- eval_results.json +16 -0
- generation_config.json +6 -0
- model-00001-of-00002.safetensors +3 -0
- model-00002-of-00002.safetensors +3 -0
- model.safetensors.index.json +64 -0
- runs/Apr26_15-48-43_ip-26-0-160-225/events.out.tfevents.1714147034.ip-26-0-160-225.711598.0 +2 -2
- runs/Apr26_15-48-43_ip-26-0-160-225/events.out.tfevents.1714147827.ip-26-0-160-225.711598.1 +3 -0
- train_results.json +9 -0
- trainer_state.json +394 -0
- wandb/debug-internal.log +205 -0
- wandb/run-20240426_155715-vljs8xzo/files/config.yaml +1 -0
- wandb/run-20240426_155715-vljs8xzo/files/output.log +124 -0
- wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json +1 -1
- wandb/run-20240426_155715-vljs8xzo/logs/debug-internal.log +205 -0
- wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb +0 -0
README.md
ADDED
@@ -0,0 +1,75 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model: sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat
|
3 |
+
tags:
|
4 |
+
- trl
|
5 |
+
- dpo
|
6 |
+
- generated_from_trainer
|
7 |
+
model-index:
|
8 |
+
- name: sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat
|
9 |
+
results: []
|
10 |
+
---
|
11 |
+
|
12 |
+
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
13 |
+
should probably proofread and complete it, then remove this comment. -->
|
14 |
+
|
15 |
+
# sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat
|
16 |
+
|
17 |
+
This model is a fine-tuned version of [sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat](https://huggingface.co/sanchit-gandhi/distil-zephyr-1.5b-ssft-ultrachat) on an unknown dataset.
|
18 |
+
It achieves the following results on the evaluation set:
|
19 |
+
- Loss: 0.6412
|
20 |
+
- Rewards/chosen: -0.1044
|
21 |
+
- Rewards/rejected: -0.2494
|
22 |
+
- Rewards/accuracies: 0.6445
|
23 |
+
- Rewards/margins: 0.1450
|
24 |
+
- Logps/rejected: -429.4582
|
25 |
+
- Logps/chosen: -433.6304
|
26 |
+
- Logits/rejected: -3.2047
|
27 |
+
- Logits/chosen: -3.2544
|
28 |
+
|
29 |
+
## Model description
|
30 |
+
|
31 |
+
More information needed
|
32 |
+
|
33 |
+
## Intended uses & limitations
|
34 |
+
|
35 |
+
More information needed
|
36 |
+
|
37 |
+
## Training and evaluation data
|
38 |
+
|
39 |
+
More information needed
|
40 |
+
|
41 |
+
## Training procedure
|
42 |
+
|
43 |
+
### Training hyperparameters
|
44 |
+
|
45 |
+
The following hyperparameters were used during training:
|
46 |
+
- learning_rate: 5e-07
|
47 |
+
- train_batch_size: 8
|
48 |
+
- eval_batch_size: 8
|
49 |
+
- seed: 42
|
50 |
+
- distributed_type: multi-GPU
|
51 |
+
- num_devices: 8
|
52 |
+
- gradient_accumulation_steps: 2
|
53 |
+
- total_train_batch_size: 128
|
54 |
+
- total_eval_batch_size: 64
|
55 |
+
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
56 |
+
- lr_scheduler_type: cosine
|
57 |
+
- lr_scheduler_warmup_ratio: 0.1
|
58 |
+
- num_epochs: 1
|
59 |
+
|
60 |
+
### Training results
|
61 |
+
|
62 |
+
| Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
|
63 |
+
|:-------------:|:------:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
|
64 |
+
| 0.6795 | 0.2092 | 100 | 0.6759 | 0.0017 | -0.0328 | 0.6289 | 0.0345 | -407.8037 | -423.0197 | -3.2565 | -3.3136 |
|
65 |
+
| 0.6584 | 0.4184 | 200 | 0.6534 | -0.0666 | -0.1617 | 0.6445 | 0.0951 | -420.6952 | -429.8561 | -3.2240 | -3.2768 |
|
66 |
+
| 0.6494 | 0.6276 | 300 | 0.6438 | -0.1077 | -0.2410 | 0.6211 | 0.1333 | -428.6237 | -433.9640 | -3.2050 | -3.2553 |
|
67 |
+
| 0.6428 | 0.8368 | 400 | 0.6415 | -0.1001 | -0.2437 | 0.6211 | 0.1436 | -428.8884 | -433.2000 | -3.2046 | -3.2543 |
|
68 |
+
|
69 |
+
|
70 |
+
### Framework versions
|
71 |
+
|
72 |
+
- Transformers 4.40.1
|
73 |
+
- Pytorch 2.2.2+cu121
|
74 |
+
- Datasets 2.19.0
|
75 |
+
- Tokenizers 0.19.1
|
all_results.json
ADDED
@@ -0,0 +1,22 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"epoch": 1.0,
|
3 |
+
"eval_logits/chosen": -3.25435733795166,
|
4 |
+
"eval_logits/rejected": -3.204742431640625,
|
5 |
+
"eval_logps/chosen": -433.63043212890625,
|
6 |
+
"eval_logps/rejected": -429.45819091796875,
|
7 |
+
"eval_loss": 0.6412035226821899,
|
8 |
+
"eval_rewards/accuracies": 0.64453125,
|
9 |
+
"eval_rewards/chosen": -0.10438449680805206,
|
10 |
+
"eval_rewards/margins": 0.14497718214988708,
|
11 |
+
"eval_rewards/rejected": -0.24936166405677795,
|
12 |
+
"eval_runtime": 8.2917,
|
13 |
+
"eval_samples": 2000,
|
14 |
+
"eval_samples_per_second": 241.204,
|
15 |
+
"eval_steps_per_second": 3.859,
|
16 |
+
"total_flos": 0.0,
|
17 |
+
"train_loss": 0.6571792745689967,
|
18 |
+
"train_runtime": 784.6622,
|
19 |
+
"train_samples": 61135,
|
20 |
+
"train_samples_per_second": 77.913,
|
21 |
+
"train_steps_per_second": 0.609
|
22 |
+
}
|
eval_results.json
ADDED
@@ -0,0 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"epoch": 1.0,
|
3 |
+
"eval_logits/chosen": -3.25435733795166,
|
4 |
+
"eval_logits/rejected": -3.204742431640625,
|
5 |
+
"eval_logps/chosen": -433.63043212890625,
|
6 |
+
"eval_logps/rejected": -429.45819091796875,
|
7 |
+
"eval_loss": 0.6412035226821899,
|
8 |
+
"eval_rewards/accuracies": 0.64453125,
|
9 |
+
"eval_rewards/chosen": -0.10438449680805206,
|
10 |
+
"eval_rewards/margins": 0.14497718214988708,
|
11 |
+
"eval_rewards/rejected": -0.24936166405677795,
|
12 |
+
"eval_runtime": 8.2917,
|
13 |
+
"eval_samples": 2000,
|
14 |
+
"eval_samples_per_second": 241.204,
|
15 |
+
"eval_steps_per_second": 3.859
|
16 |
+
}
|
generation_config.json
ADDED
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"_from_model_config": true,
|
3 |
+
"bos_token_id": 1,
|
4 |
+
"eos_token_id": 2,
|
5 |
+
"transformers_version": "4.40.1"
|
6 |
+
}
|
model-00001-of-00002.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6d1c7b079c6fe16b8306f4ad14c51991d2f4687c5913f4c69aaca2a0348cc737
|
3 |
+
size 4987196936
|
model-00002-of-00002.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9629c04e2da5a942f500f775442430cc7cbc755c6e95af6b26a3c7de9af646e9
|
3 |
+
size 1296089984
|
model.safetensors.index.json
ADDED
@@ -0,0 +1,64 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"metadata": {
|
3 |
+
"total_size": 6283280384
|
4 |
+
},
|
5 |
+
"weight_map": {
|
6 |
+
"lm_head.weight": "model-00002-of-00002.safetensors",
|
7 |
+
"model.embed_tokens.weight": "model-00001-of-00002.safetensors",
|
8 |
+
"model.layers.0.input_layernorm.weight": "model-00001-of-00002.safetensors",
|
9 |
+
"model.layers.0.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
|
10 |
+
"model.layers.0.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
|
11 |
+
"model.layers.0.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
|
12 |
+
"model.layers.0.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
|
13 |
+
"model.layers.0.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
|
14 |
+
"model.layers.0.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
|
15 |
+
"model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
|
16 |
+
"model.layers.0.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
|
17 |
+
"model.layers.1.input_layernorm.weight": "model-00001-of-00002.safetensors",
|
18 |
+
"model.layers.1.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
|
19 |
+
"model.layers.1.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
|
20 |
+
"model.layers.1.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
|
21 |
+
"model.layers.1.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
|
22 |
+
"model.layers.1.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
|
23 |
+
"model.layers.1.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
|
24 |
+
"model.layers.1.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
|
25 |
+
"model.layers.1.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
|
26 |
+
"model.layers.2.input_layernorm.weight": "model-00001-of-00002.safetensors",
|
27 |
+
"model.layers.2.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
|
28 |
+
"model.layers.2.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
|
29 |
+
"model.layers.2.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
|
30 |
+
"model.layers.2.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
|
31 |
+
"model.layers.2.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
|
32 |
+
"model.layers.2.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
|
33 |
+
"model.layers.2.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
|
34 |
+
"model.layers.2.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
|
35 |
+
"model.layers.3.input_layernorm.weight": "model-00001-of-00002.safetensors",
|
36 |
+
"model.layers.3.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
|
37 |
+
"model.layers.3.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
|
38 |
+
"model.layers.3.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
|
39 |
+
"model.layers.3.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
|
40 |
+
"model.layers.3.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
|
41 |
+
"model.layers.3.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
|
42 |
+
"model.layers.3.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
|
43 |
+
"model.layers.3.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
|
44 |
+
"model.layers.4.input_layernorm.weight": "model-00001-of-00002.safetensors",
|
45 |
+
"model.layers.4.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
|
46 |
+
"model.layers.4.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
|
47 |
+
"model.layers.4.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
|
48 |
+
"model.layers.4.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
|
49 |
+
"model.layers.4.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
|
50 |
+
"model.layers.4.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
|
51 |
+
"model.layers.4.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
|
52 |
+
"model.layers.4.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
|
53 |
+
"model.layers.5.input_layernorm.weight": "model-00002-of-00002.safetensors",
|
54 |
+
"model.layers.5.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
|
55 |
+
"model.layers.5.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
|
56 |
+
"model.layers.5.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
|
57 |
+
"model.layers.5.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
|
58 |
+
"model.layers.5.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
|
59 |
+
"model.layers.5.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
|
60 |
+
"model.layers.5.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
|
61 |
+
"model.layers.5.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
|
62 |
+
"model.norm.weight": "model-00002-of-00002.safetensors"
|
63 |
+
}
|
64 |
+
}
|
runs/Apr26_15-48-43_ip-26-0-160-225/events.out.tfevents.1714147034.ip-26-0-160-225.711598.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2f5835ae8637ba91ca29d3afbfe85b2fdbacaa6b26c7ca7683878a0ac5a110e8
|
3 |
+
size 21764
|
runs/Apr26_15-48-43_ip-26-0-160-225/events.out.tfevents.1714147827.ip-26-0-160-225.711598.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b158da38885dfc714a392467a3ae1215e3432959ad5762c3ae7cf3464549eeac
|
3 |
+
size 828
|
train_results.json
ADDED
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"epoch": 1.0,
|
3 |
+
"total_flos": 0.0,
|
4 |
+
"train_loss": 0.6571792745689967,
|
5 |
+
"train_runtime": 784.6622,
|
6 |
+
"train_samples": 61135,
|
7 |
+
"train_samples_per_second": 77.913,
|
8 |
+
"train_steps_per_second": 0.609
|
9 |
+
}
|
trainer_state.json
ADDED
@@ -0,0 +1,394 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"best_metric": null,
|
3 |
+
"best_model_checkpoint": null,
|
4 |
+
"epoch": 1.0,
|
5 |
+
"eval_steps": 100,
|
6 |
+
"global_step": 478,
|
7 |
+
"is_hyper_param_search": false,
|
8 |
+
"is_local_process_zero": true,
|
9 |
+
"is_world_process_zero": true,
|
10 |
+
"log_history": [
|
11 |
+
{
|
12 |
+
"epoch": 0.0020920502092050207,
|
13 |
+
"grad_norm": 9.710838317871094,
|
14 |
+
"learning_rate": 1.0416666666666666e-08,
|
15 |
+
"logits/chosen": -3.161454916000366,
|
16 |
+
"logits/rejected": -3.0680501461029053,
|
17 |
+
"logps/chosen": -437.20782470703125,
|
18 |
+
"logps/rejected": -343.8380432128906,
|
19 |
+
"loss": 0.6931,
|
20 |
+
"rewards/accuracies": 0.0,
|
21 |
+
"rewards/chosen": 0.0,
|
22 |
+
"rewards/margins": 0.0,
|
23 |
+
"rewards/rejected": 0.0,
|
24 |
+
"step": 1
|
25 |
+
},
|
26 |
+
{
|
27 |
+
"epoch": 0.05230125523012552,
|
28 |
+
"grad_norm": 8.52570629119873,
|
29 |
+
"learning_rate": 2.604166666666667e-07,
|
30 |
+
"logits/chosen": -3.30265212059021,
|
31 |
+
"logits/rejected": -3.2195777893066406,
|
32 |
+
"logps/chosen": -432.7400207519531,
|
33 |
+
"logps/rejected": -391.2707824707031,
|
34 |
+
"loss": 0.6931,
|
35 |
+
"rewards/accuracies": 0.5234375,
|
36 |
+
"rewards/chosen": 0.0006945470231585205,
|
37 |
+
"rewards/margins": 0.0007254942320287228,
|
38 |
+
"rewards/rejected": -3.094731437158771e-05,
|
39 |
+
"step": 25
|
40 |
+
},
|
41 |
+
{
|
42 |
+
"epoch": 0.10460251046025104,
|
43 |
+
"grad_norm": 9.973247528076172,
|
44 |
+
"learning_rate": 4.999733114418725e-07,
|
45 |
+
"logits/chosen": -3.299149751663208,
|
46 |
+
"logits/rejected": -3.2438504695892334,
|
47 |
+
"logps/chosen": -446.8968811035156,
|
48 |
+
"logps/rejected": -404.587158203125,
|
49 |
+
"loss": 0.6914,
|
50 |
+
"rewards/accuracies": 0.5799999833106995,
|
51 |
+
"rewards/chosen": 0.005268932785838842,
|
52 |
+
"rewards/margins": 0.002896952675655484,
|
53 |
+
"rewards/rejected": 0.002371980343014002,
|
54 |
+
"step": 50
|
55 |
+
},
|
56 |
+
{
|
57 |
+
"epoch": 0.15690376569037656,
|
58 |
+
"grad_norm": 9.357086181640625,
|
59 |
+
"learning_rate": 4.951516761176343e-07,
|
60 |
+
"logits/chosen": -3.302896738052368,
|
61 |
+
"logits/rejected": -3.225714683532715,
|
62 |
+
"logps/chosen": -480.6836853027344,
|
63 |
+
"logps/rejected": -436.4853515625,
|
64 |
+
"loss": 0.6871,
|
65 |
+
"rewards/accuracies": 0.5975000262260437,
|
66 |
+
"rewards/chosen": 0.013074235059320927,
|
67 |
+
"rewards/margins": 0.012715624645352364,
|
68 |
+
"rewards/rejected": 0.0003586093371268362,
|
69 |
+
"step": 75
|
70 |
+
},
|
71 |
+
{
|
72 |
+
"epoch": 0.20920502092050208,
|
73 |
+
"grad_norm": 8.75236988067627,
|
74 |
+
"learning_rate": 4.821741763807186e-07,
|
75 |
+
"logits/chosen": -3.2598507404327393,
|
76 |
+
"logits/rejected": -3.2092175483703613,
|
77 |
+
"logps/chosen": -426.1629638671875,
|
78 |
+
"logps/rejected": -389.50421142578125,
|
79 |
+
"loss": 0.6795,
|
80 |
+
"rewards/accuracies": 0.612500011920929,
|
81 |
+
"rewards/chosen": 0.003723819274455309,
|
82 |
+
"rewards/margins": 0.02804265171289444,
|
83 |
+
"rewards/rejected": -0.02431883104145527,
|
84 |
+
"step": 100
|
85 |
+
},
|
86 |
+
{
|
87 |
+
"epoch": 0.20920502092050208,
|
88 |
+
"eval_logits/chosen": -3.313567638397217,
|
89 |
+
"eval_logits/rejected": -3.2565112113952637,
|
90 |
+
"eval_logps/chosen": -423.0196533203125,
|
91 |
+
"eval_logps/rejected": -407.8036804199219,
|
92 |
+
"eval_loss": 0.6759119629859924,
|
93 |
+
"eval_rewards/accuracies": 0.62890625,
|
94 |
+
"eval_rewards/chosen": 0.0017230990342795849,
|
95 |
+
"eval_rewards/margins": 0.0345395989716053,
|
96 |
+
"eval_rewards/rejected": -0.03281649947166443,
|
97 |
+
"eval_runtime": 8.2733,
|
98 |
+
"eval_samples_per_second": 241.742,
|
99 |
+
"eval_steps_per_second": 3.868,
|
100 |
+
"step": 100
|
101 |
+
},
|
102 |
+
{
|
103 |
+
"epoch": 0.2615062761506276,
|
104 |
+
"grad_norm": 8.889654159545898,
|
105 |
+
"learning_rate": 4.614725560802639e-07,
|
106 |
+
"logits/chosen": -3.2867064476013184,
|
107 |
+
"logits/rejected": -3.2023117542266846,
|
108 |
+
"logps/chosen": -430.924560546875,
|
109 |
+
"logps/rejected": -382.1640319824219,
|
110 |
+
"loss": 0.6731,
|
111 |
+
"rewards/accuracies": 0.6474999785423279,
|
112 |
+
"rewards/chosen": -0.01004675030708313,
|
113 |
+
"rewards/margins": 0.04363078624010086,
|
114 |
+
"rewards/rejected": -0.05367753654718399,
|
115 |
+
"step": 125
|
116 |
+
},
|
117 |
+
{
|
118 |
+
"epoch": 0.3138075313807531,
|
119 |
+
"grad_norm": 9.882610321044922,
|
120 |
+
"learning_rate": 4.337355301007335e-07,
|
121 |
+
"logits/chosen": -3.222916841506958,
|
122 |
+
"logits/rejected": -3.1899585723876953,
|
123 |
+
"logps/chosen": -443.3583679199219,
|
124 |
+
"logps/rejected": -414.344970703125,
|
125 |
+
"loss": 0.667,
|
126 |
+
"rewards/accuracies": 0.637499988079071,
|
127 |
+
"rewards/chosen": -0.017106110230088234,
|
128 |
+
"rewards/margins": 0.06052257865667343,
|
129 |
+
"rewards/rejected": -0.07762870192527771,
|
130 |
+
"step": 150
|
131 |
+
},
|
132 |
+
{
|
133 |
+
"epoch": 0.36610878661087864,
|
134 |
+
"grad_norm": 9.968969345092773,
|
135 |
+
"learning_rate": 3.9988587174999306e-07,
|
136 |
+
"logits/chosen": -3.2201168537139893,
|
137 |
+
"logits/rejected": -3.131910800933838,
|
138 |
+
"logps/chosen": -477.7035217285156,
|
139 |
+
"logps/rejected": -405.7104797363281,
|
140 |
+
"loss": 0.6574,
|
141 |
+
"rewards/accuracies": 0.6399999856948853,
|
142 |
+
"rewards/chosen": -0.03587143123149872,
|
143 |
+
"rewards/margins": 0.09045815467834473,
|
144 |
+
"rewards/rejected": -0.12632958590984344,
|
145 |
+
"step": 175
|
146 |
+
},
|
147 |
+
{
|
148 |
+
"epoch": 0.41841004184100417,
|
149 |
+
"grad_norm": 9.230700492858887,
|
150 |
+
"learning_rate": 3.610497133404795e-07,
|
151 |
+
"logits/chosen": -3.23995304107666,
|
152 |
+
"logits/rejected": -3.1550867557525635,
|
153 |
+
"logps/chosen": -431.6617126464844,
|
154 |
+
"logps/rejected": -396.62774658203125,
|
155 |
+
"loss": 0.6584,
|
156 |
+
"rewards/accuracies": 0.6349999904632568,
|
157 |
+
"rewards/chosen": -0.06991340965032578,
|
158 |
+
"rewards/margins": 0.08443903923034668,
|
159 |
+
"rewards/rejected": -0.15435244143009186,
|
160 |
+
"step": 200
|
161 |
+
},
|
162 |
+
{
|
163 |
+
"epoch": 0.41841004184100417,
|
164 |
+
"eval_logits/chosen": -3.2767982482910156,
|
165 |
+
"eval_logits/rejected": -3.2240023612976074,
|
166 |
+
"eval_logps/chosen": -429.85614013671875,
|
167 |
+
"eval_logps/rejected": -420.6952209472656,
|
168 |
+
"eval_loss": 0.6533502340316772,
|
169 |
+
"eval_rewards/accuracies": 0.64453125,
|
170 |
+
"eval_rewards/chosen": -0.06664139777421951,
|
171 |
+
"eval_rewards/margins": 0.09509073942899704,
|
172 |
+
"eval_rewards/rejected": -0.16173213720321655,
|
173 |
+
"eval_runtime": 8.2763,
|
174 |
+
"eval_samples_per_second": 241.653,
|
175 |
+
"eval_steps_per_second": 3.866,
|
176 |
+
"step": 200
|
177 |
+
},
|
178 |
+
{
|
179 |
+
"epoch": 0.4707112970711297,
|
180 |
+
"grad_norm": 10.79430103302002,
|
181 |
+
"learning_rate": 3.185190812915646e-07,
|
182 |
+
"logits/chosen": -3.1671783924102783,
|
183 |
+
"logits/rejected": -3.118861436843872,
|
184 |
+
"logps/chosen": -446.6968994140625,
|
185 |
+
"logps/rejected": -410.1864013671875,
|
186 |
+
"loss": 0.6518,
|
187 |
+
"rewards/accuracies": 0.6449999809265137,
|
188 |
+
"rewards/chosen": -0.08074235171079636,
|
189 |
+
"rewards/margins": 0.10100732743740082,
|
190 |
+
"rewards/rejected": -0.18174967169761658,
|
191 |
+
"step": 225
|
192 |
+
},
|
193 |
+
{
|
194 |
+
"epoch": 0.5230125523012552,
|
195 |
+
"grad_norm": 10.086767196655273,
|
196 |
+
"learning_rate": 2.7370891215954565e-07,
|
197 |
+
"logits/chosen": -3.1980080604553223,
|
198 |
+
"logits/rejected": -3.1623446941375732,
|
199 |
+
"logps/chosen": -440.4261474609375,
|
200 |
+
"logps/rejected": -439.9627685546875,
|
201 |
+
"loss": 0.6476,
|
202 |
+
"rewards/accuracies": 0.637499988079071,
|
203 |
+
"rewards/chosen": -0.07635506987571716,
|
204 |
+
"rewards/margins": 0.11709018051624298,
|
205 |
+
"rewards/rejected": -0.19344525039196014,
|
206 |
+
"step": 250
|
207 |
+
},
|
208 |
+
{
|
209 |
+
"epoch": 0.5753138075313807,
|
210 |
+
"grad_norm": 9.709197044372559,
|
211 |
+
"learning_rate": 2.2810997961375938e-07,
|
212 |
+
"logits/chosen": -3.216128349304199,
|
213 |
+
"logits/rejected": -3.1448960304260254,
|
214 |
+
"logps/chosen": -425.5062561035156,
|
215 |
+
"logps/rejected": -391.5084228515625,
|
216 |
+
"loss": 0.6461,
|
217 |
+
"rewards/accuracies": 0.6800000071525574,
|
218 |
+
"rewards/chosen": -0.072402223944664,
|
219 |
+
"rewards/margins": 0.1306913197040558,
|
220 |
+
"rewards/rejected": -0.2030935436487198,
|
221 |
+
"step": 275
|
222 |
+
},
|
223 |
+
{
|
224 |
+
"epoch": 0.6276150627615062,
|
225 |
+
"grad_norm": 10.980072975158691,
|
226 |
+
"learning_rate": 1.8323929841460178e-07,
|
227 |
+
"logits/chosen": -3.2006824016571045,
|
228 |
+
"logits/rejected": -3.1383109092712402,
|
229 |
+
"logps/chosen": -443.0799865722656,
|
230 |
+
"logps/rejected": -436.2774658203125,
|
231 |
+
"loss": 0.6494,
|
232 |
+
"rewards/accuracies": 0.625,
|
233 |
+
"rewards/chosen": -0.1245601624250412,
|
234 |
+
"rewards/margins": 0.10581608861684799,
|
235 |
+
"rewards/rejected": -0.2303762584924698,
|
236 |
+
"step": 300
|
237 |
+
},
|
238 |
+
{
|
239 |
+
"epoch": 0.6276150627615062,
|
240 |
+
"eval_logits/chosen": -3.2553329467773438,
|
241 |
+
"eval_logits/rejected": -3.2049574851989746,
|
242 |
+
"eval_logps/chosen": -433.9639892578125,
|
243 |
+
"eval_logps/rejected": -428.6236572265625,
|
244 |
+
"eval_loss": 0.6438009142875671,
|
245 |
+
"eval_rewards/accuracies": 0.62109375,
|
246 |
+
"eval_rewards/chosen": -0.10771973431110382,
|
247 |
+
"eval_rewards/margins": 0.13329659402370453,
|
248 |
+
"eval_rewards/rejected": -0.24101632833480835,
|
249 |
+
"eval_runtime": 8.3559,
|
250 |
+
"eval_samples_per_second": 239.351,
|
251 |
+
"eval_steps_per_second": 3.83,
|
252 |
+
"step": 300
|
253 |
+
},
|
254 |
+
{
|
255 |
+
"epoch": 0.6799163179916318,
|
256 |
+
"grad_norm": 9.95361614227295,
|
257 |
+
"learning_rate": 1.4058965538597032e-07,
|
258 |
+
"logits/chosen": -3.2326276302337646,
|
259 |
+
"logits/rejected": -3.198971748352051,
|
260 |
+
"logps/chosen": -443.37371826171875,
|
261 |
+
"logps/rejected": -433.8561706542969,
|
262 |
+
"loss": 0.6392,
|
263 |
+
"rewards/accuracies": 0.6399999856948853,
|
264 |
+
"rewards/chosen": -0.11981040984392166,
|
265 |
+
"rewards/margins": 0.12150833755731583,
|
266 |
+
"rewards/rejected": -0.2413187474012375,
|
267 |
+
"step": 325
|
268 |
+
},
|
269 |
+
{
|
270 |
+
"epoch": 0.7322175732217573,
|
271 |
+
"grad_norm": 10.580938339233398,
|
272 |
+
"learning_rate": 1.0157994641835734e-07,
|
273 |
+
"logits/chosen": -3.1840951442718506,
|
274 |
+
"logits/rejected": -3.0981223583221436,
|
275 |
+
"logps/chosen": -430.7939453125,
|
276 |
+
"logps/rejected": -403.5060119628906,
|
277 |
+
"loss": 0.6447,
|
278 |
+
"rewards/accuracies": 0.6225000023841858,
|
279 |
+
"rewards/chosen": -0.15974169969558716,
|
280 |
+
"rewards/margins": 0.09658970683813095,
|
281 |
+
"rewards/rejected": -0.2563314139842987,
|
282 |
+
"step": 350
|
283 |
+
},
|
284 |
+
{
|
285 |
+
"epoch": 0.7845188284518828,
|
286 |
+
"grad_norm": 8.326279640197754,
|
287 |
+
"learning_rate": 6.75079717232744e-08,
|
288 |
+
"logits/chosen": -3.2011187076568604,
|
289 |
+
"logits/rejected": -3.0969460010528564,
|
290 |
+
"logps/chosen": -465.22344970703125,
|
291 |
+
"logps/rejected": -392.5793762207031,
|
292 |
+
"loss": 0.6411,
|
293 |
+
"rewards/accuracies": 0.6549999713897705,
|
294 |
+
"rewards/chosen": -0.11683624982833862,
|
295 |
+
"rewards/margins": 0.14286838471889496,
|
296 |
+
"rewards/rejected": -0.2597046196460724,
|
297 |
+
"step": 375
|
298 |
+
},
|
299 |
+
{
|
300 |
+
"epoch": 0.8368200836820083,
|
301 |
+
"grad_norm": 8.851872444152832,
|
302 |
+
"learning_rate": 3.9507259776993954e-08,
|
303 |
+
"logits/chosen": -3.221069097518921,
|
304 |
+
"logits/rejected": -3.1744155883789062,
|
305 |
+
"logps/chosen": -449.8016052246094,
|
306 |
+
"logps/rejected": -409.0178527832031,
|
307 |
+
"loss": 0.6428,
|
308 |
+
"rewards/accuracies": 0.6524999737739563,
|
309 |
+
"rewards/chosen": -0.08884063363075256,
|
310 |
+
"rewards/margins": 0.13940051198005676,
|
311 |
+
"rewards/rejected": -0.22824116051197052,
|
312 |
+
"step": 400
|
313 |
+
},
|
314 |
+
{
|
315 |
+
"epoch": 0.8368200836820083,
|
316 |
+
"eval_logits/chosen": -3.254263401031494,
|
317 |
+
"eval_logits/rejected": -3.204622507095337,
|
318 |
+
"eval_logps/chosen": -433.20001220703125,
|
319 |
+
"eval_logps/rejected": -428.88836669921875,
|
320 |
+
"eval_loss": 0.6415477395057678,
|
321 |
+
"eval_rewards/accuracies": 0.62109375,
|
322 |
+
"eval_rewards/chosen": -0.10007989406585693,
|
323 |
+
"eval_rewards/margins": 0.14358317852020264,
|
324 |
+
"eval_rewards/rejected": -0.24366310238838196,
|
325 |
+
"eval_runtime": 8.7287,
|
326 |
+
"eval_samples_per_second": 229.13,
|
327 |
+
"eval_steps_per_second": 3.666,
|
328 |
+
"step": 400
|
329 |
+
},
|
330 |
+
{
|
331 |
+
"epoch": 0.8891213389121339,
|
332 |
+
"grad_norm": 10.138018608093262,
|
333 |
+
"learning_rate": 1.850935636255496e-08,
|
334 |
+
"logits/chosen": -3.1941397190093994,
|
335 |
+
"logits/rejected": -3.1335413455963135,
|
336 |
+
"logps/chosen": -463.3765869140625,
|
337 |
+
"logps/rejected": -418.0171203613281,
|
338 |
+
"loss": 0.6393,
|
339 |
+
"rewards/accuracies": 0.6399999856948853,
|
340 |
+
"rewards/chosen": -0.11999661475419998,
|
341 |
+
"rewards/margins": 0.13007262349128723,
|
342 |
+
"rewards/rejected": -0.2500692307949066,
|
343 |
+
"step": 425
|
344 |
+
},
|
345 |
+
{
|
346 |
+
"epoch": 0.9414225941422594,
|
347 |
+
"grad_norm": 9.687002182006836,
|
348 |
+
"learning_rate": 5.212833302556258e-09,
|
349 |
+
"logits/chosen": -3.202920913696289,
|
350 |
+
"logits/rejected": -3.14504337310791,
|
351 |
+
"logps/chosen": -442.259765625,
|
352 |
+
"logps/rejected": -406.7275085449219,
|
353 |
+
"loss": 0.6419,
|
354 |
+
"rewards/accuracies": 0.6050000190734863,
|
355 |
+
"rewards/chosen": -0.13094988465309143,
|
356 |
+
"rewards/margins": 0.1017264574766159,
|
357 |
+
"rewards/rejected": -0.23267632722854614,
|
358 |
+
"step": 450
|
359 |
+
},
|
360 |
+
{
|
361 |
+
"epoch": 0.9937238493723849,
|
362 |
+
"grad_norm": 8.862220764160156,
|
363 |
+
"learning_rate": 6.004792024680294e-11,
|
364 |
+
"logits/chosen": -3.1527392864227295,
|
365 |
+
"logits/rejected": -3.1062843799591064,
|
366 |
+
"logps/chosen": -445.8639831542969,
|
367 |
+
"logps/rejected": -418.4268493652344,
|
368 |
+
"loss": 0.6377,
|
369 |
+
"rewards/accuracies": 0.6575000286102295,
|
370 |
+
"rewards/chosen": -0.11145105212926865,
|
371 |
+
"rewards/margins": 0.15232053399085999,
|
372 |
+
"rewards/rejected": -0.26377159357070923,
|
373 |
+
"step": 475
|
374 |
+
},
|
375 |
+
{
|
376 |
+
"epoch": 1.0,
|
377 |
+
"step": 478,
|
378 |
+
"total_flos": 0.0,
|
379 |
+
"train_loss": 0.6571792745689967,
|
380 |
+
"train_runtime": 784.6622,
|
381 |
+
"train_samples_per_second": 77.913,
|
382 |
+
"train_steps_per_second": 0.609
|
383 |
+
}
|
384 |
+
],
|
385 |
+
"logging_steps": 25,
|
386 |
+
"max_steps": 478,
|
387 |
+
"num_input_tokens_seen": 0,
|
388 |
+
"num_train_epochs": 1,
|
389 |
+
"save_steps": 100,
|
390 |
+
"total_flos": 0.0,
|
391 |
+
"train_batch_size": 8,
|
392 |
+
"trial_name": null,
|
393 |
+
"trial_params": null
|
394 |
+
}
|
wandb/debug-internal.log
CHANGED
@@ -747,3 +747,208 @@
|
|
747 |
2024-04-26 16:08:23,064 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
748 |
2024-04-26 16:08:24,187 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
749 |
2024-04-26 16:08:29,188 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
747 |
2024-04-26 16:08:23,064 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
748 |
2024-04-26 16:08:24,187 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
749 |
2024-04-26 16:08:29,188 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
750 |
+
2024-04-26 16:08:33,026 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
751 |
+
2024-04-26 16:08:34,871 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
752 |
+
2024-04-26 16:08:37,030 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
753 |
+
2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
754 |
+
2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
755 |
+
2024-04-26 16:08:37,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
756 |
+
2024-04-26 16:08:39,032 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
757 |
+
2024-04-26 16:08:39,996 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
758 |
+
2024-04-26 16:08:41,034 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
759 |
+
2024-04-26 16:08:43,038 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
760 |
+
2024-04-26 16:08:45,040 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
761 |
+
2024-04-26 16:08:45,420 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
762 |
+
2024-04-26 16:08:46,337 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
763 |
+
2024-04-26 16:08:47,042 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
764 |
+
2024-04-26 16:08:49,044 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
765 |
+
2024-04-26 16:08:50,801 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
766 |
+
2024-04-26 16:08:51,045 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
767 |
+
2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
768 |
+
2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
769 |
+
2024-04-26 16:08:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
770 |
+
2024-04-26 16:08:53,051 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
771 |
+
2024-04-26 16:08:55,053 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
772 |
+
2024-04-26 16:08:56,161 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
773 |
+
2024-04-26 16:08:57,055 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
774 |
+
2024-04-26 16:08:59,057 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
775 |
+
2024-04-26 16:09:01,059 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
776 |
+
2024-04-26 16:09:01,568 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
777 |
+
2024-04-26 16:09:03,060 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
778 |
+
2024-04-26 16:09:05,062 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
779 |
+
2024-04-26 16:09:06,912 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
780 |
+
2024-04-26 16:09:07,057 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
781 |
+
2024-04-26 16:09:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
782 |
+
2024-04-26 16:09:07,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
783 |
+
2024-04-26 16:09:07,064 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
784 |
+
2024-04-26 16:09:07,332 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
785 |
+
2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
786 |
+
2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
787 |
+
2024-04-26 16:09:07,335 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
788 |
+
2024-04-26 16:09:08,066 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
789 |
+
2024-04-26 16:09:09,067 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
790 |
+
2024-04-26 16:09:11,069 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
791 |
+
2024-04-26 16:09:12,304 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
792 |
+
2024-04-26 16:09:13,071 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
793 |
+
2024-04-26 16:09:15,073 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
794 |
+
2024-04-26 16:09:16,339 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
795 |
+
2024-04-26 16:09:17,076 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
796 |
+
2024-04-26 16:09:17,760 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
797 |
+
2024-04-26 16:09:19,077 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
798 |
+
2024-04-26 16:09:21,079 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
799 |
+
2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
800 |
+
2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
801 |
+
2024-04-26 16:09:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
802 |
+
2024-04-26 16:09:23,081 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
803 |
+
2024-04-26 16:09:23,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
804 |
+
2024-04-26 16:09:25,083 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
805 |
+
2024-04-26 16:09:27,085 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
806 |
+
2024-04-26 16:09:28,623 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
807 |
+
2024-04-26 16:09:29,087 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
808 |
+
2024-04-26 16:09:31,089 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
809 |
+
2024-04-26 16:09:33,091 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
810 |
+
2024-04-26 16:09:34,019 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
811 |
+
2024-04-26 16:09:35,093 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
812 |
+
2024-04-26 16:09:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
813 |
+
2024-04-26 16:09:37,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
814 |
+
2024-04-26 16:09:37,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
815 |
+
2024-04-26 16:09:37,095 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
816 |
+
2024-04-26 16:09:39,097 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
817 |
+
2024-04-26 16:09:39,390 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
818 |
+
2024-04-26 16:09:41,099 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
819 |
+
2024-04-26 16:09:41,173 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
820 |
+
2024-04-26 16:09:41,175 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
821 |
+
2024-04-26 16:09:41,176 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
822 |
+
2024-04-26 16:09:41,177 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
823 |
+
2024-04-26 16:09:42,101 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
824 |
+
2024-04-26 16:09:43,102 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
825 |
+
2024-04-26 16:09:44,810 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
826 |
+
2024-04-26 16:09:45,104 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
827 |
+
2024-04-26 16:09:46,340 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
828 |
+
2024-04-26 16:09:47,106 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
829 |
+
2024-04-26 16:09:49,109 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
830 |
+
2024-04-26 16:09:50,192 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
831 |
+
2024-04-26 16:09:51,111 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
832 |
+
2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
833 |
+
2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
834 |
+
2024-04-26 16:09:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
835 |
+
2024-04-26 16:09:53,112 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
836 |
+
2024-04-26 16:09:55,115 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
837 |
+
2024-04-26 16:09:55,629 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
838 |
+
2024-04-26 16:09:57,117 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
839 |
+
2024-04-26 16:09:59,119 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
840 |
+
2024-04-26 16:10:01,005 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
841 |
+
2024-04-26 16:10:01,121 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
842 |
+
2024-04-26 16:10:03,123 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
843 |
+
2024-04-26 16:10:05,125 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
844 |
+
2024-04-26 16:10:06,489 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
845 |
+
2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
846 |
+
2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
847 |
+
2024-04-26 16:10:07,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
848 |
+
2024-04-26 16:10:07,127 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
849 |
+
2024-04-26 16:10:09,129 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
850 |
+
2024-04-26 16:10:11,132 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
851 |
+
2024-04-26 16:10:11,931 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
852 |
+
2024-04-26 16:10:13,134 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
853 |
+
2024-04-26 16:10:15,038 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
854 |
+
2024-04-26 16:10:15,039 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
855 |
+
2024-04-26 16:10:15,040 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
856 |
+
2024-04-26 16:10:15,041 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
857 |
+
2024-04-26 16:10:15,136 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
858 |
+
2024-04-26 16:10:15,137 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
859 |
+
2024-04-26 16:10:16,342 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
860 |
+
2024-04-26 16:10:17,138 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
861 |
+
2024-04-26 16:10:17,356 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
862 |
+
2024-04-26 16:10:19,091 DEBUG SenderThread:713738 [sender.py:send():379] send: telemetry
|
863 |
+
2024-04-26 16:10:19,093 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
864 |
+
2024-04-26 16:10:19,094 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
865 |
+
2024-04-26 16:10:19,097 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
866 |
+
2024-04-26 16:10:19,098 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
867 |
+
2024-04-26 16:10:19,099 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
868 |
+
2024-04-26 16:10:19,099 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
869 |
+
2024-04-26 16:10:19,101 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
870 |
+
2024-04-26 16:10:19,101 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
871 |
+
2024-04-26 16:10:19,102 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
872 |
+
2024-04-26 16:10:19,103 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
873 |
+
2024-04-26 16:10:19,104 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
874 |
+
2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
875 |
+
2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
876 |
+
2024-04-26 16:10:19,106 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
877 |
+
2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
878 |
+
2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
879 |
+
2024-04-26 16:10:21,143 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
880 |
+
2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
881 |
+
2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
882 |
+
2024-04-26 16:10:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
883 |
+
2024-04-26 16:10:22,467 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
884 |
+
2024-04-26 16:10:23,145 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
885 |
+
2024-04-26 16:10:25,147 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
886 |
+
2024-04-26 16:10:27,149 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
887 |
+
2024-04-26 16:10:27,423 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
888 |
+
2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
889 |
+
2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
890 |
+
2024-04-26 16:10:27,429 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
891 |
+
2024-04-26 16:10:28,151 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
892 |
+
2024-04-26 16:10:28,437 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
893 |
+
2024-04-26 16:10:29,153 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
894 |
+
2024-04-26 16:10:29,154 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/config.yaml
|
895 |
+
2024-04-26 16:10:33,569 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
896 |
+
2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
897 |
+
2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
898 |
+
2024-04-26 16:10:37,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
899 |
+
2024-04-26 16:10:39,164 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
900 |
+
2024-04-26 16:10:39,263 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
901 |
+
2024-04-26 16:10:44,264 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
902 |
+
2024-04-26 16:10:46,345 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
903 |
+
2024-04-26 16:10:49,174 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
904 |
+
2024-04-26 16:10:50,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
905 |
+
2024-04-26 16:10:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
906 |
+
2024-04-26 16:10:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
907 |
+
2024-04-26 16:10:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
908 |
+
2024-04-26 16:10:55,181 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
909 |
+
2024-04-26 16:10:55,370 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
910 |
+
2024-04-26 16:10:57,183 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
911 |
+
2024-04-26 16:10:59,185 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
912 |
+
2024-04-26 16:11:00,468 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
913 |
+
2024-04-26 16:11:01,187 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
914 |
+
2024-04-26 16:11:03,189 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
915 |
+
2024-04-26 16:11:05,191 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
916 |
+
2024-04-26 16:11:05,523 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
917 |
+
2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
918 |
+
2024-04-26 16:11:07,120 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
919 |
+
2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
920 |
+
2024-04-26 16:11:07,194 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
921 |
+
2024-04-26 16:11:09,196 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
922 |
+
2024-04-26 16:11:10,641 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
923 |
+
2024-04-26 16:11:13,200 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
924 |
+
2024-04-26 16:11:15,202 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
925 |
+
2024-04-26 16:11:15,699 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
926 |
+
2024-04-26 16:11:16,347 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
927 |
+
2024-04-26 16:11:17,204 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
928 |
+
2024-04-26 16:11:19,207 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
929 |
+
2024-04-26 16:11:21,307 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
930 |
+
2024-04-26 16:11:22,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
931 |
+
2024-04-26 16:11:22,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
932 |
+
2024-04-26 16:11:22,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
933 |
+
2024-04-26 16:11:27,229 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
934 |
+
2024-04-26 16:11:32,323 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
935 |
+
2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
936 |
+
2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
937 |
+
2024-04-26 16:11:37,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
938 |
+
2024-04-26 16:11:37,584 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
939 |
+
2024-04-26 16:11:42,686 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
940 |
+
2024-04-26 16:11:46,350 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
941 |
+
2024-04-26 16:11:47,930 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
942 |
+
2024-04-26 16:11:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
943 |
+
2024-04-26 16:11:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
944 |
+
2024-04-26 16:11:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
945 |
+
2024-04-26 16:11:53,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
946 |
+
2024-04-26 16:11:58,201 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
947 |
+
2024-04-26 16:12:03,288 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
948 |
+
2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
949 |
+
2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
950 |
+
2024-04-26 16:12:07,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
951 |
+
2024-04-26 16:12:08,297 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
952 |
+
2024-04-26 16:12:13,359 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
953 |
+
2024-04-26 16:12:16,352 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
954 |
+
2024-04-26 16:12:18,516 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
wandb/run-20240426_155715-vljs8xzo/files/config.yaml
CHANGED
@@ -32,6 +32,7 @@ _wandb:
|
|
32 |
3:
|
33 |
- 7
|
34 |
- 23
|
|
|
35 |
4: 3.11.9
|
36 |
5: 0.16.6
|
37 |
6: 4.40.1
|
|
|
32 |
3:
|
33 |
- 7
|
34 |
- 23
|
35 |
+
- 62
|
36 |
4: 3.11.9
|
37 |
5: 0.16.6
|
38 |
6: 4.40.1
|
wandb/run-20240426_155715-vljs8xzo/files/output.log
CHANGED
@@ -326,3 +326,127 @@
|
|
326 |
{'eval_loss': 0.6415477395057678, 'eval_runtime': 8.7287, 'eval_samples_per_second': 229.13, 'eval_steps_per_second': 3.666, 'eval_rewards/chosen': -0.10007989406585693, 'eval_rewards/rejected': -0.24366310238838196, 'eval_rewards/accuracies': 0.62109375, 'eval_rewards/margins': 0.14358317852020264, 'eval_logps/rejected': -428.88836669921875, 'eval_logps/chosen': -433.20001220703125, 'eval_logits/rejected': -3.204622507095337, 'eval_logits/chosen': -3.254263401031494, 'epoch': 0.84}
|
327 |
[INFO|modeling_utils.py:2598] 2024-04-26 16:08:13,074 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./checkpoint-400/model.safetensors.index.json.
|
328 |
[INFO|tokenization_utils_base.py:2488] 2024-04-26 16:08:13,077 >> tokenizer config file saved in ./checkpoint-400/tokenizer_config.json
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
326 |
{'eval_loss': 0.6415477395057678, 'eval_runtime': 8.7287, 'eval_samples_per_second': 229.13, 'eval_steps_per_second': 3.666, 'eval_rewards/chosen': -0.10007989406585693, 'eval_rewards/rejected': -0.24366310238838196, 'eval_rewards/accuracies': 0.62109375, 'eval_rewards/margins': 0.14358317852020264, 'eval_logps/rejected': -428.88836669921875, 'eval_logps/chosen': -433.20001220703125, 'eval_logits/rejected': -3.204622507095337, 'eval_logits/chosen': -3.254263401031494, 'epoch': 0.84}
|
327 |
[INFO|modeling_utils.py:2598] 2024-04-26 16:08:13,074 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./checkpoint-400/model.safetensors.index.json.
|
328 |
[INFO|tokenization_utils_base.py:2488] 2024-04-26 16:08:13,077 >> tokenizer config file saved in ./checkpoint-400/tokenizer_config.json
|
329 |
+
[INFO|tokenization_utils_base.py:2497] 2024-04-26 16:08:13,079 >> Special tokens file saved in ./checkpoint-400/special_tokens_map.json
|
330 |
+
[INFO|tokenization_utils_base.py:2488] 2024-04-26 16:08:31,839 >> tokenizer config file saved in ./tokenizer_config.json
|
331 |
+
[INFO|tokenization_utils_base.py:2497] 2024-04-26 16:08:31,841 >> Special tokens file saved in ./special_tokens_map.json
|
332 |
+
[INFO|trainer.py:3397] 2024-04-26 16:08:31,870 >> Deleting older checkpoint [checkpoint-300] due to args.save_total_limit
|
333 |
+
|
334 |
+
|
335 |
+
|
336 |
+
|
337 |
+
|
338 |
+
|
339 |
+
|
340 |
+
|
341 |
+
|
342 |
+
|
343 |
+
|
344 |
+
|
345 |
+
|
346 |
+
|
347 |
+
|
348 |
+
|
349 |
+
89%|████████▉ | 425/478 [11:46<01:11, 1.35s/it]
|
350 |
+
|
351 |
+
|
352 |
+
|
353 |
+
|
354 |
+
|
355 |
+
|
356 |
+
|
357 |
+
|
358 |
+
|
359 |
+
|
360 |
+
|
361 |
+
|
362 |
+
|
363 |
+
|
364 |
+
|
365 |
+
|
366 |
+
|
367 |
+
94%|█████████▍| 450/478 [12:20<00:37, 1.35s/it]
|
368 |
+
|
369 |
+
|
370 |
+
|
371 |
+
|
372 |
+
|
373 |
+
|
374 |
+
|
375 |
+
|
376 |
+
|
377 |
+
|
378 |
+
|
379 |
+
|
380 |
+
|
381 |
+
|
382 |
+
|
383 |
+
|
384 |
+
|
385 |
+
99%|█████████▉| 475/478 [12:54<00:04, 1.35s/it]
|
386 |
+
|
387 |
+
100%|██████████| 478/478 [12:58<00:00, 1.35s/it][INFO|trainer.py:2316] 2024-04-26 16:10:19,036 >>
|
388 |
+
Training completed. Do not forget to share your model on huggingface.co/models =)
|
389 |
+
100%|██████████| 478/478 [12:58<00:00, 1.63s/it]
|
390 |
+
[INFO|trainer.py:3614] 2024-04-26 16:10:19,102 >> ***** Running Evaluation *****
|
391 |
+
[INFO|trainer.py:3616] 2024-04-26 16:10:19,102 >> Num examples = 2000
|
392 |
+
[INFO|trainer.py:3619] 2024-04-26 16:10:19,102 >> Batch size = 8
|
393 |
+
12%|█▎ | 4/32 [00:00<00:05, 5.08it/s]
|
394 |
+
{'train_runtime': 784.6622, 'train_samples_per_second': 77.913, 'train_steps_per_second': 0.609, 'train_loss': 0.6571792745689967, 'epoch': 1.0}
|
395 |
+
***** train metrics *****
|
396 |
+
epoch = 1.0
|
397 |
+
total_flos = 0GF
|
398 |
+
train_loss = 0.6572
|
399 |
+
train_runtime = 0:13:04.66
|
400 |
+
train_samples = 61135
|
401 |
+
train_samples_per_second = 77.913
|
402 |
+
train_steps_per_second = 0.609
|
403 |
+
2024-04-26 16:10:19 - INFO - __main__ - *** Training complete ***
|
404 |
+
|
405 |
+
|
406 |
+
|
407 |
+
100%|██████████| 32/32 [00:08<00:00, 3.97it/s]
|
408 |
+
[INFO|trainer.py:3305] 2024-04-26 16:10:27,430 >> Saving model checkpoint to ./
|
409 |
+
[INFO|configuration_utils.py:471] 2024-04-26 16:10:27,432 >> Configuration saved in ./config.json
|
410 |
+
[INFO|configuration_utils.py:697] 2024-04-26 16:10:27,434 >> Configuration saved in ./generation_config.json
|
411 |
+
***** eval metrics *****
|
412 |
+
epoch = 1.0
|
413 |
+
eval_logits/chosen = -3.2544
|
414 |
+
eval_logits/rejected = -3.2047
|
415 |
+
eval_logps/chosen = -433.6304
|
416 |
+
eval_logps/rejected = -429.4582
|
417 |
+
eval_loss = 0.6412
|
418 |
+
eval_rewards/accuracies = 0.6445
|
419 |
+
eval_rewards/chosen = -0.1044
|
420 |
+
eval_rewards/margins = 0.145
|
421 |
+
eval_rewards/rejected = -0.2494
|
422 |
+
eval_runtime = 0:00:08.29
|
423 |
+
eval_samples = 2000
|
424 |
+
eval_samples_per_second = 241.204
|
425 |
+
eval_steps_per_second = 3.859
|
426 |
+
2024-04-26 16:10:27 - INFO - __main__ - *** Save model ***
|
427 |
+
[INFO|modeling_utils.py:2598] 2024-04-26 16:10:37,122 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./model.safetensors.index.json.
|
428 |
+
[INFO|tokenization_utils_base.py:2488] 2024-04-26 16:10:37,133 >> tokenizer config file saved in ./tokenizer_config.json
|
429 |
+
[INFO|tokenization_utils_base.py:2497] 2024-04-26 16:10:37,135 >> Special tokens file saved in ./special_tokens_map.json
|
430 |
+
[INFO|trainer.py:3305] 2024-04-26 16:10:37,190 >> Saving model checkpoint to ./
|
431 |
+
[INFO|configuration_utils.py:471] 2024-04-26 16:10:37,192 >> Configuration saved in ./config.json
|
432 |
+
[INFO|configuration_utils.py:697] 2024-04-26 16:10:37,194 >> Configuration saved in ./generation_config.json
|
433 |
+
[INFO|modeling_utils.py:2598] 2024-04-26 16:10:48,100 >> The model is bigger than the maximum size per checkpoint (5GB) and is going to be split in 2 checkpoint shards. You can find where each parameters has been saved in the index located at ./model.safetensors.index.json.
|
434 |
+
[INFO|tokenization_utils_base.py:2488] 2024-04-26 16:10:48,103 >> tokenizer config file saved in ./tokenizer_config.json
|
435 |
+
[INFO|tokenization_utils_base.py:2497] 2024-04-26 16:10:48,105 >> Special tokens file saved in ./special_tokens_map.json
|
436 |
+
[INFO|modelcard.py:450] 2024-04-26 16:10:48,202 >> Dropping the following result as it does not have all the necessary fields:
|
437 |
+
{'task': {'name': 'Causal Language Modeling', 'type': 'text-generation'}}
|
438 |
+
events.out.tfevents.1714147827.ip-26-0-160-225.711598.1: 100%|██████████| 828/828 [00:00<00:00, 5.29kB/s]
|
439 |
+
events.out.tfevents.1714147034.ip-26-0-160-225.711598.0: 100%|██████████| 21.8k/21.8k [00:00<00:00, 108kB/s]
|
440 |
+
model-00001-of-00002.safetensors: 1%| | 32.0M/4.99G [00:00<02:33, 32.3MB/s]
|
441 |
+
events.out.tfevents.1714147034.ip-26-0-160-225.711598.0: 0%| | 0.00/21.8k [00:00<?, ?B/s]
|
442 |
+
|
443 |
+
|
444 |
+
|
445 |
+
|
446 |
+
|
447 |
+
|
448 |
+
|
449 |
+
|
450 |
+
|
451 |
+
|
452 |
+
|
wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"train/loss": 0.
|
|
|
1 |
+
{"train/loss": 0.6377, "train/grad_norm": 8.862220764160156, "train/learning_rate": 6.004792024680294e-11, "train/rewards/chosen": -0.11145105212926865, "train/rewards/rejected": -0.26377159357070923, "train/rewards/accuracies": 0.6575000286102295, "train/rewards/margins": 0.15232053399085999, "train/logps/rejected": -418.4268493652344, "train/logps/chosen": -445.8639831542969, "train/logits/rejected": -3.1062843799591064, "train/logits/chosen": -3.1527392864227295, "train/epoch": 1.0, "train/global_step": 478, "_timestamp": 1714147827.4227393, "_runtime": 791.7218701839447, "_step": 25, "eval/loss": 0.6412035226821899, "eval/runtime": 8.2917, "eval/samples_per_second": 241.204, "eval/steps_per_second": 3.859, "eval/rewards/chosen": -0.10438449680805206, "eval/rewards/rejected": -0.24936166405677795, "eval/rewards/accuracies": 0.64453125, "eval/rewards/margins": 0.14497718214988708, "eval/logps/rejected": -429.45819091796875, "eval/logps/chosen": -433.63043212890625, "eval/logits/rejected": -3.204742431640625, "eval/logits/chosen": -3.25435733795166, "train_runtime": 784.6622, "train_samples_per_second": 77.913, "train_steps_per_second": 0.609, "total_flos": 0.0, "train_loss": 0.6571792745689967}
|
wandb/run-20240426_155715-vljs8xzo/logs/debug-internal.log
CHANGED
@@ -747,3 +747,208 @@
|
|
747 |
2024-04-26 16:08:23,064 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
748 |
2024-04-26 16:08:24,187 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
749 |
2024-04-26 16:08:29,188 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
747 |
2024-04-26 16:08:23,064 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
748 |
2024-04-26 16:08:24,187 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
749 |
2024-04-26 16:08:29,188 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
750 |
+
2024-04-26 16:08:33,026 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
751 |
+
2024-04-26 16:08:34,871 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
752 |
+
2024-04-26 16:08:37,030 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
753 |
+
2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
754 |
+
2024-04-26 16:08:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
755 |
+
2024-04-26 16:08:37,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
756 |
+
2024-04-26 16:08:39,032 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
757 |
+
2024-04-26 16:08:39,996 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
758 |
+
2024-04-26 16:08:41,034 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
759 |
+
2024-04-26 16:08:43,038 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
760 |
+
2024-04-26 16:08:45,040 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
761 |
+
2024-04-26 16:08:45,420 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
762 |
+
2024-04-26 16:08:46,337 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
763 |
+
2024-04-26 16:08:47,042 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
764 |
+
2024-04-26 16:08:49,044 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
765 |
+
2024-04-26 16:08:50,801 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
766 |
+
2024-04-26 16:08:51,045 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
767 |
+
2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
768 |
+
2024-04-26 16:08:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
769 |
+
2024-04-26 16:08:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
770 |
+
2024-04-26 16:08:53,051 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
771 |
+
2024-04-26 16:08:55,053 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
772 |
+
2024-04-26 16:08:56,161 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
773 |
+
2024-04-26 16:08:57,055 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
774 |
+
2024-04-26 16:08:59,057 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
775 |
+
2024-04-26 16:09:01,059 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
776 |
+
2024-04-26 16:09:01,568 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
777 |
+
2024-04-26 16:09:03,060 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
778 |
+
2024-04-26 16:09:05,062 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
779 |
+
2024-04-26 16:09:06,912 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
780 |
+
2024-04-26 16:09:07,057 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
781 |
+
2024-04-26 16:09:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
782 |
+
2024-04-26 16:09:07,058 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
783 |
+
2024-04-26 16:09:07,064 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
784 |
+
2024-04-26 16:09:07,332 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
785 |
+
2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
786 |
+
2024-04-26 16:09:07,334 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
787 |
+
2024-04-26 16:09:07,335 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
788 |
+
2024-04-26 16:09:08,066 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
789 |
+
2024-04-26 16:09:09,067 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
790 |
+
2024-04-26 16:09:11,069 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
791 |
+
2024-04-26 16:09:12,304 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
792 |
+
2024-04-26 16:09:13,071 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
793 |
+
2024-04-26 16:09:15,073 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
794 |
+
2024-04-26 16:09:16,339 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
795 |
+
2024-04-26 16:09:17,076 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
796 |
+
2024-04-26 16:09:17,760 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
797 |
+
2024-04-26 16:09:19,077 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
798 |
+
2024-04-26 16:09:21,079 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
799 |
+
2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
800 |
+
2024-04-26 16:09:22,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
801 |
+
2024-04-26 16:09:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
802 |
+
2024-04-26 16:09:23,081 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
803 |
+
2024-04-26 16:09:23,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
804 |
+
2024-04-26 16:09:25,083 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
805 |
+
2024-04-26 16:09:27,085 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
806 |
+
2024-04-26 16:09:28,623 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
807 |
+
2024-04-26 16:09:29,087 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
808 |
+
2024-04-26 16:09:31,089 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
809 |
+
2024-04-26 16:09:33,091 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
810 |
+
2024-04-26 16:09:34,019 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
811 |
+
2024-04-26 16:09:35,093 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
812 |
+
2024-04-26 16:09:37,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
813 |
+
2024-04-26 16:09:37,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
814 |
+
2024-04-26 16:09:37,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
815 |
+
2024-04-26 16:09:37,095 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
816 |
+
2024-04-26 16:09:39,097 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
817 |
+
2024-04-26 16:09:39,390 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
818 |
+
2024-04-26 16:09:41,099 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
819 |
+
2024-04-26 16:09:41,173 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
820 |
+
2024-04-26 16:09:41,175 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
821 |
+
2024-04-26 16:09:41,176 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
822 |
+
2024-04-26 16:09:41,177 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
823 |
+
2024-04-26 16:09:42,101 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
824 |
+
2024-04-26 16:09:43,102 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
825 |
+
2024-04-26 16:09:44,810 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
826 |
+
2024-04-26 16:09:45,104 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
827 |
+
2024-04-26 16:09:46,340 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
828 |
+
2024-04-26 16:09:47,106 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
829 |
+
2024-04-26 16:09:49,109 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
830 |
+
2024-04-26 16:09:50,192 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
831 |
+
2024-04-26 16:09:51,111 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
832 |
+
2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
833 |
+
2024-04-26 16:09:52,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
834 |
+
2024-04-26 16:09:52,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
835 |
+
2024-04-26 16:09:53,112 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
836 |
+
2024-04-26 16:09:55,115 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
837 |
+
2024-04-26 16:09:55,629 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
838 |
+
2024-04-26 16:09:57,117 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
839 |
+
2024-04-26 16:09:59,119 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
840 |
+
2024-04-26 16:10:01,005 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
841 |
+
2024-04-26 16:10:01,121 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
842 |
+
2024-04-26 16:10:03,123 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
843 |
+
2024-04-26 16:10:05,125 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
844 |
+
2024-04-26 16:10:06,489 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
845 |
+
2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
846 |
+
2024-04-26 16:10:07,058 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
847 |
+
2024-04-26 16:10:07,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
848 |
+
2024-04-26 16:10:07,127 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
849 |
+
2024-04-26 16:10:09,129 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
850 |
+
2024-04-26 16:10:11,132 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
851 |
+
2024-04-26 16:10:11,931 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
852 |
+
2024-04-26 16:10:13,134 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
853 |
+
2024-04-26 16:10:15,038 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
854 |
+
2024-04-26 16:10:15,039 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
855 |
+
2024-04-26 16:10:15,040 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
856 |
+
2024-04-26 16:10:15,041 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
857 |
+
2024-04-26 16:10:15,136 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
858 |
+
2024-04-26 16:10:15,137 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
859 |
+
2024-04-26 16:10:16,342 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
860 |
+
2024-04-26 16:10:17,138 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
861 |
+
2024-04-26 16:10:17,356 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
862 |
+
2024-04-26 16:10:19,091 DEBUG SenderThread:713738 [sender.py:send():379] send: telemetry
|
863 |
+
2024-04-26 16:10:19,093 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
864 |
+
2024-04-26 16:10:19,094 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
865 |
+
2024-04-26 16:10:19,097 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
866 |
+
2024-04-26 16:10:19,098 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
867 |
+
2024-04-26 16:10:19,099 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
868 |
+
2024-04-26 16:10:19,099 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
869 |
+
2024-04-26 16:10:19,101 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
870 |
+
2024-04-26 16:10:19,101 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
871 |
+
2024-04-26 16:10:19,102 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
872 |
+
2024-04-26 16:10:19,103 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
873 |
+
2024-04-26 16:10:19,104 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
874 |
+
2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
875 |
+
2024-04-26 16:10:19,105 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
876 |
+
2024-04-26 16:10:19,106 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
877 |
+
2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
878 |
+
2024-04-26 16:10:19,141 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
879 |
+
2024-04-26 16:10:21,143 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
880 |
+
2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
881 |
+
2024-04-26 16:10:22,059 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
882 |
+
2024-04-26 16:10:22,059 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
883 |
+
2024-04-26 16:10:22,467 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
884 |
+
2024-04-26 16:10:23,145 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
885 |
+
2024-04-26 16:10:25,147 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
886 |
+
2024-04-26 16:10:27,149 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
887 |
+
2024-04-26 16:10:27,423 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: partial_history
|
888 |
+
2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send():379] send: history
|
889 |
+
2024-04-26 16:10:27,425 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: summary_record
|
890 |
+
2024-04-26 16:10:27,429 INFO SenderThread:713738 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
|
891 |
+
2024-04-26 16:10:28,151 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/wandb-summary.json
|
892 |
+
2024-04-26 16:10:28,437 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
893 |
+
2024-04-26 16:10:29,153 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
894 |
+
2024-04-26 16:10:29,154 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/config.yaml
|
895 |
+
2024-04-26 16:10:33,569 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
896 |
+
2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
897 |
+
2024-04-26 16:10:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
898 |
+
2024-04-26 16:10:37,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
899 |
+
2024-04-26 16:10:39,164 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
900 |
+
2024-04-26 16:10:39,263 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
901 |
+
2024-04-26 16:10:44,264 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
902 |
+
2024-04-26 16:10:46,345 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
903 |
+
2024-04-26 16:10:49,174 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
904 |
+
2024-04-26 16:10:50,203 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
905 |
+
2024-04-26 16:10:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
906 |
+
2024-04-26 16:10:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
907 |
+
2024-04-26 16:10:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
908 |
+
2024-04-26 16:10:55,181 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
909 |
+
2024-04-26 16:10:55,370 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
910 |
+
2024-04-26 16:10:57,183 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
911 |
+
2024-04-26 16:10:59,185 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
912 |
+
2024-04-26 16:11:00,468 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
913 |
+
2024-04-26 16:11:01,187 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
914 |
+
2024-04-26 16:11:03,189 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
915 |
+
2024-04-26 16:11:05,191 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
916 |
+
2024-04-26 16:11:05,523 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
917 |
+
2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
918 |
+
2024-04-26 16:11:07,120 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
919 |
+
2024-04-26 16:11:07,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
920 |
+
2024-04-26 16:11:07,194 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
921 |
+
2024-04-26 16:11:09,196 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
922 |
+
2024-04-26 16:11:10,641 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
923 |
+
2024-04-26 16:11:13,200 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
924 |
+
2024-04-26 16:11:15,202 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
925 |
+
2024-04-26 16:11:15,699 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
926 |
+
2024-04-26 16:11:16,347 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
927 |
+
2024-04-26 16:11:17,204 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
928 |
+
2024-04-26 16:11:19,207 INFO Thread-12 :713738 [dir_watcher.py:_on_file_modified():288] file/dir modified: /fsx/sanchit/distil-zephyr-1.5b-dpo-ultrafeedback/wandb/run-20240426_155715-vljs8xzo/files/output.log
|
929 |
+
2024-04-26 16:11:21,307 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
930 |
+
2024-04-26 16:11:22,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
931 |
+
2024-04-26 16:11:22,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
932 |
+
2024-04-26 16:11:22,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
933 |
+
2024-04-26 16:11:27,229 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
934 |
+
2024-04-26 16:11:32,323 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
935 |
+
2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
936 |
+
2024-04-26 16:11:37,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
937 |
+
2024-04-26 16:11:37,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
938 |
+
2024-04-26 16:11:37,584 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
939 |
+
2024-04-26 16:11:42,686 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
940 |
+
2024-04-26 16:11:46,350 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
941 |
+
2024-04-26 16:11:47,930 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
942 |
+
2024-04-26 16:11:52,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
943 |
+
2024-04-26 16:11:52,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
944 |
+
2024-04-26 16:11:52,121 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
945 |
+
2024-04-26 16:11:53,120 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
946 |
+
2024-04-26 16:11:58,201 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
947 |
+
2024-04-26 16:12:03,288 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
948 |
+
2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: stop_status
|
949 |
+
2024-04-26 16:12:07,121 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: internal_messages
|
950 |
+
2024-04-26 16:12:07,122 DEBUG SenderThread:713738 [sender.py:send_request():406] send_request: stop_status
|
951 |
+
2024-04-26 16:12:08,297 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
952 |
+
2024-04-26 16:12:13,359 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
953 |
+
2024-04-26 16:12:16,352 DEBUG SenderThread:713738 [sender.py:send():379] send: stats
|
954 |
+
2024-04-26 16:12:18,516 DEBUG HandlerThread:713738 [handler.py:handle_request():146] handle_request: status_report
|
wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb
CHANGED
Binary files a/wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb and b/wandb/run-20240426_155715-vljs8xzo/run-vljs8xzo.wandb differ
|
|