readme
Browse files
README.md
CHANGED
@@ -66,30 +66,31 @@ CUDA_VISIBLE_DEVICES=0 CUDA_LAUNCH_BLOCKING=0 PYTORCH_CUDA_ALLOC_CONF=expandable
|
|
66 |
|
67 |
```
|
68 |
Seed set to 23
|
69 |
-
Time to instantiate model: 0.
|
70 |
-
Total parameters:
|
71 |
Verifying settings ...
|
72 |
-
Measured TFLOPs:
|
73 |
-
|
74 |
-
Epoch 1 | iter
|
75 |
-
Epoch 1 | iter
|
76 |
-
Epoch 1 | iter
|
77 |
-
Epoch 1 | iter
|
78 |
-
Epoch 1 | iter
|
79 |
-
Epoch 1 | iter
|
80 |
-
Epoch 1 | iter
|
81 |
-
Epoch 1 | iter
|
82 |
-
Epoch 1 | iter
|
83 |
-
Epoch 1 | iter
|
84 |
-
Epoch 1 | iter
|
85 |
-
Epoch 1 | iter
|
86 |
-
Epoch 1 | iter
|
87 |
-
Epoch 1 | iter
|
88 |
-
Epoch 1 | iter
|
89 |
-
Epoch 1 | iter
|
90 |
-
Epoch 1 | iter
|
91 |
-
Epoch 1 | iter
|
92 |
-
Epoch 1 | iter
|
|
|
93 |
# ...
|
94 |
```
|
95 |
|
|
|
66 |
|
67 |
```
|
68 |
Seed set to 23
|
69 |
+
Time to instantiate model: 0.32 seconds.
|
70 |
+
Total parameters: 217,088,512
|
71 |
Verifying settings ...
|
72 |
+
Measured TFLOPs: 3548.40
|
73 |
+
|
74 |
+
Epoch 1 | iter 256 step 1 | loss train: 11.716, val: n/a | iter time: 1735.26 ms (step) remaining time: 4 days, 11:06:29
|
75 |
+
Epoch 1 | iter 512 step 2 | loss train: 11.534, val: n/a | iter time: 1102.77 ms (step) remaining time: 4 days, 2:31:30
|
76 |
+
Epoch 1 | iter 768 step 3 | loss train: 11.356, val: n/a | iter time: 1095.87 ms (step) remaining time: 3 days, 23:44:12
|
77 |
+
Epoch 1 | iter 1024 step 4 | loss train: 11.162, val: n/a | iter time: 1099.92 ms (step) remaining time: 3 days, 22:18:27
|
78 |
+
Epoch 1 | iter 1280 step 5 | loss train: 11.018, val: n/a | iter time: 1096.45 ms (step) remaining time: 3 days, 21:24:35
|
79 |
+
Epoch 1 | iter 1536 step 6 | loss train: 10.901, val: n/a | iter time: 1093.65 ms (step) remaining time: 3 days, 20:48:11
|
80 |
+
Epoch 1 | iter 1792 step 7 | loss train: 10.850, val: n/a | iter time: 1100.16 ms (step) remaining time: 3 days, 20:22:00
|
81 |
+
Epoch 1 | iter 2048 step 8 | loss train: 10.780, val: n/a | iter time: 1092.67 ms (step) remaining time: 3 days, 20:01:57
|
82 |
+
Epoch 1 | iter 2304 step 9 | loss train: 10.692, val: n/a | iter time: 1095.77 ms (step) remaining time: 3 days, 19:45:57
|
83 |
+
Epoch 1 | iter 2560 step 10 | loss train: 10.678, val: n/a | iter time: 1092.12 ms (step) remaining time: 3 days, 19:32:43
|
84 |
+
Epoch 1 | iter 2816 step 11 | loss train: 10.619, val: n/a | iter time: 1094.44 ms (step) remaining time: 3 days, 19:21:32
|
85 |
+
Epoch 1 | iter 3072 step 12 | loss train: 10.588, val: n/a | iter time: 1102.51 ms (step) remaining time: 3 days, 19:12:30
|
86 |
+
Epoch 1 | iter 3328 step 13 | loss train: 10.514, val: n/a | iter time: 1095.57 ms (step) remaining time: 3 days, 19:04:07
|
87 |
+
Epoch 1 | iter 3584 step 14 | loss train: 10.472, val: n/a | iter time: 1104.00 ms (step) remaining time: 3 days, 18:56:56
|
88 |
+
Epoch 1 | iter 3840 step 15 | loss train: 10.431, val: n/a | iter time: 1096.00 ms (step) remaining time: 3 days, 18:50:21
|
89 |
+
Epoch 1 | iter 4096 step 16 | loss train: 10.392, val: n/a | iter time: 1098.34 ms (step) remaining time: 3 days, 18:44:25
|
90 |
+
Epoch 1 | iter 4352 step 17 | loss train: 10.360, val: n/a | iter time: 1106.53 ms (step) remaining time: 3 days, 18:38:58
|
91 |
+
Epoch 1 | iter 4608 step 18 | loss train: 10.329, val: n/a | iter time: 1084.95 ms (step) remaining time: 3 days, 18:33:58
|
92 |
+
Epoch 1 | iter 4864 step 19 | loss train: 10.296, val: n/a | iter time: 1096.22 ms (step) remaining time: 3 days, 18:29:12
|
93 |
+
Epoch 1 | iter 5120 step 20 | loss train: 10.236, val: n/a | iter time: 1093.39 ms (step) remaining time: 3 days, 18:24:51
|
94 |
# ...
|
95 |
```
|
96 |
|