Commit 6df0da5 (verified) by hexuan21 · 1 parent: aec4ad5

Model save

Files changed (1)
  1. README.md +4 -4
README.md CHANGED
@@ -38,10 +38,10 @@ The following hyperparameters were used during training:
  - eval_batch_size: 1
  - seed: 42
  - distributed_type: multi-GPU
- - num_devices: 8
- - gradient_accumulation_steps: 8
- - total_train_batch_size: 64
- - total_eval_batch_size: 8
+ - num_devices: 7
+ - gradient_accumulation_steps: 9
+ - total_train_batch_size: 63
+ - total_eval_batch_size: 7
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: cosine
  - lr_scheduler_warmup_ratio: 0.03
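
The changed totals follow from the device count and accumulation steps. The sketch below reproduces that arithmetic. It assumes a per-device train batch size of 1, which this hunk does not show but which is consistent with 64 = 8 × 8 and 63 = 7 × 9; the function name `effective_batch_sizes` is hypothetical and is not part of any library.

```python
def effective_batch_sizes(per_device_train, per_device_eval, num_devices, grad_accum):
    """Return (total_train_batch_size, total_eval_batch_size)."""
    # Training: each optimizer step sees every device's batch, accumulated
    # over grad_accum forward/backward passes.
    total_train = per_device_train * num_devices * grad_accum
    # Evaluation: no accumulation, so only the device count multiplies.
    total_eval = per_device_eval * num_devices
    return total_train, total_eval


# Assumed per-device train batch size of 1 (not shown in this hunk).
before = effective_batch_sizes(1, 1, num_devices=8, grad_accum=8)
after = effective_batch_sizes(1, 1, num_devices=7, grad_accum=9)

print(before)  # (64, 8)
print(after)   # (63, 7)
```

Under that assumption, raising `gradient_accumulation_steps` from 8 to 9 after dropping to 7 devices keeps the effective train batch size close to the earlier value (63 versus 64).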