End of training
README.md
CHANGED
@@ -163,7 +163,7 @@ The following hyperparameters were used during training:
 weight=0
 )
 )`
-- lr_scheduler: `<torch.optim.lr_scheduler.LambdaLR object at
+- lr_scheduler: `<torch.optim.lr_scheduler.LambdaLR object at 0x777cbafb23b0>`
 - student_model_name_or_path: `None`
 - student_config_name_or_path: `None`
 - student_model_config: `{'num_hidden_layers': 15}`
logs/learning_rate=0.0001, lr_scheduler_kwargs=__power___1.5___lr_end___2e-05_, lr_scheduler_type=polynomial, per_device_train_batch_size=8/events.out.tfevents.1726692188.1c1a426a2fee
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:21ba6a2d3291128dc9414f965e090856eca58129fe4a9b5e5ee7e9446a137c03
+size 529