bart_tech_keywords

This model is a fine-tuned version of facebook/bart-large on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 4
eval_batch_size: 4
seed: 42
gradient_accumulation_steps: 16
total_train_batch_size: 64
optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 500
num_epochs: 3

Training Loss	Epoch	Step	Validation Loss
1.395	0.4447	50	1.1718
1.1326	0.8894	100	0.9652
0.9907	1.3341	150	0.9109
0.9297	1.7788	200	0.8911
0.8629	2.2235	250	0.9051
0.8599	2.6681	300	0.8341