End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ license: apache-2.0
 base_model: facebook/bart-base
 tags:
 - generated_from_trainer
 model-index:
 - name: bart_qa_model
   results: []
@@ -15,7 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2453
 ## Model description
@@ -34,21 +38,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 250  | 1.4087          |
-| 1.9051        | 2.0   | 500  | 1.2457          |
-| 1.9051        | 3.0   | 750  | 1.2453          |
 ### Framework versions

 base_model: facebook/bart-base
 tags:
 - generated_from_trainer
+metrics:
+- f1
 model-index:
 - name: bart_qa_model
   results: []
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1504
+- F1: 0.7493
+- Exact Match: 0.608
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 3.7185140364032e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1     | Exact Match |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:-----------:|
+| 2.4874        | 1.0   | 125  | 1.2569          | 0.6897 | 0.545       |
+| 1.1954        | 2.0   | 250  | 1.1084          | 0.7424 | 0.6         |
+| 0.904         | 3.0   | 375  | 1.1504          | 0.7493 | 0.608       |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:31e87838e713d5cd1b2f5bf225e9d1b5dada6162b002d448584d71ba04dca4df
 size 557717800

 version https://git-lfs.github.com/spec/v1
+oid sha256:c9deaeec004134b513cb974bb80414601da030c7ddc6ff368517714a1d4e2e19
 size 557717800

runs/Jan07_17-27-25_fad11acb78a4/events.out.tfevents.1704648448.fad11acb78a4.587.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:77a6bd9cb3e62fc03cea93d7416c526dbb3b691bc73eb4578919df8912082879
+size 7341

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c4d7e329438fa2b7a0e9a46baf6defe9231101c113f1f3fb204268d27efbe78
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:3764eb5ff9cd1a0d8c826516fcb044a5705f8ac3ecad513906f38d76e6754f15
 size 4664