Sarmila/pubmed-bert-squad-covidqa

Question Answering

Generated from Trainer


Sarmila committed on Sep 18, 2023

Commit 343875a • 1 Parent(s): 66663d7

End of training

Files changed (4)

README.md +8 -8
config.json +1 -1
pytorch_model.bin +1 -1
training_args.bin +2 -2

README.md CHANGED

@@ -1,10 +1,10 @@
 ---
 license: mit
-base_model: microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract
 tags:
 - generated_from_trainer
 datasets:
-- squad
 model-index:
 - name: pubmed-bert-squad-covidqa
  results: []
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 # pubmed-bert-squad-covidqa
-This model is a fine-tuned version of [microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract) on the squad dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0342
 ## Model description
@@ -39,7 +39,7 @@ The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 32
 - eval_batch_size: 32
-- seed: 0
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
@@ -48,9 +48,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0731 | 1.0 | 2738 | 1.0432 |
-| 0.8584 | 2.0 | 5476 | 1.0055 |
-| 0.6878 | 3.0 | 8214 | 1.0342 |
 ### Framework versions

 ---
 license: mit
+base_model: Sarmila/pubmed-bert-squad-covidqa
 tags:
 - generated_from_trainer
 datasets:
+- covid_qa_deepset
 model-index:
 - name: pubmed-bert-squad-covidqa
  results: []
 # pubmed-bert-squad-covidqa
+This model is a fine-tuned version of [Sarmila/pubmed-bert-squad-covidqa](https://huggingface.co/Sarmila/pubmed-bert-squad-covidqa) on the covid_qa_deepset dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4876
 ## Model description
 - learning_rate: 2e-05
 - train_batch_size: 32
 - eval_batch_size: 32
+- seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log | 1.0 | 51 | 0.4001 |
+| No log | 2.0 | 102 | 0.4524 |
+| No log | 3.0 | 153 | 0.4876 |
 ### Framework versions
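
In the updated results table the validation loss rises after the first epoch, and the reported evaluation loss (0.4876) matches the last epoch rather than the lowest one. The following is a minimal Python sketch, using only the three rows from that table, that picks the epoch with the lowest validation loss. The names `rows` and `best_row` are illustrative and are not part of the commit.

```python
# (epoch, step, validation_loss) rows copied from the updated README table.
rows = [
    (1.0, 51, 0.4001),
    (2.0, 102, 0.4524),
    (3.0, 153, 0.4876),
]


def best_row(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda r: r[2])


best_epoch, best_step, best_loss = best_row(rows)
final_loss = rows[-1][2]

print(f"lowest validation loss {best_loss} at epoch {best_epoch} (step {best_step})")
print(f"final-epoch loss {final_loss} is {final_loss - best_loss:.4f} higher")
```

Whether the earlier checkpoint is actually the better model depends on task metrics such as exact match and F1, which this commit does not report.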

config.json CHANGED

@@ -1,5 +1,5 @@
 {
- "_name_or_path": "microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract",
  "architectures": [
  "BertForQuestionAnswering"
  ],

 {
+ "_name_or_path": "Sarmila/pubmed-bert-squad-covidqa",
  "architectures": [
  "BertForQuestionAnswering"
  ],

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ab37892f8be6ad458f5cb070358ee42d00cee31023add669df8eb875cc87e4f7
 size 435640489

 version https://git-lfs.github.com/spec/v1
+oid sha256:af0faafd433075dc50861b0fb403b96aac7d4a82bc5d17d3167e6f7e4efe9fc4
 size 435640489

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a95eb3ce7695fce56a6e7dbe91a618faef19ee4c74886e46ecc2964b8dd9478d
-size 4091

 version https://git-lfs.github.com/spec/v1
+oid sha256:eb7d718ea7e3f72dd3bd810ac5214dd79e7d7ccf2b040fc5af5fb5a92466ae93
+size 4027
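
The `pytorch_model.bin` and `training_args.bin` entries above are Git LFS pointer files: three `key value` lines giving the spec version, the SHA-256 of the real file, and its size in bytes. The following is a minimal Python sketch of parsing such a pointer and checking a downloaded file against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sketch assumes the pointer contains only the three keys shown in this diff.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into version, hash algorithm, digest, size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 in `pointer`."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so a large weights file is never fully loaded in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer for the new pytorch_model.bin, copied from the diff above.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:af0faafd433075dc50861b0fb403b96aac7d4a82bc5d17d3167e6f7e4efe9fc4
size 435640489
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

For `pytorch_model.bin` the size is the same before and after (435640489 bytes) while the oid differs, which is consistent with the same architecture carrying updated weights.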