Sarmila commited on
Commit
849f3d6
1 Parent(s): 66562c7

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -8
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  This model is a fine-tuned version of [allenai/scibert_scivocab_uncased](https://huggingface.co/allenai/scibert_scivocab_uncased) on the None dataset.
15
  It achieves the following results on the evaluation set:
16
- - Loss: 0.1722
17
 
18
  ## Model description
19
 
@@ -32,22 +32,29 @@ More information needed
32
  ### Training hyperparameters
33
 
34
  The following hyperparameters were used during training:
35
- - learning_rate: 2e-05
36
- - train_batch_size: 64
37
- - eval_batch_size: 64
38
  - seed: 42
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
- - num_epochs: 3.0
42
  - mixed_precision_training: Native AMP
43
 
44
  ### Training results
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
- | No log | 1.0 | 30 | 0.4661 |
49
- | No log | 2.0 | 60 | 0.2044 |
50
- | No log | 3.0 | 90 | 0.1722 |
 
 
 
 
 
 
 
51
 
52
 
53
  ### Framework versions
 
13
 
14
  This model is a fine-tuned version of [allenai/scibert_scivocab_uncased](https://huggingface.co/allenai/scibert_scivocab_uncased) on the None dataset.
15
  It achieves the following results on the evaluation set:
16
+ - Loss: 0.3494
17
 
18
  ## Model description
19
 
 
32
  ### Training hyperparameters
33
 
34
  The following hyperparameters were used during training:
35
+ - learning_rate: 3e-05
36
+ - train_batch_size: 32
37
+ - eval_batch_size: 32
38
  - seed: 42
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
+ - num_epochs: 10
42
  - mixed_precision_training: Native AMP
43
 
44
  ### Training results
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
+ | No log | 1.0 | 60 | 0.4161 |
49
+ | No log | 2.0 | 120 | 0.3882 |
50
+ | No log | 3.0 | 180 | 0.3667 |
51
+ | No log | 4.0 | 240 | 0.3318 |
52
+ | No log | 5.0 | 300 | 0.3162 |
53
+ | No log | 6.0 | 360 | 0.3145 |
54
+ | No log | 7.0 | 420 | 0.3277 |
55
+ | No log | 8.0 | 480 | 0.3375 |
56
+ | 0.4754 | 9.0 | 540 | 0.3431 |
57
+ | 0.4754 | 10.0 | 600 | 0.3494 |
58
 
59
 
60
  ### Framework versions