fdaudens HF staff commited on
Commit
cd160b3
·
verified ·
1 Parent(s): 9582f94

End of training

Browse files
README.md CHANGED
@@ -4,6 +4,8 @@ license: apache-2.0
4
  base_model: answerdotai/ModernBERT-base
5
  tags:
6
  - generated_from_trainer
 
 
7
  model-index:
8
  - name: ModernBERT-hf-posts-classifier
9
  results: []
@@ -16,10 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.2372
20
- - Micro F1: 0.4251
21
- - Macro F1: 0.0459
22
- - Weighted F1: 0.2837
23
 
24
  ## Model description
25
 
@@ -39,30 +39,27 @@ More information needed
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 5e-05
42
- - train_batch_size: 8
43
  - eval_batch_size: 8
44
  - seed: 42
45
- - gradient_accumulation_steps: 4
46
- - total_train_batch_size: 32
47
  - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
  - num_epochs: 5
50
- - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
- | Training Loss | Epoch | Step | Validation Loss | Micro F1 | Macro F1 | Weighted F1 |
55
- |:-------------:|:------:|:----:|:---------------:|:--------:|:--------:|:-----------:|
56
- | No log | 1.0 | 15 | 0.2547 | 0.4621 | 0.0473 | 0.2926 |
57
- | No log | 2.0 | 30 | 0.2524 | 0.3676 | 0.0386 | 0.2377 |
58
- | No log | 3.0 | 45 | 0.2412 | 0.4291 | 0.0491 | 0.2895 |
59
- | No log | 4.0 | 60 | 0.2378 | 0.4291 | 0.0460 | 0.2846 |
60
- | No log | 4.7018 | 70 | 0.2372 | 0.4251 | 0.0459 | 0.2837 |
61
 
62
 
63
  ### Framework versions
64
 
65
- - Transformers 4.48.0.dev0
66
  - Pytorch 2.5.0+cu124
67
  - Datasets 3.1.0
68
  - Tokenizers 0.21.0
 
4
  base_model: answerdotai/ModernBERT-base
5
  tags:
6
  - generated_from_trainer
7
+ metrics:
8
+ - f1
9
  model-index:
10
  - name: ModernBERT-hf-posts-classifier
11
  results: []
 
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.3951
22
+ - F1: 0.6703
 
 
23
 
24
  ## Model description
25
 
 
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 5e-05
42
+ - train_batch_size: 16
43
  - eval_batch_size: 8
44
  - seed: 42
 
 
45
  - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
  - num_epochs: 5
 
48
 
49
  ### Training results
50
 
51
+ | Training Loss | Epoch | Step | Validation Loss | F1 |
52
+ |:-------------:|:-----:|:----:|:---------------:|:------:|
53
+ | No log | 1.0 | 26 | 1.2084 | 0.6381 |
54
+ | No log | 2.0 | 52 | 1.7850 | 0.5018 |
55
+ | No log | 3.0 | 78 | 1.1985 | 0.7118 |
56
+ | 0.4128 | 4.0 | 104 | 1.3353 | 0.6716 |
57
+ | 0.4128 | 5.0 | 130 | 1.3951 | 0.6703 |
58
 
59
 
60
  ### Framework versions
61
 
62
+ - Transformers 4.48.0
63
  - Pytorch 2.5.0+cu124
64
  - Datasets 3.1.0
65
  - Tokenizers 0.21.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:129d1974934c34fdd86afe34846d4fb97fdafb4b3ac1dabb7fd08297d69a0c0b
3
  size 598476704
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:54c90872842f81bba5861b59b0680e651914d891ee5ecef6a67d367d93fba3f6
3
  size 598476704
runs/Jan10_22-06-41_5e4eb05f69bb/events.out.tfevents.1736546813.5e4eb05f69bb.2578.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5cdad69d94039983046c7ba7d782cba75e585498edef9e39378b6b2aee3c79d4
3
- size 8339
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7615d2a2daf57630e6c598b5270948f3128368025ad0a06c53fb1660f321e65a
3
+ size 8693
tokenizer.json CHANGED
@@ -1,7 +1,19 @@
1
  {
2
  "version": "1.0",
3
- "truncation": null,
4
- "padding": null,
 
 
 
 
 
 
 
 
 
 
 
 
5
  "added_tokens": [
6
  {
7
  "id": 0,
 
1
  {
2
  "version": "1.0",
3
+ "truncation": {
4
+ "direction": "Right",
5
+ "max_length": 512,
6
+ "strategy": "LongestFirst",
7
+ "stride": 0
8
+ },
9
+ "padding": {
10
+ "strategy": "BatchLongest",
11
+ "direction": "Right",
12
+ "pad_to_multiple_of": null,
13
+ "pad_id": 50283,
14
+ "pad_type_id": 0,
15
+ "pad_token": "[PAD]"
16
+ },
17
  "added_tokens": [
18
  {
19
  "id": 0,