End of training

Files changed (5)

README.md CHANGED

@@ -1,12 +1,9 @@
 ---
 library_name: transformers
 license: apache-2.0
-base_model: MoritzLaurer/ModernBERT-large-zeroshot-v2.0
 tags:
 - generated_from_trainer
-metrics:
-- f1
-- accuracy
 model-index:
 - name: ModernBERT-hf-posts-classifier
   results: []
@@ -17,11 +14,12 @@ should probably proofread and complete it, then remove this comment. -->
 # ModernBERT-hf-posts-classifier
-This model is a fine-tuned version of [MoritzLaurer/ModernBERT-large-zeroshot-v2.0](https://huggingface.co/MoritzLaurer/ModernBERT-large-zeroshot-v2.0) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1848
-- F1: 0.2079
-- Accuracy: 0.1569
 ## Model description
@@ -53,13 +51,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | F1     | Accuracy |
-|:-------------:|:------:|:----:|:---------------:|:------:|:--------:|
-| No log        | 1.0    | 15   | 0.2568          | 0.0560 | 0.0196   |
-| No log        | 2.0    | 30   | 0.2353          | 0.0665 | 0.0784   |
-| No log        | 3.0    | 45   | 0.2007          | 0.1981 | 0.0980   |
-| No log        | 4.0    | 60   | 0.1878          | 0.2129 | 0.0980   |
-| No log        | 4.7018 | 70   | 0.1848          | 0.2079 | 0.1569   |
 ### Framework versions

 ---
 library_name: transformers
 license: apache-2.0
+base_model: answerdotai/ModernBERT-base
 tags:
 - generated_from_trainer
 model-index:
 - name: ModernBERT-hf-posts-classifier
   results: []
 # ModernBERT-hf-posts-classifier
+This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2372
+- Micro F1: 0.4251
+- Macro F1: 0.0459
+- Weighted F1: 0.2837
 ## Model description
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Micro F1 | Macro F1 | Weighted F1 |
+|:-------------:|:------:|:----:|:---------------:|:--------:|:--------:|:-----------:|
+| No log        | 1.0    | 15   | 0.2547          | 0.4621   | 0.0473   | 0.2926      |
+| No log        | 2.0    | 30   | 0.2524          | 0.3676   | 0.0386   | 0.2377      |
+| No log        | 3.0    | 45   | 0.2412          | 0.4291   | 0.0491   | 0.2895      |
+| No log        | 4.0    | 60   | 0.2378          | 0.4291   | 0.0460   | 0.2846      |
+| No log        | 4.7018 | 70   | 0.2372          | 0.4251   | 0.0459   | 0.2837      |
 ### Framework versions
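The updated card reports micro, macro, and weighted F1 in place of a single F1 and accuracy. As a rough illustration of how the three averages differ, here is a minimal pure-Python sketch for multi-label 0/1 indicator data. The function names and the toy labels are hypothetical; the diff does not show how the training run computed its metrics.

```python
def f1_from_counts(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN); taken as 0.0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def f1_averages(y_true, y_pred):
    """Micro, macro, and weighted F1 for rows of 0/1 label indicators."""
    n_labels = len(y_true[0])
    counts = []
    for j in range(n_labels):
        pairs = [(t[j], p[j]) for t, p in zip(y_true, y_pred)]
        tp = sum(1 for t, p in pairs if t == 1 and p == 1)
        fp = sum(1 for t, p in pairs if t == 0 and p == 1)
        fn = sum(1 for t, p in pairs if t == 1 and p == 0)
        counts.append((tp, fp, fn))

    per_label = [f1_from_counts(*c) for c in counts]
    support = [tp + fn for tp, _, fn in counts]  # true occurrences per label
    total_support = sum(support)

    # Micro: pool the counts over all labels, then compute one F1.
    micro = f1_from_counts(*(sum(col) for col in zip(*counts)))
    # Macro: unweighted mean of the per-label F1 scores.
    macro = sum(per_label) / n_labels
    # Weighted: per-label F1 averaged with support as the weight.
    weighted = (
        sum(f * s for f, s in zip(per_label, support)) / total_support
        if total_support else 0.0
    )
    return {"micro": micro, "macro": macro, "weighted": weighted}


# Toy data (made up): label 0 is frequent, labels 1 and 2 are rare,
# and the predictions only ever fire on label 0.
y_true = [[1, 0, 0], [1, 0, 0], [1, 1, 0], [0, 0, 1]]
y_pred = [[1, 0, 0], [1, 0, 0], [1, 0, 0], [1, 0, 0]]
scores = f1_averages(y_true, y_pred)
print(scores)
```

In this toy case micro F1 comes out well above macro F1, because the frequent label is predicted well and the rare labels are never predicted. A gap of the same shape appears in the table above (micro 0.4251 vs. macro 0.0459), though the card does not say what causes it there.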

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e467169057ccd60ab25a540b9a4ee9eb094305b1edf7de11543386594f101940
 size 598541300

 version https://git-lfs.github.com/spec/v1
+oid sha256:d194be30d181b8cfb9202739e9f4b26972d406990975b65d04cc0e9f175d81bb
 size 598541300

runs/Jan09_18-55-21_c50f821e5c9f/events.out.tfevents.1736448922.c50f821e5c9f.1900.3 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a3a51e9a3cbc2b4afc72fc3c67dab106c06f9c3b4856063ab3ffcb4f5b836ee6
-size 9086

 version https://git-lfs.github.com/spec/v1
+oid sha256:0b8c3bb7c73d48a8d3d88e72a1ff73c974496148395fd13dfd4ba5bb9b8f8e9e
+size 9856

tokenizer.json CHANGED

@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 512
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 50283,
-    "pad_type_id": 0,
-    "pad_token": "[PAD]"
-  },
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,

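With `truncation` and `padding` set to `null`, the serialized tokenizer no longer forces every sequence to 512 tokens, so length handling has to be chosen at call time or by a collator. Below is a minimal sketch of the two padding behaviours in plain Python rather than through the tokenizers API. `pad_batch` is a hypothetical helper and the token ids are made up; only the `pad_id` of 50283 and the fixed length of 512 come from the removed config.

```python
PAD_ID = 50283  # pad_id from the removed "padding" block


def pad_batch(batch, fixed_length=None, pad_id=PAD_ID):
    """Right-pad lists of token ids.

    Pads to fixed_length when given, otherwise to the longest sequence
    in the batch. Sequences longer than the target are cut on the right.
    """
    if fixed_length is not None:
        target = fixed_length
    else:
        target = max(len(ids) for ids in batch)

    input_ids, attention_mask = [], []
    for ids in batch:
        kept = ids[:target]
        n_pad = target - len(kept)
        input_ids.append(kept + [pad_id] * n_pad)
        attention_mask.append([1] * len(kept) + [0] * n_pad)
    return {"input_ids": input_ids, "attention_mask": attention_mask}


batch = [[5, 6, 7, 8], [5, 6, 7]]           # made-up token ids
fixed = pad_batch(batch, fixed_length=512)  # old behaviour: always 512 wide
dynamic = pad_batch(batch)                  # pad only to the longest row
print(len(fixed["input_ids"][0]), len(dynamic["input_ids"][0]))
```

Padding to the longest row keeps short batches small, which is the usual reason for leaving fixed padding out of the saved tokenizer. Whether that was the intent of this commit is not stated.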
tokenizer_config.json CHANGED

@@ -937,7 +937,7 @@
     "input_ids",
     "attention_mask"
   ],
-  "model_max_length": 512,
   "pad_token": "[PAD]",
   "sep_token": "[SEP]",
   "tokenizer_class": "PreTrainedTokenizerFast",

     "input_ids",
     "attention_mask"
   ],
+  "model_max_length": 1000000000000000019884624838656,
   "pad_token": "[PAD]",
   "sep_token": "[SEP]",
   "tokenizer_class": "PreTrainedTokenizerFast",
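
The new `model_max_length` is not a real context limit. The value equals `int(1e30)`, the large placeholder that transformers writes when no maximum length is known for a tokenizer. Below is a small sketch of checking for it before relying on the field. `effective_max_length`, its threshold, and the fallback of 512 (the previous value in this file) are illustrative assumptions.

```python
# Float 1e30 converted to an exact integer; the trailing digits come from
# binary floating-point rounding, not from any real limit.
SENTINEL = int(1e30)


def effective_max_length(model_max_length, fallback=512, threshold=1_000_000):
    """Return fallback when model_max_length looks like a placeholder.

    The threshold is an arbitrary cut-off (assumption): no real tokenizer
    limit is expected to reach a million tokens here.
    """
    if model_max_length >= threshold:
        return fallback
    return model_max_length


print(SENTINEL == 1000000000000000019884624838656)
print(effective_max_length(SENTINEL))
print(effective_max_length(512))
```

With this config, code that truncates to `tokenizer.model_max_length` will effectively not truncate at all, so an explicit `max_length` is the safer choice when calling the tokenizer.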