DarshanDeshpande/distilbert_social_reasoning_reward_model

@@ -19,8 +19,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6145
-- Accuracy: 0.6871
 ## Model description
@@ -54,18 +54,18 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.6733        | 0.24  | 10   | 0.6564          | 0.6725   |
-| 0.6475        | 0.48  | 20   | 0.6191          | 0.6708   |
-| 0.655         | 0.72  | 30   | 0.6216          | 0.6708   |
-| 0.6489        | 0.96  | 40   | 0.6311          | 0.6708   |
-| 0.6204        | 1.2   | 50   | 0.6837          | 0.6147   |
-| 0.5924        | 1.44  | 60   | 0.6329          | 0.6988   |
-| 0.6124        | 1.68  | 70   | 0.6220          | 0.6620   |
-| 0.6123        | 1.92  | 80   | 0.6366          | 0.6515   |
-| 0.562         | 2.16  | 90   | 0.6584          | 0.6532   |
-| 0.5169        | 2.4   | 100  | 0.6956          | 0.6410   |
-| 0.5045        | 2.63  | 110  | 0.6823          | 0.6392   |
-| 0.4712        | 2.87  | 120  | 0.6927          | 0.6375   |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6309
+- Accuracy: 0.6958
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.6618        | 0.24  | 10   | 0.6505          | 0.6725   |
+| 0.6357        | 0.48  | 20   | 0.6373          | 0.6497   |
+| 0.6457        | 0.72  | 30   | 0.6226          | 0.6725   |
+| 0.646         | 0.96  | 40   | 0.6437          | 0.6778   |
+| 0.6448        | 1.2   | 50   | 0.7565          | 0.6287   |
+| 0.6339        | 1.44  | 60   | 0.6365          | 0.6655   |
+| 0.6207        | 1.68  | 70   | 0.6694          | 0.6778   |
+| 0.6217        | 1.92  | 80   | 0.6351          | 0.6340   |
+| 0.5928        | 2.16  | 90   | 0.7245          | 0.6497   |
+| 0.5938        | 2.4   | 100  | 0.6739          | 0.6497   |
+| 0.5873        | 2.63  | 110  | 0.6811          | 0.6357   |
+| 0.5442        | 2.87  | 120  | 0.6774          | 0.6375   |
 ### Framework versions
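
The added (+) rows above can be summarized with a short script. This is a minimal sketch, not part of the commit: the tuple layout and variable names are assumptions chosen for illustration, and the numbers are copied from the new side of the table. It picks out the step with the lowest validation loss and the step with the highest accuracy, and compares the first and last logged rows.

```python
# Rows copied from the added (+) side of the README table:
# (training_loss, epoch, step, validation_loss, accuracy)
new_rows = [
    (0.6618, 0.24, 10, 0.6505, 0.6725),
    (0.6357, 0.48, 20, 0.6373, 0.6497),
    (0.6457, 0.72, 30, 0.6226, 0.6725),
    (0.6460, 0.96, 40, 0.6437, 0.6778),
    (0.6448, 1.20, 50, 0.7565, 0.6287),
    (0.6339, 1.44, 60, 0.6365, 0.6655),
    (0.6207, 1.68, 70, 0.6694, 0.6778),
    (0.6217, 1.92, 80, 0.6351, 0.6340),
    (0.5928, 2.16, 90, 0.7245, 0.6497),
    (0.5938, 2.40, 100, 0.6739, 0.6497),
    (0.5873, 2.63, 110, 0.6811, 0.6357),
    (0.5442, 2.87, 120, 0.6774, 0.6375),
]

# Step with the lowest validation loss (index 3 of each row).
lowest_val = min(new_rows, key=lambda row: row[3])

# Step with the highest accuracy (index 4); max() keeps the first row on ties.
best_acc = max(new_rows, key=lambda row: row[4])

first, last = new_rows[0], new_rows[-1]

print(f"lowest validation loss: {lowest_val[3]} at step {lowest_val[2]}")
print(f"highest accuracy: {best_acc[4]} at step {best_acc[2]}")
print(f"training loss change:   {first[0]} -> {last[0]}")
print(f"validation loss change: {first[3]} -> {last[3]}")
```

In the logged rows, training loss falls over the run while validation loss ends higher than it started, which may indicate mild overfitting after the first epoch. The headline metrics in the new README (Loss 0.6309, Accuracy 0.6958) do not match any logged row, so they presumably come from a separate evaluation pass.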

config.json CHANGED

@@ -8,7 +8,13 @@
   "dim": 768,
   "dropout": 0.1,
   "hidden_dim": 3072,
   "initializer_range": 0.02,
   "max_position_embeddings": 512,
   "model_type": "distilbert",
   "n_heads": 12,

   "dim": 768,
   "dropout": 0.1,
   "hidden_dim": 3072,
+  "id2label": {
+    "0": "LABEL_0"
+  },
   "initializer_range": 0.02,
+  "label2id": {
+    "LABEL_0": 0
+  },
   "max_position_embeddings": 512,
   "model_type": "distilbert",
   "n_heads": 12,

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a20dc24c0161467ae7ff0c8624f3f14e5f4db0ecaa6ab2598194953089433bfe
-size 267832560

 version https://git-lfs.github.com/spec/v1
+oid sha256:646513e42759a94d1296e28e149ebe7a02d837f6e862c8c856dde8ad7cc3d9d7
+size 267829484
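
The size change in the LFS pointer can be checked against the config.json change with a little arithmetic. The sketch below rests on assumptions that the diff does not confirm: that the weights are stored as 4-byte float32 values, that the safetensors header length did not change, and that the final classification layer has one row of `dim` weights plus one bias per label. Under those assumptions, the byte difference corresponds to exactly one output row, which would be consistent with the head going from two labels (the library default when `id2label` is absent) to the single `LABEL_0` now listed.

```python
# Sizes taken from the model.safetensors LFS pointers in the diff.
old_size = 267832560
new_size = 267829484

# "dim" from config.json; each classifier output row has dim weights + 1 bias.
hidden_dim = 768
params_per_label = hidden_dim + 1

# Assumption: parameters are stored as float32 (4 bytes each).
bytes_per_param = 4

delta_bytes = old_size - new_size
params_removed, leftover_bytes = divmod(delta_bytes, bytes_per_param)
labels_removed, leftover_params = divmod(params_removed, params_per_label)

print(f"size difference: {delta_bytes} bytes")
print(f"parameters removed: {params_removed} (leftover bytes: {leftover_bytes})")
print(f"classifier rows removed: {labels_removed} (leftover params: {leftover_params})")
```

A single-output head is the usual shape for a scalar reward score, though the diff itself does not state the training objective.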