DarshanDeshpande commited on
Commit
bdeb961
·
verified ·
1 Parent(s): 4ad45b2

distilbert_social_reasoning_reward_model

Browse files
Files changed (3) hide show
  1. README.md +14 -14
  2. config.json +6 -0
  3. model.safetensors +2 -2
README.md CHANGED
@@ -19,8 +19,8 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.6145
23
- - Accuracy: 0.6871
24
 
25
  ## Model description
26
 
@@ -54,18 +54,18 @@ The following hyperparameters were used during training:
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
56
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
57
- | 0.6733 | 0.24 | 10 | 0.6564 | 0.6725 |
58
- | 0.6475 | 0.48 | 20 | 0.6191 | 0.6708 |
59
- | 0.655 | 0.72 | 30 | 0.6216 | 0.6708 |
60
- | 0.6489 | 0.96 | 40 | 0.6311 | 0.6708 |
61
- | 0.6204 | 1.2 | 50 | 0.6837 | 0.6147 |
62
- | 0.5924 | 1.44 | 60 | 0.6329 | 0.6988 |
63
- | 0.6124 | 1.68 | 70 | 0.6220 | 0.6620 |
64
- | 0.6123 | 1.92 | 80 | 0.6366 | 0.6515 |
65
- | 0.562 | 2.16 | 90 | 0.6584 | 0.6532 |
66
- | 0.5169 | 2.4 | 100 | 0.6956 | 0.6410 |
67
- | 0.5045 | 2.63 | 110 | 0.6823 | 0.6392 |
68
- | 0.4712 | 2.87 | 120 | 0.6927 | 0.6375 |
69
 
70
 
71
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.6309
23
+ - Accuracy: 0.6958
24
 
25
  ## Model description
26
 
 
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
56
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
57
+ | 0.6618 | 0.24 | 10 | 0.6505 | 0.6725 |
58
+ | 0.6357 | 0.48 | 20 | 0.6373 | 0.6497 |
59
+ | 0.6457 | 0.72 | 30 | 0.6226 | 0.6725 |
60
+ | 0.646 | 0.96 | 40 | 0.6437 | 0.6778 |
61
+ | 0.6448 | 1.2 | 50 | 0.7565 | 0.6287 |
62
+ | 0.6339 | 1.44 | 60 | 0.6365 | 0.6655 |
63
+ | 0.6207 | 1.68 | 70 | 0.6694 | 0.6778 |
64
+ | 0.6217 | 1.92 | 80 | 0.6351 | 0.6340 |
65
+ | 0.5928 | 2.16 | 90 | 0.7245 | 0.6497 |
66
+ | 0.5938 | 2.4 | 100 | 0.6739 | 0.6497 |
67
+ | 0.5873 | 2.63 | 110 | 0.6811 | 0.6357 |
68
+ | 0.5442 | 2.87 | 120 | 0.6774 | 0.6375 |
69
 
70
 
71
  ### Framework versions
config.json CHANGED
@@ -8,7 +8,13 @@
8
  "dim": 768,
9
  "dropout": 0.1,
10
  "hidden_dim": 3072,
 
 
 
11
  "initializer_range": 0.02,
 
 
 
12
  "max_position_embeddings": 512,
13
  "model_type": "distilbert",
14
  "n_heads": 12,
 
8
  "dim": 768,
9
  "dropout": 0.1,
10
  "hidden_dim": 3072,
11
+ "id2label": {
12
+ "0": "LABEL_0"
13
+ },
14
  "initializer_range": 0.02,
15
+ "label2id": {
16
+ "LABEL_0": 0
17
+ },
18
  "max_position_embeddings": 512,
19
  "model_type": "distilbert",
20
  "n_heads": 12,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a20dc24c0161467ae7ff0c8624f3f14e5f4db0ecaa6ab2598194953089433bfe
3
- size 267832560
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:646513e42759a94d1296e28e149ebe7a02d837f6e862c8c856dde8ad7cc3d9d7
3
+ size 267829484