haryoaw/scenario-NON-KD-PR-COPY-CDF-ALL-D2_data-cardiffnlp_tweet_sentiment_multilingual_

@@ -10,23 +10,7 @@ metrics:
 - f1
 model-index:
 - name: scenario-NON-KD-PR-COPY-CDF-ALL-D2_data-cardiffnlp_tweet_sentiment_multilingual_
-  results:
-  - task:
-      name: Text Classification
-      type: text-classification
-    dataset:
-      name: tweet_sentiment_multilingual
-      type: tweet_sentiment_multilingual
-      config: all
-      split: validation
-      args: all
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.589891975308642
-    - name: F1
-      type: f1
-      value: 0.588413122388427
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -36,9 +20,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the tweet_sentiment_multilingual dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.1967
-- Accuracy: 0.5899
-- F1: 0.5884
 ## Model description
@@ -60,7 +44,7 @@ The following hyperparameters were used during training:
 - learning_rate: 5e-05
 - train_batch_size: 32
 - eval_batch_size: 32
-- seed: 11213
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 50
@@ -69,52 +53,52 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
-| 1.0743        | 1.09  | 500   | 1.0031          | 0.5066   | 0.5042 |
-| 0.9346        | 2.17  | 1000  | 0.9437          | 0.5656   | 0.5679 |
-| 0.8123        | 3.26  | 1500  | 0.9108          | 0.5949   | 0.5905 |
-| 0.6842        | 4.35  | 2000  | 1.1082          | 0.5756   | 0.5661 |
-| 0.5603        | 5.43  | 2500  | 1.1812          | 0.5907   | 0.5828 |
-| 0.4284        | 6.52  | 3000  | 1.3230          | 0.5895   | 0.5870 |
-| 0.3295        | 7.61  | 3500  | 1.4855          | 0.5637   | 0.5638 |
-| 0.2589        | 8.7   | 4000  | 1.5869          | 0.5837   | 0.5784 |
-| 0.2035        | 9.78  | 4500  | 1.8098          | 0.5826   | 0.5776 |
-| 0.1755        | 10.87 | 5000  | 1.7393          | 0.5887   | 0.5856 |
-| 0.1497        | 11.96 | 5500  | 2.1213          | 0.5887   | 0.5828 |
-| 0.13          | 13.04 | 6000  | 2.2126          | 0.5833   | 0.5827 |
-| 0.1151        | 14.13 | 6500  | 2.2685          | 0.5818   | 0.5811 |
-| 0.1028        | 15.22 | 7000  | 2.5633          | 0.5826   | 0.5827 |
-| 0.0962        | 16.3  | 7500  | 2.4350          | 0.5795   | 0.5770 |
-| 0.0804        | 17.39 | 8000  | 2.6830          | 0.5806   | 0.5752 |
-| 0.0781        | 18.48 | 8500  | 2.6389          | 0.5818   | 0.5811 |
-| 0.0677        | 19.57 | 9000  | 2.6490          | 0.5806   | 0.5788 |
-| 0.0593        | 20.65 | 9500  | 2.9908          | 0.5768   | 0.5732 |
-| 0.0578        | 21.74 | 10000 | 2.9127          | 0.5845   | 0.5828 |
-| 0.0493        | 22.83 | 10500 | 3.0101          | 0.5802   | 0.5744 |
-| 0.0455        | 23.91 | 11000 | 2.9419          | 0.5795   | 0.5779 |
-| 0.0351        | 25.0  | 11500 | 3.2339          | 0.5752   | 0.5742 |
-| 0.0369        | 26.09 | 12000 | 3.2997          | 0.5899   | 0.5818 |
-| 0.0291        | 27.17 | 12500 | 3.5819          | 0.5833   | 0.5804 |
-| 0.0281        | 28.26 | 13000 | 3.4498          | 0.5795   | 0.5798 |
-| 0.0258        | 29.35 | 13500 | 3.5006          | 0.5768   | 0.5768 |
-| 0.027         | 30.43 | 14000 | 3.4740          | 0.5849   | 0.5832 |
-| 0.0218        | 31.52 | 14500 | 3.2293          | 0.5918   | 0.5907 |
-| 0.0227        | 32.61 | 15000 | 3.4840          | 0.5876   | 0.5861 |
-| 0.0212        | 33.7  | 15500 | 3.2922          | 0.5845   | 0.5841 |
-| 0.0119        | 34.78 | 16000 | 3.9035          | 0.5729   | 0.5744 |
-| 0.019         | 35.87 | 16500 | 3.5470          | 0.5795   | 0.5781 |
-| 0.0146        | 36.96 | 17000 | 3.7651          | 0.5795   | 0.5772 |
-| 0.0144        | 38.04 | 17500 | 3.7248          | 0.5829   | 0.5787 |
-| 0.0077        | 39.13 | 18000 | 4.1509          | 0.5806   | 0.5754 |
-| 0.0097        | 40.22 | 18500 | 3.8829          | 0.5829   | 0.5796 |
-| 0.0092        | 41.3  | 19000 | 3.8987          | 0.5853   | 0.5842 |
-| 0.0087        | 42.39 | 19500 | 3.8544          | 0.5899   | 0.5882 |
-| 0.0083        | 43.48 | 20000 | 3.9211          | 0.5895   | 0.5855 |
-| 0.006         | 44.57 | 20500 | 3.9856          | 0.5868   | 0.5856 |
-| 0.0062        | 45.65 | 21000 | 4.0873          | 0.5891   | 0.5872 |
-| 0.0027        | 46.74 | 21500 | 4.1639          | 0.5891   | 0.5888 |
-| 0.0052        | 47.83 | 22000 | 4.1754          | 0.5914   | 0.5893 |
-| 0.0031        | 48.91 | 22500 | 4.1914          | 0.5887   | 0.5879 |
-| 0.0035        | 50.0  | 23000 | 4.1967          | 0.5899   | 0.5884 |
 ### Framework versions

 - f1
 model-index:
 - name: scenario-NON-KD-PR-COPY-CDF-ALL-D2_data-cardiffnlp_tweet_sentiment_multilingual_
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the tweet_sentiment_multilingual dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.2776
+- Accuracy: 0.5490
+- F1: 0.5470
 ## Model description
 - learning_rate: 5e-05
 - train_batch_size: 32
 - eval_batch_size: 32
+- seed: 333
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 50
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
+| 1.0901        | 1.09  | 500   | 1.0564          | 0.4379   | 0.4046 |
+| 1.0001        | 2.17  | 1000  | 1.0287          | 0.5085   | 0.4941 |
+| 0.9108        | 3.26  | 1500  | 1.0254          | 0.5316   | 0.5273 |
+| 0.8453        | 4.35  | 2000  | 0.9739          | 0.5390   | 0.5363 |
+| 0.786         | 5.43  | 2500  | 0.9965          | 0.5540   | 0.5500 |
+| 0.7317        | 6.52  | 3000  | 1.0309          | 0.5505   | 0.5452 |
+| 0.6714        | 7.61  | 3500  | 1.1479          | 0.5444   | 0.5466 |
+| 0.6192        | 8.7   | 4000  | 1.0839          | 0.5536   | 0.5533 |
+| 0.5693        | 9.78  | 4500  | 1.2411          | 0.5382   | 0.5259 |
+| 0.5114        | 10.87 | 5000  | 1.2202          | 0.5486   | 0.5502 |
+| 0.4705        | 11.96 | 5500  | 1.4185          | 0.5478   | 0.5445 |
+| 0.425         | 13.04 | 6000  | 1.3994          | 0.5417   | 0.5314 |
+| 0.3815        | 14.13 | 6500  | 1.5880          | 0.5475   | 0.5475 |
+| 0.3405        | 15.22 | 7000  | 1.5789          | 0.5405   | 0.5330 |
+| 0.3046        | 16.3  | 7500  | 1.7872          | 0.5405   | 0.5328 |
+| 0.279         | 17.39 | 8000  | 1.7094          | 0.5417   | 0.5390 |
+| 0.2488        | 18.48 | 8500  | 1.7790          | 0.5471   | 0.5451 |
+| 0.2203        | 19.57 | 9000  | 1.8204          | 0.5478   | 0.5464 |
+| 0.2145        | 20.65 | 9500  | 1.9339          | 0.5448   | 0.5386 |
+| 0.1869        | 21.74 | 10000 | 2.1092          | 0.5390   | 0.5360 |
+| 0.1788        | 22.83 | 10500 | 1.9770          | 0.5540   | 0.5513 |
+| 0.1473        | 23.91 | 11000 | 2.1967          | 0.5471   | 0.5425 |
+| 0.1437        | 25.0  | 11500 | 2.1961          | 0.5513   | 0.5431 |
+| 0.1296        | 26.09 | 12000 | 2.2828          | 0.5536   | 0.5518 |
+| 0.1151        | 27.17 | 12500 | 2.3900          | 0.5405   | 0.5346 |
+| 0.1151        | 28.26 | 13000 | 2.5206          | 0.5440   | 0.5394 |
+| 0.1058        | 29.35 | 13500 | 2.5638          | 0.5463   | 0.5413 |
+| 0.1056        | 30.43 | 14000 | 2.6504          | 0.5417   | 0.5351 |
+| 0.098         | 31.52 | 14500 | 2.6291          | 0.5571   | 0.5544 |
+| 0.0918        | 32.61 | 15000 | 2.6844          | 0.5421   | 0.5408 |
+| 0.0873        | 33.7  | 15500 | 2.7813          | 0.5401   | 0.5403 |
+| 0.0897        | 34.78 | 16000 | 2.8257          | 0.5459   | 0.5428 |
+| 0.0781        | 35.87 | 16500 | 2.8813          | 0.5478   | 0.5450 |
+| 0.0698        | 36.96 | 17000 | 3.0486          | 0.5336   | 0.5303 |
+| 0.0674        | 38.04 | 17500 | 3.1261          | 0.5475   | 0.5417 |
+| 0.0756        | 39.13 | 18000 | 3.0463          | 0.5482   | 0.5480 |
+| 0.0592        | 40.22 | 18500 | 3.1190          | 0.5440   | 0.5412 |
+| 0.0562        | 41.3  | 19000 | 3.1770          | 0.5370   | 0.5342 |
+| 0.0575        | 42.39 | 19500 | 3.1928          | 0.5432   | 0.5405 |
+| 0.0534        | 43.48 | 20000 | 3.2141          | 0.5494   | 0.5462 |
+| 0.0487        | 44.57 | 20500 | 3.2784          | 0.5440   | 0.5376 |
+| 0.0472        | 45.65 | 21000 | 3.2675          | 0.5451   | 0.5420 |
+| 0.0495        | 46.74 | 21500 | 3.2487          | 0.5502   | 0.5474 |
+| 0.0411        | 47.83 | 22000 | 3.2628          | 0.5486   | 0.5468 |
+| 0.0417        | 48.91 | 22500 | 3.2780          | 0.5494   | 0.5476 |
+| 0.0412        | 50.0  | 23000 | 3.2776          | 0.5490   | 0.5470 |
 ### Framework versions
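
The added training-results rows above suggest that validation loss bottoms out early while training loss keeps falling. A minimal Python sketch for checking this is shown here. It uses five rows copied from the added (+) side of the table, including the rows holding the lowest validation loss and the highest accuracy in that table. The names `FIELDS` and `parse_rows` are hypothetical helpers and are not part of the repository.

```python
# Rows copied verbatim from the added (+) side of the training-results table.
TABLE = """\
| 1.0901        | 1.09  | 500   | 1.0564          | 0.4379   | 0.4046 |
| 0.8453        | 4.35  | 2000  | 0.9739          | 0.5390   | 0.5363 |
| 0.786         | 5.43  | 2500  | 0.9965          | 0.5540   | 0.5500 |
| 0.098         | 31.52 | 14500 | 2.6291          | 0.5571   | 0.5544 |
| 0.0412        | 50.0  | 23000 | 3.2776          | 0.5490   | 0.5470 |
"""

# Column order follows the table header in the model card.
FIELDS = ("train_loss", "epoch", "step", "val_loss", "accuracy", "f1")


def parse_rows(table):
    """Turn markdown table rows into a list of dicts keyed by FIELDS."""
    rows = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        row = dict(zip(FIELDS, (float(cell) for cell in cells)))
        row["step"] = int(row["step"])
        rows.append(row)
    return rows


rows = parse_rows(TABLE)
best_loss = min(rows, key=lambda r: r["val_loss"])
best_acc = max(rows, key=lambda r: r["accuracy"])

print("lowest validation loss:", best_loss["step"], best_loss["val_loss"])
print("highest accuracy:", best_acc["step"], best_acc["accuracy"])
```

On these rows the lowest validation loss falls at step 2000 and the highest accuracy at step 14500, whereas the model card reports the metrics of the final step (23000).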

config.json CHANGED

@@ -1,7 +1,7 @@
 {
   "_name_or_path": "xlm-roberta-base",
   "architectures": [
-    "XLMRobertaForSequenceClassification"
   ],
   "attention_probs_dropout_prob": 0.1,
   "bos_token_id": 0,
@@ -9,14 +9,14 @@
   "eos_token_id": 2,
   "hidden_act": "gelu",
   "hidden_dropout_prob": 0.1,
-  "hidden_size": 768,
   "id2label": {
     "0": "LABEL_0",
     "1": "LABEL_1",
     "2": "LABEL_2"
   },
   "initializer_range": 0.02,
-  "intermediate_size": 3072,
   "label2id": {
     "LABEL_0": 0,
     "LABEL_1": 1,

 {
   "_name_or_path": "xlm-roberta-base",
   "architectures": [
+    "XLMRobertaForSequenceClassificationKD"
   ],
   "attention_probs_dropout_prob": 0.1,
   "bos_token_id": 0,
   "eos_token_id": 2,
   "hidden_act": "gelu",
   "hidden_dropout_prob": 0.1,
+  "hidden_size": 384,
   "id2label": {
     "0": "LABEL_0",
     "1": "LABEL_1",
     "2": "LABEL_2"
   },
   "initializer_range": 0.02,
+  "intermediate_size": 1536,
   "label2id": {
     "LABEL_0": 0,
     "LABEL_1": 1,

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f2b00f463126da80528767f9f7ba7b18d771484abce738c5d2b1e8bbbee1f62a
-size 942111086

 version https://git-lfs.github.com/spec/v1
+oid sha256:f105078f6541c246d9a0ab2cd1e3b7423591a6a6fa7484509b8fa313d42f4d25
+size 429199798
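
The config.json and pytorch_model.bin changes above can be cross-checked with simple arithmetic. The sketch below is a rough illustration in Python and not part of the commit. The config values and byte sizes are copied from the diff. The 4 bytes per parameter figure is an assumption (float32 weights) that the diff does not confirm, so the parameter counts are only estimates.

```python
# Values copied from the removed (-) and added (+) lines of the config.json diff.
OLD_CONFIG = {"hidden_size": 768, "intermediate_size": 3072}
NEW_CONFIG = {"hidden_size": 384, "intermediate_size": 1536}

# Sizes copied from the Git LFS pointers for pytorch_model.bin.
OLD_SIZE_BYTES = 942_111_086
NEW_SIZE_BYTES = 429_199_798

# Assumption: weights are stored as float32 (4 bytes each); not stated in the diff.
BYTES_PER_PARAM = 4

# Per-field shrink factor of the new config relative to the old one.
config_ratios = {key: NEW_CONFIG[key] / OLD_CONFIG[key] for key in OLD_CONFIG}

# Feed-forward expansion factor (intermediate_size / hidden_size) before and after.
old_expansion = OLD_CONFIG["intermediate_size"] // OLD_CONFIG["hidden_size"]
new_expansion = NEW_CONFIG["intermediate_size"] // NEW_CONFIG["hidden_size"]

# Checkpoint shrink factor and rough parameter estimates under the float32 assumption.
size_ratio = NEW_SIZE_BYTES / OLD_SIZE_BYTES
approx_old_params = OLD_SIZE_BYTES // BYTES_PER_PARAM
approx_new_params = NEW_SIZE_BYTES // BYTES_PER_PARAM

print("config ratios:", config_ratios)
print("expansion factor old/new:", old_expansion, new_expansion)
print(f"checkpoint size ratio: {size_ratio:.3f}")
print("approx. parameters old/new:", approx_old_params, approx_new_params)
```

Both width settings are halved while the feed-forward expansion factor stays at 4, and the checkpoint shrinks to a little under half its previous size. The file does not shrink by the full factor that halving the width alone might suggest; which components account for the remainder cannot be determined from the truncated config shown here.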

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:40174e22a5a2e05c1689834ecc61f6377ae5b665a9c88a2c3d896fcd77e524bd
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:166ec74f89602a4d853987af4b4f99ed1bccaff0844ae7d838470c6f12d39ac9
 size 4664