albarpambagio commited on
Commit
16a0925
·
verified ·
1 Parent(s): 606bc3b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -7
README.md CHANGED
@@ -11,6 +11,13 @@ metrics:
11
  model-index:
12
  - name: er-model
13
  results: []
 
 
 
 
 
 
 
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,7 +25,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # er-model
20
 
21
- This model is a fine-tuned version of [indolem/indobertweet-base-uncased](https://huggingface.co/indolem/indobertweet-base-uncased) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
  - Loss: 0.6762
24
  - Accuracy: 0.6981
@@ -28,17 +35,15 @@ It achieves the following results on the evaluation set:
28
 
29
  ## Model description
30
 
31
- More information needed
32
 
33
- ## Intended uses & limitations
34
 
35
- More information needed
36
 
37
  ## Training and evaluation data
38
 
39
- More information needed
 
40
 
41
- ## Training procedure
42
 
43
  ### Training hyperparameters
44
 
@@ -68,4 +73,4 @@ The following hyperparameters were used during training:
68
  - Transformers 4.41.2
69
  - Pytorch 2.1.2
70
  - Datasets 2.19.2
71
- - Tokenizers 0.19.1
 
11
  model-index:
12
  - name: er-model
13
  results: []
14
+ datasets:
15
+ - SEACrowd/prdect_id
16
+ language:
17
+ - id
18
+ widget:
19
+ - text: Ini toko korup.,ga sesuai sama isinya..not recommended
20
+ example_title: Contoh
21
  ---
22
 
23
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
25
 
26
  # er-model
27
 
28
+ This model is a fine-tuned version of [indolem/indobertweet-base-uncased](https://huggingface.co/indolem/indobertweet-base-uncased) on [The PRDECT-ID Dataset](https://www.kaggle.com/datasets/jocelyndumlao/prdect-id-indonesian-emotion-classification), it is a compilation of Indonesian product reviews that come with emotion and sentiment labels. These reviews were gathered from one of Indonesia's largest e-commerce platforms, Tokopedia..
29
  It achieves the following results on the evaluation set:
30
  - Loss: 0.6762
31
  - Accuracy: 0.6981
 
35
 
36
  ## Model description
37
 
38
+ It has been trained to classify text into six different emotion categories: happy, sadness, anger, love, and fear.
39
 
 
40
 
 
41
 
42
  ## Training and evaluation data
43
 
44
+ I split my dataframe df into training, validation, and testing sets (train_df, val_df, test_df) using the train_test_split function from sklearn.model_selection. I set the test size to 20% for the initial split and further divided the remaining data equally between validation and testing sets. This process ensures that each split (val_df and test_df) maintains the same class distribution as the original dataset (stratify=df['label']).
45
+
46
 
 
47
 
48
  ### Training hyperparameters
49
 
 
73
  - Transformers 4.41.2
74
  - Pytorch 2.1.2
75
  - Datasets 2.19.2
76
+ - Tokenizers 0.19.1