albarpambagio/indobertweet-base-uncased-emotion-recognition

Text Classification

Generated from Trainer

albarpambagio committed on Jul 10, 2024

Commit 16a0925 · verified · 1 Parent(s): 606bc3b

Update README.md

Files changed (1)

README.md +12 -7

@@ -11,6 +11,13 @@ metrics:
 model-index:
 - name: er-model
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,7 +25,7 @@ should probably proofread and complete it, then remove this comment. -->
 # er-model
-This model is a fine-tuned version of [indolem/indobertweet-base-uncased](https://huggingface.co/indolem/indobertweet-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6762
 - Accuracy: 0.6981
@@ -28,17 +35,15 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -68,4 +73,4 @@ The following hyperparameters were used during training:
 - Transformers 4.41.2
 - Pytorch 2.1.2
 - Datasets 2.19.2
-- Tokenizers 0.19.1

 model-index:
 - name: er-model
   results: []
+datasets:
+- SEACrowd/prdect_id
+language:
+- id
+widget:
+- text: Ini toko korup.,ga sesuai sama isinya..not recommended
+  example_title: Contoh
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # er-model
+This model is a fine-tuned version of [indolem/indobertweet-base-uncased](https://huggingface.co/indolem/indobertweet-base-uncased) on the [PRDECT-ID dataset](https://www.kaggle.com/datasets/jocelyndumlao/prdect-id-indonesian-emotion-classification), a compilation of Indonesian product reviews annotated with emotion and sentiment labels. The reviews were gathered from Tokopedia, one of Indonesia's largest e-commerce platforms.
 It achieves the following results on the evaluation set:
 - Loss: 0.6762
 - Accuracy: 0.6981
 ## Model description
+It has been trained to classify text into five emotion categories: happy, sadness, anger, love, and fear.
 ## Training and evaluation data
+I split my dataframe `df` into training, validation, and test sets (`train_df`, `val_df`, `test_df`) using the `train_test_split` function from `sklearn.model_selection`. The initial split holds out 20% of the data, and I then divided that held-out portion equally between the validation and test sets. Stratifying on the label column (`stratify=df['label']`) keeps the class distribution of each split the same as in the original dataset.
 ### Training hyperparameters
 - Transformers 4.41.2
 - Pytorch 2.1.2
 - Datasets 2.19.2
+- Tokenizers 0.19.1
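
The split described under "Training and evaluation data" can be sketched roughly as follows. This is a minimal illustration, not the author's actual script: the toy dataframe, the `holdout_df` name, and `random_state=42` are assumptions, and the second split stratifies on the held-out labels since `train_test_split` needs labels that match the rows being split.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy data standing in for the PRDECT-ID reviews (assumption).
labels = ["happy", "sadness", "anger", "love", "fear"]
df = pd.DataFrame({
    "text": [f"review {i}" for i in range(100)],
    "label": [labels[i % len(labels)] for i in range(100)],
})

# First split: hold out 20% of the rows, preserving the label distribution.
train_df, holdout_df = train_test_split(
    df, test_size=0.2, stratify=df["label"], random_state=42
)

# Second split: divide the held-out rows equally into validation and test,
# stratifying on the held-out labels.
val_df, test_df = train_test_split(
    holdout_df, test_size=0.5, stratify=holdout_df["label"], random_state=42
)

print(len(train_df), len(val_df), len(test_df))
```

Under these assumptions the result is an 80/10/10 split of the original rows, with each emotion label represented in the same proportion in every subset.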