wonrax
/

phobert-base-vietnamese-sentiment

Text Classification

Inference Endpoints

Model card Files Files and versions Community

wonrax commited on May 4, 2022

Commit

5621bf6

·

1 Parent(s): 82cd347

Update README.md

Files changed (1) hide show

README.md +29 -1

README.md CHANGED Viewed

@@ -11,7 +11,35 @@ widget:
 - text: "Cái này giá ổn không nhỉ?"
 ---
 Dataset: [30K e-commerce reviews](https://www.kaggle.com/datasets/linhlpv/vietnamese-sentiment-analyst)
-I'll add some more info soon

 - text: "Cái này giá ổn không nhỉ?"
 ---
+A model fine-tuned for sentiment analysis based on [vinai/phobert-base](https://huggingface.co/vinai/phobert-base).
+Labels:
+- NEG: Negative
+- POS: Positive
+- NEU: Neutral
 Dataset: [30K e-commerce reviews](https://www.kaggle.com/datasets/linhlpv/vietnamese-sentiment-analyst)
+## Usage
+```python
+import torch
+from transformers import RobertaForSequenceClassification, AutoTokenizer
+model = RobertaForSequenceClassification.from_pretrained("wonrax/phobert-base-vietnamese-sentiment")
+tokenizer = AutoTokenizer.from_pretrained("wonrax/phobert-base-vietnamese-sentiment", use_fast=False)
+# Just like PhoBERT: INPUT TEXT MUST BE ALREADY WORD-SEGMENTED!
+sentence = 'Đây là mô_hình rất hay , phù_hợp với điều_kiện và như cầu của nhiều người .'
+input_ids = torch.tensor([tokenizer.encode(sentence)])
+with torch.no_grad():
+    out = model(input_ids)
+    print(out.logits.softmax(dim=-1).tolist())
+    # Output:
+    # [[0.002, 0.988, 0.01]]
+    #     ^      ^      ^
+    #    NEG    POS    NEU
+```