mastikaui
/

NLP-Sentiment-Analysis-Airline-Tweets-with-BERT-V2

Text Classification

Inference Endpoints

Model card Files Files and versions Community

NLP-Sentiment-Analysis-Airline-Tweets-with-BERT-V2 / README.md

mastikaui's picture

Update README.md

edc708a verified 6 months ago

|

history blame contribute delete

2.13 kB

	#NLP-Sentiment-Analysis-Airline-Tweets-with-BERT-V2

	This repository features sentiment analysis projects that leverage BERT, a leading NLP model.
	This project involves pre-processing, tokenization, and BERT customization for airline tweet sentiment classification.
	The tasks in this model use the original model "BERT base model (no casing)",
	uses a data set: https://www.kaggle.com/datasets/crowdflower/twitter-airline-sentiment,
	and there are several stages in achieving results, below are the evaluation sets
	Accuracy: 0.8203551912568307

	Colab notebook for improvements: https://colab.research.google.com/drive/1IQen2iNXkjOgdzjyi7PQyLFqHyqHTF3A?usp=sharing

	#classification report for more detailed evaluation :

	\| \| precision \| recall \| f1-score \| support \|
	\|-----------\|-----------\|--------\|----------\|---------\|
	\| negative \| 0.88 \| 0.90 \| 0.89 \| 959 \|
	\| neutral \| 0.68 \| 0.58 \| 0.62 \| 293 \|
	\| positive \| 0.72 \| 0.81 \| 0.76 \| 212 \|
	\|-----------\|-----------\|--------\|----------\|---------\|
	\| accuracy \| \| \| 0.82 \| 1464 \|
	\| macro avg \| 0.76 \| 0.76 \| 0.76 \| 1464 \|
	\| weighted avg \| 0.82 \| 0.82 \| 0.82 \| 1464 \|

	The sentiment classification model achieved a promising
	overall accuracy of 82.04%, built on BertForSequenceClassifi-
	cation and trained for 10 epochs using AdamW optimization.
	The model exhibited stable performance, with validation ac-
	curacy consistently between 0.79 to 0.81, indicating effective
	learning. Additionally, it showed high precision, particularly
	for negative sentiment (0.88), along with moderate scores for
	neutral (0.68) and positive (0.72) sentiments. These results
	were supported by recall and F1-score metrics, providing a
	comprehensive understanding of performance across sentiment
	classes. The analysis of the confusion matrix revealed strong
	alignment between model predictions and actual labels, al-
	beit with opportunities for improvement, such as addressing
	overfitting or parameter adjustment, evident from performance
	fluctuations across epochs.

	Developed by:Mastika