---
language: mr
tags:
- bert
license: cc-by-4.0
datasets:
- L3Cube-MahaSent-MD
- L3CubeMahaSent
widget:
- text: "I like you. I love you."
---

## MahaSent-PT

MahaSent-PT is a MahaBERT (l3cube-pune/marathi-bert-v2) model fine-tuned on the L3Cube-MahaSent-PT corpus, a political-domain Marathi sentiment analysis dataset.
This dataset was originally introduced in L3CubeMahaSent and is now part of the extended corpus L3Cube-MahaSent-MD, a multi-domain Marathi sentiment analysis dataset.
The MahaSent-MD dataset contains domains like movie reviews, generic tweets, subtitles, and political tweets. This model is trained specifically on the political tweets domain.
The recommended multi-domain version of this model, which covers all domains, is shared here: marathi-sentiment-md
[dataset link](https://github.com/l3cube-pune/MarathiNLP)

More details on the dataset, models, and baseline results can be found in our paper (coming soon).
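Since this is a fine-tuned BERT classifier, it can presumably be loaded through the Hugging Face `transformers` text-classification pipeline. The following is a minimal sketch under that assumption, not an official usage recipe. The helper names `build_classifier` and `classify` are illustrative. The Hub ID in the commented usage lines is an assumption, as this card does not state it. The label names returned depend on the checkpoint's configuration and are not documented here.

```python
from typing import Any, Callable, Dict, List


def build_classifier(model_id: str):
    """Load a text-classification pipeline for the given Hub checkpoint."""
    # Imported lazily so the rest of the module works without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id, tokenizer=model_id)


def classify(
    classifier: Callable[[List[str]], List[Dict[str, Any]]],
    texts: List[str],
) -> List[str]:
    """Return the top predicted label for each input text."""
    # For a list of strings, the pipeline returns one {"label", "score"} dict per text.
    results = classifier(texts)
    return [result["label"] for result in results]


# Example usage (downloads the checkpoint on first call).
# ASSUMPTION: the model ID below is not given in this card; substitute the real one.
# clf = build_classifier("l3cube-pune/marathi-sentiment-political-tweets")
# print(classify(clf, ["हा निर्णय खूप चांगला आहे."]))
```

Separating loading from classification means the pipeline is built once and reused across batches. It also means the same `classify` helper should work with the multi-domain checkpoint by passing a different model ID.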
Citing:

```
@inproceedings{kulkarni2021l3cubemahasent,
  title={L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset},
  author={Kulkarni, Atharva and Mandhane, Meet and Likhitkar, Manali and Kshirsagar, Gayatri and Joshi, Raviraj},
  booktitle={Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis},
  pages={213--220},
  year={2021}
}
```

```
@article{joshi2022l3cube,
  title={L3cube-mahanlp: Marathi natural language processing datasets, models, and library},
  author={Joshi, Raviraj},
  journal={arXiv preprint arXiv:2205.14728},
  year={2022}
}
```

Other Marathi sentiment models from the MahaSent family are shared here:
- MahaSent-MD (multi-domain)
- MahaSent-GT (generic tweets)
- MahaSent-MR (movie reviews)
- MahaSent-PT (political tweets)
- MahaSent-ST (TV subtitles)
- MahaSent v1 (political tweets)