Multilingual Models for Sentiment and Abusive Language Detection for Dravidian Languages

Anand Kumar M


Abstract
This paper presents the TFIDF based LSTM and Hierarchical Attention Networks (HAN) for code-mixed abusive comment detection and sentiment analysis for Dravidian languages. The traditional TF-IDF-based techniques have out- performed the Hierarchical Attention models in both the sentiment analysis and abusive language detection tasks. The Tulu sentiment analysis system demonstrated better performance for the Positive and Neutral classes, whereas the Tamil sentiment analysis system exhibited lower performance overall. This highlights the need for more balanced datasets and additional research to enhance the accuracy of sentiment analysis in the Tamil language. In terms of abusive language detection, the TF-IDF-LSTM models generally outperformed the Hierarchical Attention models. However, the mixed models displayed better performance for specific classes such as “Homophobia” and “Xenophobia.” This implies that considering both code-mixed and original script data can offer a different perspective for research in social media analysis.
Anthology ID:
2023.ltedi-1.3
Volume:
Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Bharathi R. Chakravarthi, B. Bharathi, Joephine Griffith, Kalika Bali, Paul Buitelaar
Venues:
LTEDI | WS
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
17–24
Language:
URL:
https://aclanthology.org/2023.ltedi-1.3
DOI:
Bibkey:
Cite (ACL):
Anand Kumar M. 2023. Multilingual Models for Sentiment and Abusive Language Detection for Dravidian Languages. In Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion, pages 17–24, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Multilingual Models for Sentiment and Abusive Language Detection for Dravidian Languages (M, LTEDI-WS 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.ltedi-1.3.pdf