Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning

Md Tawkat Islam Khondaker, Muhammad Abdul-mageed, Laks Lakshmanan, V.s.


Abstract
The prevalence of abusive language on different online platforms has been a major concern that raises the need for automated cross-platform abusive language detection. However, prior works focus on concatenating data from multiple platforms, inherently adopting Empirical Risk Minimization (ERM) method. In this work, we address this challenge from the perspective of domain generalization objective. We design SCL-Fish, a supervised contrastive learning integrated meta-learning algorithm to detect abusive language on unseen platforms. Our experimental analysis shows that SCL-Fish achieves better performance over ERM and the existing state-of-the-art models. We also show that SCL-Fish is data-efficient and achieves comparable performance with the large-scale pre-trained models upon finetuning for the abusive language detection task.
Anthology ID:
2023.woah-1.9
Volume:
The 7th Workshop on Online Abuse and Harms (WOAH)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Yi-ling Chung, Paul R{\"ottger}, Debora Nozza, Zeerak Talat, Aida Mostafazadeh Davani
Venue:
WOAH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
96–112
Language:
URL:
https://aclanthology.org/2023.woah-1.9
DOI:
10.18653/v1/2023.woah-1.9
Bibkey:
Cite (ACL):
Md Tawkat Islam Khondaker, Muhammad Abdul-mageed, and Laks Lakshmanan, V.s.. 2023. Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning. In The 7th Workshop on Online Abuse and Harms (WOAH), pages 96–112, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning (Khondaker et al., WOAH 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.woah-1.9.pdf