Exploiting Emojis for Abusive Language Detection

Michael Wiegand, Josef Ruppenhofer


Abstract
We propose to use abusive emojis, such as the “middle finger” or “face vomiting”, as a proxy for learning a lexicon of abusive words. Since an emoji represents extralinguistic information, a single emoji can co-occur with different forms of explicitly abusive utterances. We show that our approach generates a lexicon that offers the same performance in cross-domain classification of abusive microposts as the most advanced lexicon induction method. That method, in contrast, depends on manually annotated seed words and expensive lexical resources for bootstrapping (e.g. WordNet). We demonstrate that the same emojis can also be used effectively in languages other than English. Finally, we show that emojis can be exploited for classifying mentions of ambiguous words, such as “fuck” and “bitch”, into generally abusive and just profane usages.
Anthology ID:
2021.eacl-main.28
Volume:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
Month:
April
Year:
2021
Address:
Online
Editors:
Paola Merlo, Jörg Tiedemann, Reut Tsarfaty
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
369–380
URL:
https://aclanthology.org/2021.eacl-main.28
DOI:
10.18653/v1/2021.eacl-main.28
Cite (ACL):
Michael Wiegand and Josef Ruppenhofer. 2021. Exploiting Emojis for Abusive Language Detection. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 369–380, Online. Association for Computational Linguistics.
Cite (Informal):
Exploiting Emojis for Abusive Language Detection (Wiegand & Ruppenhofer, EACL 2021)
PDF:
https://aclanthology.org/2021.eacl-main.28.pdf