Corpus Creation and Emotion Prediction for Hindi-English Code-Mixed Social Media Text

Deepanshu Vijay, Aditya Bohra, Vinay Singh, Syed Sarfaraz Akhtar, Manish Shrivastava


Abstract
Emotion Prediction is a Natural Language Processing (NLP) task dealing with detection and classification of emotions in various monolingual and bilingual texts. While some work has been done on code-mixed social media text and in emotion prediction separately, our work is the first attempt which aims at identifying the emotion associated with Hindi-English code-mixed social media text. In this paper, we analyze the problem of emotion identification in code-mixed content and present a Hindi-English code-mixed corpus extracted from twitter and annotated with the associated emotion. For every tweet in the dataset, we annotate the source language of all the words present, and also the causal language of the expressed emotion. Finally, we propose a supervised classification system which uses various machine learning techniques for detecting the emotion associated with the text using a variety of character level, word level, and lexicon based features.
Anthology ID:
N18-4018
Volume:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
Month:
June
Year:
2018
Address:
New Orleans, Louisiana, USA
Editors:
Silvio Ricardo Cordeiro, Shereen Oraby, Umashanthi Pavalanathan, Kyeongmin Rim
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
128–135
Language:
URL:
https://aclanthology.org/N18-4018
DOI:
10.18653/v1/N18-4018
Bibkey:
Cite (ACL):
Deepanshu Vijay, Aditya Bohra, Vinay Singh, Syed Sarfaraz Akhtar, and Manish Shrivastava. 2018. Corpus Creation and Emotion Prediction for Hindi-English Code-Mixed Social Media Text. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 128–135, New Orleans, Louisiana, USA. Association for Computational Linguistics.
Cite (Informal):
Corpus Creation and Emotion Prediction for Hindi-English Code-Mixed Social Media Text (Vijay et al., NAACL 2018)
Copy Citation:
PDF:
https://aclanthology.org/N18-4018.pdf