An Emotional Mess! Deciding on a Framework for Building a Dutch Emotion-Annotated Corpus

Luna De Bruyne, Orphee De Clercq, Veronique Hoste


Abstract
Seeing the myriad of existing emotion models, with the categorical versus dimensional opposition the most important dividing line, building an emotion-annotated corpus requires some well thought-out strategies concerning framework choice. In our work on automatic emotion detection in Dutch texts, we investigate this problem by means of two case studies. We find that the labels joy, love, anger, sadness and fear are well-suited to annotate texts coming from various domains and topics, but that the connotation of the labels strongly depends on the origin of the texts. Moreover, it seems that information is lost when an emotional state is forcedly classified in a limited set of categories, indicating that a bi-representational format is desirable when creating an emotion corpus.
Anthology ID:
2020.lrec-1.204
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1643–1651
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.204
DOI:
Bibkey:
Cite (ACL):
Luna De Bruyne, Orphee De Clercq, and Veronique Hoste. 2020. An Emotional Mess! Deciding on a Framework for Building a Dutch Emotion-Annotated Corpus. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 1643–1651, Marseille, France. European Language Resources Association.
Cite (Informal):
An Emotional Mess! Deciding on a Framework for Building a Dutch Emotion-Annotated Corpus (De Bruyne et al., LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.204.pdf