Capturing Chat: Annotation and Tools for Multiparty Casual Conversation.

Emer Gilmartin, Nick Campbell


Abstract
Casual multiparty conversation is an understudied but very common genre of spoken interaction, whose analysis presents a number of challenges in terms of data scarcity and annotation. We describe the annotation process used on the d64 and DANS multimodal corpora of multiparty casual talk, which have been manually segmented, transcribed, annotated for laughter and disfluencies, and aligned using the Penn Aligner. We also describe a visualization tool, STAVE, developed during the annotation process, which allows long stretches of talk or indeed entire conversations to be viewed, aiding preliminary identification of features and patterns worthy of analysis. It is hoped that this tool will be of use to other researchers working in this field.
Anthology ID:
L16-1705
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
4453–4457
Language:
URL:
https://aclanthology.org/L16-1705
DOI:
Bibkey:
Cite (ACL):
Emer Gilmartin and Nick Campbell. 2016. Capturing Chat: Annotation and Tools for Multiparty Casual Conversation.. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4453–4457, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Capturing Chat: Annotation and Tools for Multiparty Casual Conversation. (Gilmartin & Campbell, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1705.pdf