A Multimodal Corpus of Rapid Dialogue Games

Maike Paetzel, David Nicolas Racca, David DeVault


Abstract
This paper presents a multimodal corpus of spoken human-human dialogues collected as participants played a series of Rapid Dialogue Games (RDGs). The corpus consists of a collection of about 11 hours of spoken audio, video, and Microsoft Kinect data taken from 384 game interactions (dialogues). The games used for collecting the corpus required participants to give verbal descriptions of linguistic expressions or visual images and were specifically designed to engage players in a fast-paced conversation under time pressure. As a result, the corpus contains many examples of participants attempting to communicate quickly in specific game situations, and it also includes a variety of spontaneous conversational phenomena such as hesitations, filled pauses, overlapping speech, and low-latency responses. The corpus has been created to facilitate research in incremental speech processing for spoken dialogue systems. Potentially, the corpus could be used in several areas of speech and language research, including speech recognition, natural language understanding, natural language generation, and dialogue management.
Anthology ID:
L14-1548
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
4189–4195
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/697_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Maike Paetzel, David Nicolas Racca, and David DeVault. 2014. A Multimodal Corpus of Rapid Dialogue Games. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 4189–4195, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
A Multimodal Corpus of Rapid Dialogue Games (Paetzel et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/697_Paper.pdf