Designing a GWAP for Collecting Naturally Produced Dialogues for Low Resourced Languages

Zulipiye Yusupujiang, Jonathan Ginzburg


Abstract
In this paper we present a new method for collecting naturally generated dialogue data for a low resourced language, (specifically here—Uyghur). We plan to build a games with a purpose (GWAPs) to encourage native speakers to actively contribute dialogue data to our research project. Since we aim to characterize the response space of queries in Uyghur, we design various scenarios for conversations that yield to questions being posed and responded to. We will implement the GWAP with the RPG Maker MV Game Engine, and will integrate the chatroom system in the game with the Dialogue Experimental Toolkit (DiET). DiET will help us improve the data collection process, and most importantly, make us have some control over the interactions among the participants.
Anthology ID:
2020.gamnlp-1.7
Volume:
Workshop on Games and Natural Language Processing
Month:
May
Year:
2020
Address:
Marseille, France
Editor:
Stephanie M. Lukin
Venue:
GAMESandNLP
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
44–48
Language:
English
URL:
https://aclanthology.org/2020.gamnlp-1.7
DOI:
Bibkey:
Cite (ACL):
Zulipiye Yusupujiang and Jonathan Ginzburg. 2020. Designing a GWAP for Collecting Naturally Produced Dialogues for Low Resourced Languages. In Workshop on Games and Natural Language Processing, pages 44–48, Marseille, France. European Language Resources Association.
Cite (Informal):
Designing a GWAP for Collecting Naturally Produced Dialogues for Low Resourced Languages (Yusupujiang & Ginzburg, GAMESandNLP 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.gamnlp-1.7.pdf