A Document Repository for Social Media and Speech Conversations

Adam Funk, Robert Gaizauskas, Benoit Favre


Abstract
We present a successfully implemented document repository REST service for flexible SCRUD (search, crate, read, update, delete) storage of social media conversations, using a GATE/TIPSTER-like document object model and providing a query language for document features. This software is currently being used in the SENSEI research project and will be published as open-source software before the project ends. It is, to the best of our knowledge, the first freely available, general purpose data repository to support large-scale multimodal (i.e., speech or text) conversation analytics.
Anthology ID:
L16-1070
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
436–440
Language:
URL:
https://aclanthology.org/L16-1070
DOI:
Bibkey:
Cite (ACL):
Adam Funk, Robert Gaizauskas, and Benoit Favre. 2016. A Document Repository for Social Media and Speech Conversations. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 436–440, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
A Document Repository for Social Media and Speech Conversations (Funk et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1070.pdf