A Scalable Architecture For Web Deployment of Spoken Dialogue Systems

Matthew Fuchs, Nikos Tsourakis, Manny Rayner


Abstract
We describe a scalable architecture, particularly well-suited to cloud-based computing, which can be used for Web-deployment of spoken dialogue systems. In common with similar platforms, like WAMI and the Nuance Mobile Developer Platform, we use a client/server approach in which speech recognition is carried out on the server side; our architecture, however, differs from these systems in offering considerably more elaborate server-side functionality, based on large-scale grammar-based language processing and generic dialogue management. We describe two substantial applications, built using our framework, which we argue would have been hard to construct in WAMI or NMDP. Finally, we present a series of evaluations carried out using CALL-SLT, a speech translation game, where we contrast performance in Web and desktop versions. Task Error Rate in the Web version is only slightly inferior that in the desktop one, and the average additional latency is under half a second. The software is generally available for research purposes.
Anthology ID:
L12-1226
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1309–1314
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/436_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Matthew Fuchs, Nikos Tsourakis, and Manny Rayner. 2012. A Scalable Architecture For Web Deployment of Spoken Dialogue Systems. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1309–1314, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems (Fuchs et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/436_Paper.pdf