QCRI Live Speech Translation System

Fahim Dalvi, Yifan Zhang, Sameer Khurana, Nadir Durrani, Hassan Sajjad, Ahmed Abdelali, Hamdy Mubarak, Ahmed Ali, Stephan Vogel


Abstract
This paper presents QCRI’s Arabic-to-English live speech translation system. It features modern web technologies to capture live audio, and broadcasts Arabic transcriptions and English translations simultaneously. Our Kaldi-based ASR system uses the Time Delay Neural Network (TDNN) architecture, while our Machine Translation (MT) system uses both phrase-based and neural frameworks. Although our neural MT system is slower than the phrase-based system, it produces significantly better translations and is memory efficient. The demo is available at https://st.qcri.org/demos/livetranslation.
Anthology ID:
E17-3016
Volume:
Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
André Martins, Anselmo Peñas
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
61–64
Language:
URL:
https://aclanthology.org/E17-3016
DOI:
Bibkey:
Cite (ACL):
Fahim Dalvi, Yifan Zhang, Sameer Khurana, Nadir Durrani, Hassan Sajjad, Ahmed Abdelali, Hamdy Mubarak, Ahmed Ali, and Stephan Vogel. 2017. QCRI Live Speech Translation System. In Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 61–64, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
QCRI Live Speech Translation System (Dalvi et al., EACL 2017)
Copy Citation:
PDF:
https://aclanthology.org/E17-3016.pdf