Ambiguity Reduction for Machine Translation: Human-Computer Collaboration

Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff, Oren Etzioni


Abstract
Statistical Machine Translation (SMT) accuracy degrades when there is only a limited amount of training, or when the training is not from the same domain or genre of text as the target application. However, cross-domain applications are typical of many real world tasks. We demonstrate that SMT accuracy can be improved in a cross-domain application by using a controlled language (CL) interface to help reduce lexical ambiguity in the input text. Our system, CL-MT, presents a monolingual user with a choice of word senses for each content word in the input text. CL-MT temporarily adjusts the underlying SMT system's phrase table, boosting the scores of translations that include the word senses preferred by the user and lowering scores for disfavored translations. We demonstrate that this improves translation adequacy in 33.8% of the sentences in Spanish to English translation of news stories, where the SMT system was trained on proceedings of the European Parliament.
Anthology ID:
2006.amta-papers.22
Volume:
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:
August 8-12
Year:
2006
Address:
Cambridge, Massachusetts, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
193–202
Language:
URL:
https://aclanthology.org/2006.amta-papers.22
DOI:
Bibkey:
Cite (ACL):
Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff, and Oren Etzioni. 2006. Ambiguity Reduction for Machine Translation: Human-Computer Collaboration. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 193–202, Cambridge, Massachusetts, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Ambiguity Reduction for Machine Translation: Human-Computer Collaboration (Sammer et al., AMTA 2006)
Copy Citation:
PDF:
https://aclanthology.org/2006.amta-papers.22.pdf