Induction of Probabilistic Synchronous Tree-Insertion Grammars for Machine Translation

Rebecca Nesson, Stuart Shieber, Alexander Rush


Abstract
The more expressive and flexible a base formalism for machine translation is, the less efficient parsing of it will be. However, even among formalisms with the same parse complexity, some formalisms better realize the desired characteristics for machine translation formalisms than others. We introduce a particular formalism, probabilistic synchronous tree-insertion grammar (PSTIG) that we argue satisfies the desiderata optimally within the class of formalisms that can be parsed no less efficiently than context-free grammars and demonstrate that it outperforms state-of-the-art word-based and phrase-based finite-state translation models on training and test data taken from the EuroParl corpus (Koehn, 2005). We then argue that a higher level of translation quality can be achieved by hybridizing our in- duced model with elementary structures produced using supervised techniques such as those of Groves et al. (2004).
Anthology ID:
2006.amta-papers.15
Volume:
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:
August 8-12
Year:
2006
Address:
Cambridge, Massachusetts, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
128–137
Language:
URL:
https://aclanthology.org/2006.amta-papers.15
DOI:
Bibkey:
Cite (ACL):
Rebecca Nesson, Stuart Shieber, and Alexander Rush. 2006. Induction of Probabilistic Synchronous Tree-Insertion Grammars for Machine Translation. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 128–137, Cambridge, Massachusetts, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Induction of Probabilistic Synchronous Tree-Insertion Grammars for Machine Translation (Nesson et al., AMTA 2006)
Copy Citation:
PDF:
https://aclanthology.org/2006.amta-papers.15.pdf