Two-Step Machine Translation with Lattices

Bushra Jawaid, Ondřej Bojar


Abstract
The idea of two-step machine translation was introduced to divide the complexity of the search space into two independent steps: (1) lexical translation and reordering, and (2) conjugation and declension in the target language. In this paper, we extend the two-step machine translation structure by replacing the state-of-the-art phrase-based machine translation in the 1st step with hierarchical machine translation. We further extend the fixed string-based input format of the 2nd step with word lattices (Dyer et al., 2008); this gives the 2nd step the opportunity to choose among a sample of possible reorderings instead of relying on the single best one produced by the 1st step.
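To make the lattice idea concrete, the following is a minimal sketch of how several alternative reorderings from the 1st step could be packed into one acyclic structure for the 2nd step to choose from. It is an illustration under stated assumptions, not the authors' construction: the helper names `build_lattice` and `paths` are hypothetical, the example tokens are invented, and the structure is a simple prefix-sharing lattice (a trie) rather than the lattice input format of any particular decoder.

```python
def build_lattice(hypotheses):
    """Pack tokenized hypotheses into a prefix-sharing lattice.

    edges[node] maps a word to the node it leads to; node 0 is the start.
    Hypotheses with a common prefix share the corresponding nodes.
    """
    edges = [{}]
    final = set()
    for hyp in hypotheses:
        node = 0
        for word in hyp:
            if word not in edges[node]:
                edges.append({})
                edges[node][word] = len(edges) - 1
            node = edges[node][word]
        final.add(node)  # mark the node where this hypothesis ends
    return edges, final


def paths(edges, final, node=0, prefix=()):
    """Enumerate every complete word sequence encoded in the lattice."""
    if node in final:
        yield list(prefix)
    for word, nxt in edges[node].items():
        yield from paths(edges, final, nxt, prefix + (word,))


# Invented example: three candidate reorderings of the same lemmas.
candidates = [
    ["he", "book", "read"],
    ["he", "read", "book"],
    ["book", "he", "read"],
]
edges, final = build_lattice(candidates)
for path in paths(edges, final):
    print(" ".join(path))
```

A string-based interface would pass on only the first candidate; with a lattice, all three orderings remain available, and the shared prefix "he" is stored once.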
Anthology ID:
L14-1490
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
682–686
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/610_Paper.pdf
Cite (ACL):
Bushra Jawaid and Ondřej Bojar. 2014. Two-Step Machine Translation with Lattices. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 682–686, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Two-Step Machine Translation with Lattices (Jawaid & Bojar, LREC 2014)