The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation

Christian Federmann, Eleftherios Avramidis, Marta R. Costa-jussà, Josef van Genabith, Maite Melero, Pavel Pecina


Abstract
We describe the “Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation” (ML4HMT) which aims to foster research on improved system combination approaches for machine translation (MT). Participants of the challenge are requested to build hybrid translations by combining the output of several MT systems of different types. We first describe the ML4HMT corpus used in the shared task, then explain the XLIFF-based annotation format we have designed for it, and briefly summarize the participating systems. Using both automated metrics scores and extensive manual evaluation, we discuss the individual performance of the various systems. An interesting result from the shared task is the fact that we were able to observe different systems winning according to the automated metrics scores when compared to the results from the manual evaluation. We conclude by summarising the first edition of the challenge and by giving an outlook to future work.
Anthology ID:
L12-1592
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3430–3435
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/996_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Christian Federmann, Eleftherios Avramidis, Marta R. Costa-jussà, Josef van Genabith, Maite Melero, and Pavel Pecina. 2012. The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3430–3435, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation (Federmann et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/996_Paper.pdf