CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task

Shuo Sun, Suzanna Sia, Kevin Duh


Abstract
We present CLIReval, an easy-to-use toolkit for evaluating machine translation (MT) with the proxy task of cross-lingual information retrieval (CLIR). Contrary to what the project name might suggest, CLIReval does not actually require any annotated CLIR dataset. Instead, it automatically transforms translations and references used in MT evaluations into a synthetic CLIR dataset; it then sets up a standard search engine (Elasticsearch) and computes various information retrieval metrics (e.g., mean average precision) by treating the translations as documents to be retrieved. The idea is to gauge the quality of MT by its impact on the document translation approach to CLIR. As a case study, we run CLIReval on the “metrics shared task” of WMT2019; while this extrinsic metric is not intended to replace popular intrinsic metrics such as BLEU, results suggest CLIReval is competitive in many language pairs in terms of correlation to human judgments of quality. CLIReval is publicly available at https://github.com/ssun32/CLIReval.
Anthology ID:
2020.acl-demos.18
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Month:
July
Year:
2020
Address:
Online
Editors:
Asli Celikyilmaz, Tsung-Hsien Wen
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
134–141
Language:
URL:
https://aclanthology.org/2020.acl-demos.18
DOI:
10.18653/v1/2020.acl-demos.18
Bibkey:
Cite (ACL):
Shuo Sun, Suzanna Sia, and Kevin Duh. 2020. CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 134–141, Online. Association for Computational Linguistics.
Cite (Informal):
CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task (Sun et al., ACL 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.acl-demos.18.pdf
Video:
 http://slideslive.com/38928602
Code
 ssun32/CLIReval