Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora

Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff


Abstract
The shared task of the 13th Workshop on Building and Using Comparable Corpora was devoted to the induction of bilingual dictionaries from comparable rather than parallel corpora. In this task, for a number of language pairs involving Chinese, English, French, German, Russian and Spanish, the participants were asked to automatically determine the target language translations of several thousand source language test words in three frequency ranges. We describe here some background, the task definition, the training and test data sets, and the evaluation used for ranking the participating systems. We also summarize the approaches used and present the results of the evaluation. The competition yielded results from a number of systems that provide surprisingly good solutions to this ambitious problem.
Anthology ID:
2020.bucc-1.2
Volume:
Proceedings of the 13th Workshop on Building and Using Comparable Corpora
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff
Venue:
BUCC
Publisher:
European Language Resources Association
Pages:
6–13
Language:
English
URL:
https://aclanthology.org/2020.bucc-1.2
Cite (ACL):
Reinhard Rapp, Pierre Zweigenbaum, and Serge Sharoff. 2020. Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora. In Proceedings of the 13th Workshop on Building and Using Comparable Corpora, pages 6–13, Marseille, France. European Language Resources Association.
Cite (Informal):
Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora (Rapp et al., BUCC 2020)
PDF:
https://aclanthology.org/2020.bucc-1.2.pdf