Pivot-based multilingual dictionary building using Wiktionary

Judit Ács


Abstract
We describe a method for expanding existing dictionaries in several languages by discovering previously non-existent links between translations. We call this method triangulation and we present and compare several variations of it. We assess precision manually, and recall by comparing the extracted dictionaries with independently obtained basic vocabulary sets. We featurize the translation candidates and train a maximum entropy classifier to identify correct translations in the noisy data.
Anthology ID:
L14-1669
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1938–1942
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/864_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Judit Ács. 2014. Pivot-based multilingual dictionary building using Wiktionary. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 1938–1942, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Pivot-based multilingual dictionary building using Wiktionary (Ács, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/864_Paper.pdf