Building Domain Specific Bilingual Dictionaries

Lucas Hilgert, Lucelene Lopes, Artur Freitas, Renata Vieira, Denise Hogetop, Aline Vanin


Abstract
This paper proposes a method to build bilingual dictionaries for specific domains defined by a parallel corpora. The proposed method is based on an original method that is not domain specific. Both the original and the proposed methods are constructed with previously available natural language processing tools. Therefore, this paper contribution resides in the choice and parametrization of the chosen tools. To illustrate the proposed method benefits we conduct an experiment over technical manuals in English and Portuguese. The results of our proposed method were analyzed by human specialists and our results indicates significant increases in precision for unigrams and muli-grams. Numerically, the precision increase is as big as 15% according to our evaluation.
Anthology ID:
L14-1084
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2772–2777
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1112_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Lucas Hilgert, Lucelene Lopes, Artur Freitas, Renata Vieira, Denise Hogetop, and Aline Vanin. 2014. Building Domain Specific Bilingual Dictionaries. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2772–2777, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Building Domain Specific Bilingual Dictionaries (Hilgert et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1112_Paper.pdf