Bring vs. MTRoget: Evaluating automatic thesaurus translation

Lars Borin, Jens Allwood, Gerard de Melo


Abstract
Evaluation of automatic language-independent methods for language technology resource creation is difficult, and is confounded by a largely unknown quantity: the extent to which typological differences among languages determine whether results achieved for one language or language pair apply across languages generally. In the work presented here, as a simplifying assumption, language-independence is taken as axiomatic within certain specified bounds. We evaluate the automatic translation of Roget’s “Thesaurus” from English into Swedish using an independently compiled Roget-style Swedish thesaurus, S.C. Bring’s “Swedish vocabulary arranged into conceptual classes” (1930). Our expectation is that this explicit evaluation of one of the thesauruses created in the MTRoget project will provide a good estimate of the quality of the other thesauruses created using similar methods.
Anthology ID:
L14-1343
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2115–2121
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/397_Paper.pdf
Cite (ACL):
Lars Borin, Jens Allwood, and Gerard de Melo. 2014. Bring vs. MTRoget: Evaluating automatic thesaurus translation. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2115–2121, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Bring vs. MTRoget: Evaluating automatic thesaurus translation (Borin et al., LREC 2014)