The ACoLi Dictionary Graph

Christian Chiarcos, Christian Fäth, Maxim Ionov


Abstract
In this paper, we report the release of the ACoLi Dictionary Graph, a large-scale collection of multilingual open source dictionaries available in two machine-readable formats, a graph representation in RDF, using the OntoLex-Lemon vocabulary, and a simple tabular data format to facilitate their use in NLP tasks, such as translation inference across dictionaries. We describe the mapping and harmonization of the underlying data structures into a unified representation, its serialization in RDF and TSV, and the release of a massive and coherent amount of lexical data under open licenses.
Anthology ID:
2020.lrec-1.401
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
3281–3290
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.401
DOI:
Bibkey:
Cite (ACL):
Christian Chiarcos, Christian Fäth, and Maxim Ionov. 2020. The ACoLi Dictionary Graph. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3281–3290, Marseille, France. European Language Resources Association.
Cite (Informal):
The ACoLi Dictionary Graph (Chiarcos et al., LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.401.pdf