Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory

Bettina Klimek, Natanael Arndt, Sebastian Krause, Timotheus Arndt


Abstract
The development of standard models for describing general lexical resources has led to the emergence of numerous lexical datasets of various languages in the Semantic Web. However, equivalent models covering the linguistic domain of morphology do not exist. As a result, there are hardly any language resources of morphemic data available in RDF to date. This paper presents the creation of the Hebrew Morpheme Inventory from a manually compiled tabular dataset comprising around 52.000 entries. It is an ongoing effort of representing the lexemes, word-forms and morphologigal patterns together with their underlying relations based on the newly created Multilingual Morpheme Ontology (MMoOn). It will be shown how segmented Hebrew language data can be granularly described in a Linked Data format, thus, serving as an exemplary case for creating morpheme inventories of any inflectional language with MMoOn. The resulting dataset is described a) according to the structure of the underlying data format, b) with respect to the Hebrew language characteristic of building word-forms directly from roots, c) by exemplifying how inflectional information is realized and d) with regard to its enrichment with external links to sense resources.
Anthology ID:
L16-1143
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
892–899
Language:
URL:
https://aclanthology.org/L16-1143
DOI:
Bibkey:
Cite (ACL):
Bettina Klimek, Natanael Arndt, Sebastian Krause, and Timotheus Arndt. 2016. Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 892–899, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory (Klimek et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1143.pdf
Code
 aksw/mmoon