Morpheus: A Neural Network for Jointly Learning Contextual Lemmatization and Morphological Tagging

Eray Yildiz, A. Cüneyd Tantuğ


Abstract
In this study, we present Morpheus, a joint contextual lemmatizer and morphological tagger. Morpheus is based on a neural sequential architecture where inputs are the characters of the surface words in a sentence and the outputs are the minimum edit operations between surface words and their lemmata as well as the morphological tags assigned to the words. The experiments on the datasets in nearly 100 languages provided by SigMorphon 2019 Shared Task 2 organizers show that the performance of Morpheus is comparable to the state-of-the-art system in terms of lemmatization. In morphological tagging, on the other hand, Morpheus significantly outperforms the SigMorphon baseline. In our experiments, we also show that the neural encoder-decoder architecture trained to predict the minimum edit operations can produce considerably better results than the architecture trained to predict the characters in lemmata directly as in previous studies. According to the SigMorphon 2019 Shared Task 2 results, Morpheus has placed 3rd in lemmatization and reached the 9th place in morphological tagging among all participant teams.
Anthology ID:
W19-4205
Volume:
Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Garrett Nicolai, Ryan Cotterell
Venue:
ACL
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
25–34
Language:
URL:
https://aclanthology.org/W19-4205
DOI:
10.18653/v1/W19-4205
Bibkey:
Cite (ACL):
Eray Yildiz and A. Cüneyd Tantuğ. 2019. Morpheus: A Neural Network for Jointly Learning Contextual Lemmatization and Morphological Tagging. In Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 25–34, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Morpheus: A Neural Network for Jointly Learning Contextual Lemmatization and Morphological Tagging (Yildiz & Tantuğ, ACL 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-4205.pdf
Code
 erayyildiz/Morpheus