Morphological Analysis of the Dravidian Language Family

Arun Kumar, Ryan Cotterell, Lluís Padró, Antoni Oliver


Abstract
The Dravidian languages are one of the most widely spoken language families in the world, yet there are very few annotated resources available to NLP researchers. To remedy this, we create DravMorph, a corpus annotated for morphological segmentation and part-of-speech. Additionally, we exploit novel features and higher-order models to set state-of-the-art results on these corpora on both tasks, beating techniques proposed in the literature by as much as 4 points in segmentation F1.
Anthology ID: E17-2035
Volume: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
Month: April
Year: 2017
Address: Valencia, Spain
Editors: Mirella Lapata, Phil Blunsom, Alexander Koller
Venue: EACL
Publisher: Association for Computational Linguistics
Pages: 217–222
URL: https://aclanthology.org/E17-2035
Cite (ACL): Arun Kumar, Ryan Cotterell, Lluís Padró, and Antoni Oliver. 2017. Morphological Analysis of the Dravidian Language Family. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 217–222, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal): Morphological Analysis of the Dravidian Language Family (Kumar et al., EACL 2017)
PDF: https://aclanthology.org/E17-2035.pdf