Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing

Çağrı Çöltekin, Taraka Rama


Abstract
This paper describes our systems and results on VarDial 2017 shared tasks. Besides three language/dialect discrimination tasks, we also participated in the cross-lingual dependency parsing (CLP) task using a simple methodology which we also briefly describe in this paper. For all the discrimination tasks, we used linear SVMs with character and word features. The system achieves competitive results among other systems in the shared task. We also report additional experiments with neural network models. The performance of neural network models was close but always below the corresponding SVM classifiers in the discrimination tasks. For the cross-lingual parsing task, we experimented with an approach based on automatically translating the source treebank to the target language, and training a parser on the translated treebank. We used off-the-shelf tools for both translation and parsing. Despite achieving better-than-baseline results, our scores in CLP tasks were substantially lower than the scores of the other participants.
Anthology ID:
W17-1218
Volume:
Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Preslav Nakov, Marcos Zampieri, Nikola Ljubešić, Jörg Tiedemann, Shevin Malmasi, Ahmed Ali
Venue:
VarDial
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
146–155
Language:
URL:
https://aclanthology.org/W17-1218
DOI:
10.18653/v1/W17-1218
Bibkey:
Cite (ACL):
Çağrı Çöltekin and Taraka Rama. 2017. Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing. In Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), pages 146–155, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing (Çöltekin & Rama, VarDial 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-1218.pdf