TwistBytes - Identification of Cuneiform Languages and German Dialects at VarDial 2019

Fernando Benites, Pius von Däniken, Mark Cieliebak


Abstract
We describe our approaches for the German Dialect Identification (GDI) and the Cuneiform Language Identification (CLI) tasks at the VarDial Evaluation Campaign 2019. The goal was to identify dialects of Swiss German in GDI and Sumerian and Akkadian in CLI. In GDI, the system should distinguish four dialects from the German-speaking part of Switzerland. Our system for GDI achieved third place out of 6 teams, with a macro averaged F-1 of 74.6%. In CLI, the system should distinguish seven languages written in cuneiform script. Our system achieved third place out of 8 teams, with a macro averaged F-1 of 74.7%.
Anthology ID:
W19-1421
Volume:
Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects
Month:
June
Year:
2019
Address:
Ann Arbor, Michigan
Editors:
Marcos Zampieri, Preslav Nakov, Shervin Malmasi, Nikola Ljubešić, Jörg Tiedemann, Ahmed Ali
Venue:
VarDial
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
194–201
Language:
URL:
https://aclanthology.org/W19-1421
DOI:
10.18653/v1/W19-1421
Bibkey:
Cite (ACL):
Fernando Benites, Pius von Däniken, and Mark Cieliebak. 2019. TwistBytes - Identification of Cuneiform Languages and German Dialects at VarDial 2019. In Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects, pages 194–201, Ann Arbor, Michigan. Association for Computational Linguistics.
Cite (Informal):
TwistBytes - Identification of Cuneiform Languages and German Dialects at VarDial 2019 (Benites et al., VarDial 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-1421.pdf