The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish

Gülşen Eryiğit


Abstract
Studies on dependency parsing of Turkish have so far reported their results on the Turkish Dependency Treebank. This treebank consists of sentences in which gold-standard part-of-speech tags are manually assigned to each word, and the words forming multi-word expressions are manually identified and combined into single units. For the first time, we investigate parsing Turkish sentences from scratch and observe the accuracy drop that results from processing raw data. We test one state-of-the-art morphological analyzer together with two different morphological disambiguators. We show separately the accuracy drop due to automatic morphological processing and the drop due to the lack of multi-word unit extraction. For this purpose, we use and present a new version of the Turkish Treebank in which we detached the multi-word expressions (MWEs) into multiple tokens and manually annotated the missing part-of-speech tags of these new tokens.
Anthology ID:
L12-1056
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1960–1965
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/198_Paper.pdf
Cite (ACL):
Gülşen Eryiğit. 2012. The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1960–1965, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish (Eryiğit, LREC 2012)