Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing

Željko Agić, Daša Berović, Danijela Merkler, Marko Tadić


Abstract
We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used in explicit annotation of subordinate clauses via subordinate conjunctions. Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leeds to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development.
Anthology ID:
L14-1545
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2313–2319
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/694_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Željko Agić, Daša Berović, Danijela Merkler, and Marko Tadić. 2014. Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2313–2319, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing (Agić et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/694_Paper.pdf