Because Size Does Matter: The Hamburg Dependency Treebank

Kilian A. Foth, Arne Köhn, Niels Beuck, Wolfgang Menzel


Abstract
We present the Hamburg Dependency Treebank (HDT), which to our knowledge is the largest dependency treebank currently available. It consists of genuine dependency annotations, i. e. they have not been transformed from phrase structures. We explore characteristics of the treebank and compare it against others. To exemplify the benefit of large dependency treebanks, we evaluate different parsers on the HDT. In addition, a set of tools will be described which help working with and searching in the treebank.
Anthology ID:
L14-1666
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2326–2333
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/860_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Kilian A. Foth, Arne Köhn, Niels Beuck, and Wolfgang Menzel. 2014. Because Size Does Matter: The Hamburg Dependency Treebank. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2326–2333, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Because Size Does Matter: The Hamburg Dependency Treebank (Foth et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/860_Paper.pdf