Iula2Standoff: a tool for creating standoff documents for the IULACT

Carlos Morell, Jorge Vivaldi, Núria Bel


Abstract
As the number and depth of analyses required over a text increase (entity recognition, POS tagging, syntactic analysis, etc.), in-line annotation has become impractical. In Natural Language Processing (NLP), some emphasis has been placed on finding an annotation method that solves this problem. One possibility is standoff annotation. With this annotation style it is possible to add new levels of annotation without disturbing existing ones, with minimal knock-on effects. This makes it easier to add further linguistic information and to share textual resources. In this paper we present a tool developed in the framework of the European Metanet4u project (Enhancing the European Linguistic Infrastructure, GA 270893) for creating a multi-layered XML annotation scheme, based on the GrAF proposal for standoff annotations.
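
To make the idea of standoff annotation concrete, the following is a minimal sketch in Python. It is not the Iula2Standoff tool and does not reproduce the GrAF XML format; the names `Span`, `layers`, and `covered_text`, the sample sentence, and the labels are all hypothetical and chosen only for illustration. It shows the core property described above: the primary text is never modified, and each annotation layer refers to it only through character offsets, so a new layer can be added without touching existing ones.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """One standoff annotation: a label attached to a character range."""
    start: int  # inclusive character offset into the primary text
    end: int    # exclusive character offset into the primary text
    label: str


# The primary text is stored once and is never edited by any annotation.
text = "The tool annotates Spanish text."

# Each layer is an independent list of spans over the same text.
# (Hypothetical layer names and labels, for illustration only.)
layers = {
    "pos": [
        Span(0, 3, "DET"),
        Span(4, 8, "NOUN"),
        Span(9, 18, "VERB"),
    ],
}

# Adding a new layer does not disturb the text or the existing "pos" layer.
layers["entities"] = [Span(19, 26, "LANGUAGE")]


def covered_text(primary: str, span: Span) -> str:
    """Return the substring of the primary text that a span points to."""
    return primary[span.start:span.end]


for name, spans in layers.items():
    for span in spans:
        print(name, span.label, covered_text(text, span))
```

In an XML-based scheme, each layer would typically live in its own document that points into the primary text in the same offset-based way; the dictionary of lists here only stands in for that separation.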
Anthology ID:
L12-1141
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
351–356
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/307_Paper.pdf
Cite (ACL):
Carlos Morell, Jorge Vivaldi, and Núria Bel. 2012. Iula2Standoff: a tool for creating standoff documents for the IULACT. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 351–356, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Iula2Standoff: a tool for creating standoff documents for the IULACT (Morell et al., LREC 2012)