Modality in Text: a Proposal for Corpus Annotation

Iris Hendrickx, Amália Mendes, Silvia Mencarelli


Abstract
We present a annotation scheme for modality in Portuguese. In our annotation scheme we have tried to combine a more theoretical linguistic viewpoint with a practical annotation scheme that will also be useful for NLP research but is not geared towards one specific application. Our notion of modality focuses on the attitude and opinion of the speaker, or of the subject of the sentence. We validated the annotation scheme on a corpus sample of approximately 2000 sentences that we fully annotated with modal information using the MMAX2 annotation tool to produce XML annotation. We discuss our main findings and give attention to the difficult cases that we encountered as they illustrate the complexity of modality and its interactions with other elements in the text.
Anthology ID:
L12-1288
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1805–1812
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/520_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Iris Hendrickx, Amália Mendes, and Silvia Mencarelli. 2012. Modality in Text: a Proposal for Corpus Annotation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1805–1812, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Modality in Text: a Proposal for Corpus Annotation (Hendrickx et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/520_Paper.pdf