Negation in Norwegian: an annotated dataset

Petter Mæhlum, Jeremy Barnes, Robin Kurtz, Lilja Øvrelid, Erik Velldal


Abstract
This paper introduces NorecNeg – the first annotated dataset of negation for Norwegian. Negation cues and their in-sentence scopes have been annotated across more than 11K sentences spanning more than 400 documents for a subset of the Norwegian Review Corpus (NoReC). In addition to providing in-depth discussion of the annotation guidelines, we also present a first set of benchmark results based on a graph-parsing approach.
Anthology ID:
2021.nodalida-main.30
Volume:
Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)
Month:
May 31--2 June
Year:
2021
Address:
Reykjavik, Iceland (Online)
Editors:
Simon Dobnik, Lilja Øvrelid
Venue:
NoDaLiDa
SIG:
Publisher:
Linköping University Electronic Press, Sweden
Note:
Pages:
299–308
Language:
URL:
https://aclanthology.org/2021.nodalida-main.30
DOI:
Bibkey:
Cite (ACL):
Petter Mæhlum, Jeremy Barnes, Robin Kurtz, Lilja Øvrelid, and Erik Velldal. 2021. Negation in Norwegian: an annotated dataset. In Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pages 299–308, Reykjavik, Iceland (Online). Linköping University Electronic Press, Sweden.
Cite (Informal):
Negation in Norwegian: an annotated dataset (Mæhlum et al., NoDaLiDa 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.nodalida-main.30.pdf
Code
 ltgoslo/norec_neg
Data
NoReC