Typed Entity and Relation Annotation on Computer Science Papers

Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao, Akiko Aizawa


Abstract
We describe our ongoing effort to establish an annotation scheme for describing the semantic structures of research articles in the computer science domain, with the intended use of developing search systems that can refine their results by the roles of the entities denoted by the query keys. In our scheme, mentions of entities are annotated with ontology-based types, and the roles of the entities are annotated as relations with other entities described in the text. So far, we have annotated 400 abstracts from the ACL anthology and the ACM digital library. In this paper, the scheme and the annotated dataset are described, along with the problems found in the course of annotation. We also show the results of automatic annotation and evaluate the corpus in a practical setting in application to topic extraction.
Anthology ID:
L16-1607
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3836–3843
Language:
URL:
https://aclanthology.org/L16-1607
DOI:
Bibkey:
Cite (ACL):
Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao, and Akiko Aizawa. 2016. Typed Entity and Relation Annotation on Computer Science Papers. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3836–3843, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Typed Entity and Relation Annotation on Computer Science Papers (Tateisi et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1607.pdf
Code
 mynlp/ranis