A disambiguation resource extracted from Wikipedia for semantic annotation

Eric Charton, Michel Gagnon


Abstract
The Semantic Annotation (SA) task consists in establishing the relation between a textual entity (word or group of words designating a named entity of the real world or a concept) and its corresponding entity in an ontology. The main difficulty of this task is that a textual entity might be highly polysemic and potentially related to many different ontological representations. To solve this specific problem, various Information Retrieval techniques can be used. Most of those involves contextual words to estimate wich exact textual entity have to be recognized. In this paper, we present a resource of contextual words that can be used by IR algorithms to establish a link between a named entity (NE) in a text and an entry point to its semantic description in the LinkedData Network.
Anthology ID:
L12-1586
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3665–3671
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/983_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Eric Charton and Michel Gagnon. 2012. A disambiguation resource extracted from Wikipedia for semantic annotation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3665–3671, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
A disambiguation resource extracted from Wikipedia for semantic annotation (Charton & Gagnon, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/983_Paper.pdf