Mapping WordNet Instances to Wikipedia

John P. McCrae


Abstract
Lexical resource differ from encyclopaedic resources and represent two distinct types of resource covering general language and named entities respectively. However, many lexical resources, including Princeton WordNet, contain many proper nouns, referring to named entities in the world yet it is not possible or desirable for a lexical resource to cover all named entities that may reasonably occur in a text. In this paper, we propose that instead of including synsets for instance concepts PWN should instead provide links to Wikipedia articles describing the concept. In order to enable this we have created a gold-quality mapping between all of the 7,742 instances in PWN and Wikipedia (where such a mapping is possible). As such, this resource aims to provide a gold standard for link discovery, while also allowing PWN to distinguish itself from other resources such as DBpedia or BabelNet. Moreover, this linking connects PWN to the Linguistic Linked Open Data cloud, thus creating a richer, more usable resource for natural language processing.
Anthology ID:
2018.gwc-1.8
Volume:
Proceedings of the 9th Global Wordnet Conference
Month:
January
Year:
2018
Address:
Nanyang Technological University (NTU), Singapore
Editors:
Francis Bond, Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
61–68
Language:
URL:
https://aclanthology.org/2018.gwc-1.8
DOI:
Bibkey:
Cite (ACL):
John P. McCrae. 2018. Mapping WordNet Instances to Wikipedia. In Proceedings of the 9th Global Wordnet Conference, pages 61–68, Nanyang Technological University (NTU), Singapore. Global Wordnet Association.
Cite (Informal):
Mapping WordNet Instances to Wikipedia (McCrae, GWC 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.gwc-1.8.pdf