Some strategies for the improvement of a Spanish WordNet

Matias Herrera, Javier Gonzalez, Luis Chiruzzo, Dina Wonsever


Abstract
Although versions of Princeton WordNet now exist for several languages, some of them are too underdeveloped to be used in various Natural Language Processing applications. Such is the case of the Spanish WordNet contained in the Multilingual Central Repository (MCR), which we tried unsuccessfully to incorporate into an anaphora resolution application and into search term expansion. Given this situation, different strategies to improve the coverage of the MCR Spanish WordNet were put forward and tested, with encouraging results. A specific process was conducted to increase the number of adverbs, and a few simple processes were applied that increased the number of terms in the Spanish WordNet at very low cost. Finally, a more complex method based on distributional semantics was proposed, using the relations between English WordNet synsets; it also returned positive results.
Anthology ID:
2016.gwc-1.18
Volume:
Proceedings of the 8th Global WordNet Conference (GWC)
Month:
27–30 January
Year:
2016
Address:
Bucharest, Romania
Editors:
Christiane Fellbaum, Piek Vossen, Verginica Barbu Mititelu, Corina Forascu
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Pages:
115–122
URL:
https://aclanthology.org/2016.gwc-1.18
Cite (ACL):
Matias Herrera, Javier Gonzalez, Luis Chiruzzo, and Dina Wonsever. 2016. Some strategies for the improvement of a Spanish WordNet. In Proceedings of the 8th Global WordNet Conference (GWC), pages 115–122, Bucharest, Romania. Global Wordnet Association.
Cite (Informal):
Some strategies for the improvement of a Spanish WordNet (Herrera et al., GWC 2016)
PDF:
https://aclanthology.org/2016.gwc-1.18.pdf