Building the Cantonese Wordnet

Joanna Ut-Seong Sio, Luis Morgado Da Costa


Abstract
This paper reports on the development of the Cantonese Wordnet, a new wordnet project based on Hong Kong Cantonese. It is built using the expansion approach, leveraging on the existing Chinese Open Wordnet, and the Princeton Wordnet’s semantic hierarchy. The main goal of our project was to produce a high quality, human-curated resource – and this paper reports on the initial efforts and steady progress of our building method. It is our belief that the lexical data made available by this wordnet, including Jyutping romanization, will be useful for a variety of future uses, including many language processing tasks and linguistic research on Cantonese and its interactions with other Chinese dialects.
Anthology ID:
2019.gwc-1.26
Volume:
Proceedings of the 10th Global Wordnet Conference
Month:
July
Year:
2019
Address:
Wroclaw, Poland
Editors:
Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
206–215
Language:
URL:
https://aclanthology.org/2019.gwc-1.26
DOI:
Bibkey:
Cite (ACL):
Joanna Ut-Seong Sio and Luis Morgado Da Costa. 2019. Building the Cantonese Wordnet. In Proceedings of the 10th Global Wordnet Conference, pages 206–215, Wroclaw, Poland. Global Wordnet Association.
Cite (Informal):
Building the Cantonese Wordnet (Sio & Costa, GWC 2019)
Copy Citation:
PDF:
https://aclanthology.org/2019.gwc-1.26.pdf
Code
 lmorgadodacosta/cantonesewn