An Enhanced Mapping Scheme of the Universal Part-Of-Speech for Korean

Myung Hee Kim, Nathalie Colineau


Abstract
When mapping a language specific Part-Of-Speech (POS) tag set to the Universal POS tag set (UPOS), it is critical to consider the individual language’s linguistic features and the UPOS definitions. In this paper, we present an enhanced Sejong POS mapping to the UPOS in accordance with the Korean linguistic typology and the substantive definitions of the UPOS categories. This work updated one third of the Sejong POS mapping to the UPOS. We also introduced a new mapping for the KAIST POS tag set, another widely used Korean POS tag set, to the UPOS.
Anthology ID:
2020.lrec-1.472
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
3826–3833
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.472
DOI:
Bibkey:
Cite (ACL):
Myung Hee Kim and Nathalie Colineau. 2020. An Enhanced Mapping Scheme of the Universal Part-Of-Speech for Korean. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3826–3833, Marseille, France. European Language Resources Association.
Cite (Informal):
An Enhanced Mapping Scheme of the Universal Part-Of-Speech for Korean (Kim & Colineau, LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.472.pdf