Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation

Ayaka Morimoto, Akifumi Yoshimoto, Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto


Abstract
This paper presents our ongoing work on the compilation of an English multi-word expression (MWE) lexicon. We are especially interested in collecting flexible MWEs, in which other words can intervene between the components of the expression, as in “a number of” vs. “a large number of”, where a modifier of “number” can be inserted into the expression while the original meaning is retained. We first collect possible candidates of flexible English MWEs from the web, and annotate all of their occurrences in the Wall Street Journal portion of the OntoNotes corpus. We make use of word dependency structure information of the sentences, converted from the phrase structure annotation. This process enables semi-automatic annotation of MWEs in the corpus and simultaneously produces the internal and external dependency representations of flexible MWEs.
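To make the idea of a flexible MWE concrete, the following is a minimal sketch of how dependency heads could be used to accept “a large number of” as an occurrence of “a number of”. It is an illustration only, not the authors' procedure: the function name `find_flexible_mwe`, the greedy left-to-right matching, the acceptance rule (every intervening word must have one of the matched MWE words as its head), and the hand-written head indices are all assumptions made for this example.

```python
from typing import List


def find_flexible_mwe(tokens: List[str], heads: List[int], mwe: List[str]) -> List[List[int]]:
    """Return index lists of MWE occurrences that may contain intervening words.

    tokens: words of one sentence.
    heads:  heads[i] is the index of the head of tokens[i], or -1 for the root
            (hypothetical, hand-written dependency structure).
    mwe:    lower-cased component words of the expression, in order.
    An occurrence is accepted only if every intervening token depends
    directly on one of the matched MWE tokens (an assumed criterion).
    """
    matches = []
    n = len(tokens)
    for start in range(n):
        if tokens[start].lower() != mwe[0]:
            continue
        idxs = [start]
        pos = start + 1
        complete = True
        for word in mwe[1:]:
            # Greedily advance to the next occurrence of this component.
            while pos < n and tokens[pos].lower() != word:
                pos += 1
            if pos == n:
                complete = False
                break
            idxs.append(pos)
            pos += 1
        if not complete:
            continue
        matched = set(idxs)
        gaps = [i for i in range(idxs[0], idxs[-1] + 1) if i not in matched]
        if all(heads[i] in matched for i in gaps):
            matches.append(idxs)
    return matches


# "large" modifies "number", so the gap is licensed by the dependency structure.
tokens = ["A", "large", "number", "of", "people", "came"]
heads = [2, 2, 5, 2, 3, -1]
print(find_flexible_mwe(tokens, heads, ["a", "number", "of"]))  # [[0, 2, 3]]
```

In this toy sentence the only intervening word, “large”, has “number” as its head, so the occurrence is kept. A string such as “a man saw number of them” would be rejected, because the intervening words do not attach to any component of the expression.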
Anthology ID:
W16-3813
Volume:
Proceedings of the Workshop on Grammar and Lexicon: interactions and interfaces (GramLex)
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Eva Hajičová, Igor Boguslavsky
Venue:
GramLex
Publisher:
The COLING 2016 Organizing Committee
Pages:
102–109
URL:
https://aclanthology.org/W16-3813
Cite (ACL):
Ayaka Morimoto, Akifumi Yoshimoto, Akihiko Kato, Hiroyuki Shindo, and Yuji Matsumoto. 2016. Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation. In Proceedings of the Workshop on Grammar and Lexicon: interactions and interfaces (GramLex), pages 102–109, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation (Morimoto et al., GramLex 2016)
PDF:
https://aclanthology.org/W16-3813.pdf