Promoting multiword expressions in A* TAG parsing

Jakub Waszczuk, Agata Savary, Yannick Parmentier


Abstract
Multiword expressions (MWEs) are pervasive in natural languages and often have both idiomatic and compositional readings, which leads to high syntactic ambiguity. We show that for some MWE types idiomatic readings are usually the correct ones. We propose a heuristic for an A* parser for Tree Adjoining Grammars which benefits from this knowledge by promoting MWE-oriented analyses. This strategy leads to a substantial reduction in the parsing search space in case of true positive MWE occurrences, while avoiding parsing failures in case of false positives.
Anthology ID:
C16-1042
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
429–439
Language:
URL:
https://aclanthology.org/C16-1042
DOI:
Bibkey:
Cite (ACL):
Jakub Waszczuk, Agata Savary, and Yannick Parmentier. 2016. Promoting multiword expressions in A* TAG parsing. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 429–439, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Promoting multiword expressions in A* TAG parsing (Waszczuk et al., COLING 2016)
Copy Citation:
PDF:
https://aclanthology.org/C16-1042.pdf