Retrieving Occurrences of Grammatical Constructions

Anna Ehrlemark, Richard Johansson, Benjamin Lyngfelt


Abstract
Finding authentic examples of grammatical constructions is central in constructionist approaches to linguistics, language processing, and second language learning. In this paper, we address this problem as an information retrieval (IR) task. To facilitate research in this area, we built a benchmark collection by annotating the occurrences of six constructions in a Swedish corpus. Furthermore, we implemented a simple and flexible retrieval system for finding construction occurrences, in which the user specifies a ranking function using lexical-semantic similarities (lexicon-based or distributional). The system was evaluated using standard IR metrics on the new benchmark, and we saw that lexical-semantical rerankers improve significantly over a purely surface-oriented system, but must be carefully tailored for each individual construction.
Anthology ID:
C16-1078
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
815–824
Language:
URL:
https://aclanthology.org/C16-1078
DOI:
Bibkey:
Cite (ACL):
Anna Ehrlemark, Richard Johansson, and Benjamin Lyngfelt. 2016. Retrieving Occurrences of Grammatical Constructions. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 815–824, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Retrieving Occurrences of Grammatical Constructions (Ehrlemark et al., COLING 2016)
Copy Citation:
PDF:
https://aclanthology.org/C16-1078.pdf