A Bootstrapping Approach to Parser Development

Izaskun Aldezabal, Koldo Gojenola, Kepa Sarasola


Abstract
This paper presents a robust parsing system for unrestricted Basque texts. It analyzes a sentence in two stages: a unification-based parser builds basic syntactic units such as NPs, PPs, and sentential complements, while a finite-state parser performs syntactic disambiguation and filtering of the results. The system has been applied to the acquisition of verbal subcategorization information, obtaining 66% recall and 87% precision in the determination of verb subcategorization instances. This information will later be incorporated into the parser to improve its performance.
Anthology ID:
2000.iwpt-1.5
Volume:
Proceedings of the Sixth International Workshop on Parsing Technologies
Month:
February 23-25
Year:
2000
Address:
Trento, Italy
Editors:
Alberto Lavelli, John Carroll, Robert C. Berwick, Harry C. Bunt, Bob Carpenter, Ken Church, Mark Johnson, Aravind Joshi, Ronald Kaplan, Martin Kay, Bernard Lang, Alon Lavie, Anton Nijholt, Christer Samuelsson, Mark Steedman, Oliviero Stock, Hozumi Tanaka, Masaru Tomita, Hans Uszkoreit, K. Vijay-Shanker, David Weir, Mats Wiren
Venue:
IWPT
SIG:
SIGPARSE
Publisher:
Association for Computational Linguistics
Pages:
17–28
URL:
https://aclanthology.org/2000.iwpt-1.5
Cite (ACL):
Izaskun Aldezabal, Koldo Gojenola, and Kepa Sarasola. 2000. A Bootstrapping Approach to Parser Development. In Proceedings of the Sixth International Workshop on Parsing Technologies, pages 17–28, Trento, Italy. Association for Computational Linguistics.
Cite (Informal):
A Bootstrapping Approach to Parser Development (Aldezabal et al., IWPT 2000)
PDF:
https://aclanthology.org/2000.iwpt-1.5.pdf