Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames

Daisuke Kawahara, Ryohei Sasano, Sadao Kurohashi


Abstract
This paper proposes a wide-range anaphora resolution system toward text understanding. This system resolves zero, direct and indirect anaphors in Japanese texts by integrating two sorts of linguistic resources: a hand-annotated corpus with various relations and automatically constructed case frames. The corpus has relevance tags which consist of predicate-argument relations, relations between nouns and coreferences, and is utilized for learning parameters of the system and testing it. The case frames are indispensable knowledge both for detecting zero/indirect anaphors and estimating appropriate antecedents. Our preliminary experiments showed promising results.
Anthology ID:
L04-1242
Volume:
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Month:
May
Year:
2004
Address:
Lisbon, Portugal
Editors:
Maria Teresa Lino, Maria Francisca Xavier, Fátima Ferreira, Rute Costa, Raquel Silva
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2004/pdf/414.pdf
DOI:
Bibkey:
Cite (ACL):
Daisuke Kawahara, Ryohei Sasano, and Sadao Kurohashi. 2004. Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04), Lisbon, Portugal. European Language Resources Association (ELRA).
Cite (Informal):
Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames (Kawahara et al., LREC 2004)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2004/pdf/414.pdf