A Progressive Learning Approach to Chinese SRL Using Heterogeneous Data

Qiaolin Xia, Lei Sha, Baobao Chang, Zhifang Sui


Abstract
Previous studies on Chinese semantic role labeling (SRL) have concentrated on a single semantically annotated corpus, but the training data in any single corpus is often limited. Meanwhile, the other existing semantically annotated corpora for Chinese SRL are scattered across different annotation frameworks, so data sparsity remains a bottleneck. This situation calls for larger training datasets or for effective approaches that can take advantage of highly heterogeneous data. In this paper, we focus mainly on the latter, that is, improving Chinese SRL by using heterogeneous corpora together. We propose a novel progressive learning model which augments the Progressive Neural Network with Gated Recurrent Adapters. The model can accommodate heterogeneous inputs and effectively transfer knowledge between them. We also release a new corpus, Chinese SemBank, for Chinese SRL. Experiments on CPB 1.0 show that our model outperforms state-of-the-art methods.
Anthology ID:
P17-1189
Volume:
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2017
Address:
Vancouver, Canada
Editors:
Regina Barzilay, Min-Yen Kan
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
2069–2077
URL:
https://aclanthology.org/P17-1189
DOI:
10.18653/v1/P17-1189
Cite (ACL):
Qiaolin Xia, Lei Sha, Baobao Chang, and Zhifang Sui. 2017. A Progressive Learning Approach to Chinese SRL Using Heterogeneous Data. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2069–2077, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
A Progressive Learning Approach to Chinese SRL Using Heterogeneous Data (Xia et al., ACL 2017)
PDF:
https://aclanthology.org/P17-1189.pdf