A Review of Dataset and Labeling Methods for Causality Extraction

Jinghang Xu, Wanli Zuo, Shining Liang, Xianglin Zuo


Abstract
Causality represents the most important kind of correlation between events. Extracting causali-ty from text has become a promising hot topic in NLP. However, there is no mature research systems and datasets for public evaluation. Moreover, there is a lack of unified causal sequence label methods, which constitute the key factors that hinder the progress of causality extraction research. We survey the limitations and shortcomings of existing causality research field com-prehensively from the aspects of basic concepts, extraction methods, experimental data, and la-bel methods, so as to provide reference for future research on causality extraction. We summa-rize the existing causality datasets, explore their practicability and extensibility from multiple perspectives and create a new causal dataset ESC. Aiming at the problem of causal sequence labeling, we analyse the existing methods with a summarization of its regulation and propose a new causal label method of core word. Multiple candidate causal label sequences are put for-ward according to label controversy to explore the optimal label method through experiments, and suggestions are provided for selecting label method.
Anthology ID:
2020.coling-main.133
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
1519–1531
Language:
URL:
https://aclanthology.org/2020.coling-main.133
DOI:
10.18653/v1/2020.coling-main.133
Bibkey:
Cite (ACL):
Jinghang Xu, Wanli Zuo, Shining Liang, and Xianglin Zuo. 2020. A Review of Dataset and Labeling Methods for Causality Extraction. In Proceedings of the 28th International Conference on Computational Linguistics, pages 1519–1531, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
A Review of Dataset and Labeling Methods for Causality Extraction (Xu et al., COLING 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.coling-main.133.pdf