DeepGeneMD: A Joint Deep Learning Model for Extracting Gene Mutation-Disease Knowledge from PubMed Literature

Feifan Liu, Xiaoyu Zheng, Bo Wang, Catarina Kiefe


Abstract
Understanding the pathogenesis of genetic diseases through different gene activities and their relations to relevant diseases is important for new drug discovery and drug repositioning. In this paper, we present a joint deep learning model in a multi-task learning paradigm for gene mutation-disease knowledge extraction, DeepGeneMD, which adapts the state-of-the-art hierarchical multi-task learning framework for joint inference on named entity recognition (NER) and relation extraction (RE) in the context of the AGAC (Active Gene Annotation Corpus) track at 2019 BioNLP Open Shared Tasks (BioNLP-OST). It simultaneously extracts gene mutation related activities, diseases, and their relations from the published scientific literature. In DeepGeneMD, we explore the task decomposition to create auxiliary subtasks so that more interactions between different learning subtasks can be leveraged in model training. Our model achieves the average F1 score of 0.45 on recognizing gene activities and disease entities, ranking 2nd in the AGAC NER task; and the average F1 score of 0.35 on extracting relations, ranking 1st in the AGAC RE task.
Anthology ID:
D19-5712
Volume:
Proceedings of the 5th Workshop on BioNLP Open Shared Tasks
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Kim Jin-Dong, Nédellec Claire, Bossy Robert, Deléger Louise
Venue:
BioNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
77–83
Language:
URL:
https://aclanthology.org/D19-5712
DOI:
10.18653/v1/D19-5712
Bibkey:
Cite (ACL):
Feifan Liu, Xiaoyu Zheng, Bo Wang, and Catarina Kiefe. 2019. DeepGeneMD: A Joint Deep Learning Model for Extracting Gene Mutation-Disease Knowledge from PubMed Literature. In Proceedings of the 5th Workshop on BioNLP Open Shared Tasks, pages 77–83, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
DeepGeneMD: A Joint Deep Learning Model for Extracting Gene Mutation-Disease Knowledge from PubMed Literature (Liu et al., BioNLP 2019)
Copy Citation:
PDF:
https://aclanthology.org/D19-5712.pdf