Identifying Risk Factors For Heart Disease in Electronic Medical Records: A Deep Learning Approach

Thanat Chokwijitkul, Anthony Nguyen, Hamed Hassanzadeh, Siegfried Perez


Abstract
Automatic identification of heart disease risk factors in clinical narratives can expedite disease progression modelling and support clinical decisions. Existing practical solutions for cardiovascular risk detection are mostly hybrid systems entailing the integration of knowledge-driven and data-driven methods, relying on dictionaries, rules and machine learning methods that require a substantial amount of human effort. This paper proposes a comparative analysis on the applicability of deep learning, a re-emerged data-driven technique, in the context of clinical text classification. Various deep learning architectures were devised and evaluated for extracting heart disease risk factors from clinical documents. The data provided for the 2014 i2b2/UTHealth shared task focusing on identifying risk factors for heart disease was used for system development and evaluation. Results have shown that a relatively simple deep learning model can achieve a high micro-averaged F-measure of 0.9081, which is comparable to the best systems from the shared task. This is highly encouraging given the simplicity of the deep learning approach compared to the heavily feature-engineered hybrid approaches that were required to achieve state-of-the-art performances.
Anthology ID:
W18-2303
Volume:
Proceedings of the BioNLP 2018 workshop
Month:
July
Year:
2018
Address:
Melbourne, Australia
Editors:
Dina Demner-Fushman, Kevin Bretonnel Cohen, Sophia Ananiadou, Junichi Tsujii
Venue:
BioNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
18–27
Language:
URL:
https://aclanthology.org/W18-2303
DOI:
10.18653/v1/W18-2303
Bibkey:
Cite (ACL):
Thanat Chokwijitkul, Anthony Nguyen, Hamed Hassanzadeh, and Siegfried Perez. 2018. Identifying Risk Factors For Heart Disease in Electronic Medical Records: A Deep Learning Approach. In Proceedings of the BioNLP 2018 workshop, pages 18–27, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):
Identifying Risk Factors For Heart Disease in Electronic Medical Records: A Deep Learning Approach (Chokwijitkul et al., BioNLP 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-2303.pdf