End-to-End System for Bacteria Habitat Extraction

Farrokh Mehryary, Kai Hakala, Suwisa Kaewphan, Jari Björne, Tapio Salakoski, Filip Ginter


Abstract
We introduce an end-to-end system capable of named-entity detection, normalization and relation extraction for extracting information about bacteria and their habitats from biomedical literature. Our system is based on deep learning, CRF classifiers and vector space models. We train and evaluate the system on the BioNLP 2016 Shared Task Bacteria Biotope data. The official evaluation shows that the joint performance of our entity detection and relation extraction models outperforms the winning team of the Shared Task by 19pp on F1-score, establishing a new top score for the task. We also achieve state-of-the-art results in the normalization task. Our system is open source and freely available at https://github.com/TurkuNLP/BHE.
Anthology ID:
W17-2310
Volume:
BioNLP 2017
Month:
August
Year:
2017
Address:
Vancouver, Canada,
Editors:
Kevin Bretonnel Cohen, Dina Demner-Fushman, Sophia Ananiadou, Junichi Tsujii
Venue:
BioNLP
SIG:
SIGBIOMED
Publisher:
Association for Computational Linguistics
Note:
Pages:
80–90
Language:
URL:
https://aclanthology.org/W17-2310
DOI:
10.18653/v1/W17-2310
Bibkey:
Cite (ACL):
Farrokh Mehryary, Kai Hakala, Suwisa Kaewphan, Jari Björne, Tapio Salakoski, and Filip Ginter. 2017. End-to-End System for Bacteria Habitat Extraction. In BioNLP 2017, pages 80–90, Vancouver, Canada,. Association for Computational Linguistics.
Cite (Informal):
End-to-End System for Bacteria Habitat Extraction (Mehryary et al., BioNLP 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-2310.pdf
Code
 TurkuNLP/BHE