Building a Norwegian Lexical Resource for Medical Entity Recognition

Ildiko Pilan, Pål H. Brekke, Lilja Øvrelid


Abstract
We present a large Norwegian lexical resource of categorized medical terms. The resource, which merges information from large medical databases, contains over 56,000 entries, including automatically mapped terms from a Norwegian medical dictionary. We describe the methodology behind this automatic dictionary entry mapping based on keywords and suffixes and further present the results of a manual evaluation performed on a subset by a domain expert. The evaluation indicated that ca. 80% of the mappings were correct.
Anthology ID:
2020.multilingualbio-1.2
Volume:
Proceedings of the LREC 2020 Workshop on Multilingual Biomedical Text Processing (MultilingualBIO 2020)
Month:
May
Year:
2020
Address:
Marseille, France
Editor:
Maite Melero
Venue:
MultilingualBIO
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
9–14
Language:
English
URL:
https://aclanthology.org/2020.multilingualbio-1.2
DOI:
Bibkey:
Cite (ACL):
Ildiko Pilan, Pål H. Brekke, and Lilja Øvrelid. 2020. Building a Norwegian Lexical Resource for Medical Entity Recognition. In Proceedings of the LREC 2020 Workshop on Multilingual Biomedical Text Processing (MultilingualBIO 2020), pages 9–14, Marseille, France. European Language Resources Association.
Cite (Informal):
Building a Norwegian Lexical Resource for Medical Entity Recognition (Pilan et al., MultilingualBIO 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.multilingualbio-1.2.pdf