Bornholmsk Natural Language Processing: Resources and Tools

Leon Derczynski, Alex Speed Kjeldsen


Abstract
This paper introduces language processing resources and tools for Bornholmsk, a language spoken on the island of Bornholm, with roots in Danish and closely related to Scanian. This presents an overview of the language and available data, and the first NLP models for this living, minority Nordic language. Sammenfattnijng på borrijnholmst: Dæjnna artikkelijn introduserer natursprågsresurser å varktoi for borrijnholmst, ed språg a dær snakkes på ön Borrijnholm me rødder i danst å i nær familia me skånst. Artikkelijn gjer ed âuersyn âuer språged å di datan som fijnnes, å di fosste NLP modællarna for dætta læwenes nordiska minnretâlsspråged.
Anthology ID:
W19-6138
Volume:
Proceedings of the 22nd Nordic Conference on Computational Linguistics
Month:
September–October
Year:
2019
Address:
Turku, Finland
Editors:
Mareike Hartmann, Barbara Plank
Venue:
NoDaLiDa
SIG:
Publisher:
Linköping University Electronic Press
Note:
Pages:
338–344
Language:
URL:
https://aclanthology.org/W19-6138
DOI:
Bibkey:
Cite (ACL):
Leon Derczynski and Alex Speed Kjeldsen. 2019. Bornholmsk Natural Language Processing: Resources and Tools. In Proceedings of the 22nd Nordic Conference on Computational Linguistics, pages 338–344, Turku, Finland. Linköping University Electronic Press.
Cite (Informal):
Bornholmsk Natural Language Processing: Resources and Tools (Derczynski & Kjeldsen, NoDaLiDa 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-6138.pdf
Code
 StrombergNLP/bornholmsk
Data
BornholmskUniversal Dependencies