The Ontology of Bulgarian Dialects – Architecture and Information Retrieval

Rositsa Dekova


Abstract
Following a concise description of its structure, the paper focuses on the potential of the Ontology of Bulgarian Dialects, which demonstrates a novel use of ontological modelling for the digital archiving and information processing of dialect data. The ontology covers 84 dialects of the Bulgarian language, spoken not only on the territory of the Republic of Bulgaria but also abroad. It encodes both their geographical distribution and some of their main diagnostic features, such as the different mutations (also referred to as reflexes) of some Old Bulgarian vowels. The mutations modelled so far are the reflex of the back nasal vowel /ѫ/ under stress, the reflex of the back er vowel /ъ/ under stress, and the reflex of the yat vowel /ѣ/ under stress when it precedes a syllable with a back vowel. Besides giving a formal structure to the considerable amount of data that dialectologists have gathered over the years, the ontology supports numerous kinds of information retrieval: searches by dialect, country, dialect region, city or village, and various combinations of diagnostic features.
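To make the retrieval possibilities listed in the abstract more concrete, the following is a minimal Python sketch of such searches over an in-memory list of records. It is not the ontology's actual representation or query interface, which the abstract does not specify. The `Dialect` class, the `search` function, the feature keys (`back_nasal`, `back_er`, `yat`), and every dialect name, place name, and reflex value are hypothetical placeholders invented for illustration, not data from the ontology.

```python
from dataclasses import dataclass, field


@dataclass
class Dialect:
    """One dialect record (hypothetical structure, for illustration only)."""
    name: str
    region: str
    country: str
    settlements: set = field(default_factory=set)
    # Maps a diagnostic feature to the reflex observed in this dialect.
    reflexes: dict = field(default_factory=dict)


# Placeholder records: names and reflex values ("R1", "R2") are invented.
DIALECTS = [
    Dialect("dialect_A", "region_1", "country_X", {"village_1", "village_2"},
            {"back_nasal": "R1", "back_er": "R1", "yat": "R2"}),
    Dialect("dialect_B", "region_2", "country_X", {"town_1"},
            {"back_nasal": "R2", "back_er": "R1", "yat": "R2"}),
    Dialect("dialect_C", "region_3", "country_Y", {"village_3"},
            {"back_nasal": "R1", "back_er": "R2", "yat": "R1"}),
]


def search(dialects, *, country=None, region=None, settlement=None, **reflexes):
    """Return the names of dialects that satisfy every given criterion.

    Keyword arguments other than country, region and settlement are read
    as diagnostic features, e.g. back_nasal="R1".
    """
    matches = []
    for d in dialects:
        if country is not None and d.country != country:
            continue
        if region is not None and d.region != region:
            continue
        if settlement is not None and settlement not in d.settlements:
            continue
        # All requested reflexes must agree with the record.
        if any(d.reflexes.get(feature) != value
               for feature, value in reflexes.items()):
            continue
        matches.append(d.name)
    return matches


by_country = search(DIALECTS, country="country_X")
by_features = search(DIALECTS, back_nasal="R1", yat="R2")
```

Combining criteria narrows the result in the same way that a combination of diagnostic features would: with the placeholder data above, `by_country` holds two dialects, while `by_features` holds only the one record that matches both reflexes.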
Anthology ID:
2020.lrec-1.600
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
4877–4882
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.600
Cite (ACL):
Rositsa Dekova. 2020. The Ontology of Bulgarian Dialects – Architecture and Information Retrieval. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 4877–4882, Marseille, France. European Language Resources Association.
Cite (Informal):
The Ontology of Bulgarian Dialects – Architecture and Information Retrieval (Dekova, LREC 2020)
PDF:
https://aclanthology.org/2020.lrec-1.600.pdf