Automatic Prediction of Morphosemantic Relations

Svetla Koeva, Svetlozara Leseva, Ivelina Stoyanova, Tsvetana Dimitrova, Maria Todorova


Abstract
This paper presents a machine learning method for automatic identification and classification of morphosemantic relations (MSRs) between verb and noun synset pairs in the Bulgarian WordNet (BulNet). The core training data comprise 6,641 morphosemantically related verb–noun literal pairs from BulNet. The core dataset were preprocessed quality-wise by applying validation and reorganisation procedures. Further, the data were supplemented with negative examples of literal pairs not linked by an MSR. The designed supervised machine learning method uses the RandomTree algorithm and is implemented in Java with the Weka package. A set of experiments were performed to test various approaches to the task. Future work on improving the classifier includes adding more training data, employing more features, and fine-tuning. Apart from the language specific information about derivational processes, the proposed method is language independent.
Anthology ID:
2016.gwc-1.26
Volume:
Proceedings of the 8th Global WordNet Conference (GWC)
Month:
27--30 January
Year:
2016
Address:
Bucharest, Romania
Editors:
Christiane Fellbaum, Piek Vossen, Verginica Barbu Mititelu, Corina Forascu
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
169–177
Language:
URL:
https://aclanthology.org/2016.gwc-1.26
DOI:
Bibkey:
Cite (ACL):
Svetla Koeva, Svetlozara Leseva, Ivelina Stoyanova, Tsvetana Dimitrova, and Maria Todorova. 2016. Automatic Prediction of Morphosemantic Relations. In Proceedings of the 8th Global WordNet Conference (GWC), pages 169–177, Bucharest, Romania. Global Wordnet Association.
Cite (Informal):
Automatic Prediction of Morphosemantic Relations (Koeva et al., GWC 2016)
Copy Citation:
PDF:
https://aclanthology.org/2016.gwc-1.26.pdf