Maria Kobozeva


2019

pdf bib
Towards the Data-driven System for Rhetorical Parsing of Russian Texts
Artem Shelmanov | Dina Pisarevskaya | Elena Chistova | Svetlana Toldova | Maria Kobozeva | Ivan Smirnov
Proceedings of the Workshop on Discourse Relation Parsing and Treebanking 2019

Results of the first experimental evaluation of machine learning models trained on Ru-RSTreebank – first Russian corpus annotated within RST framework – are presented. Various lexical, quantitative, morphological, and semantic features were used. In rhetorical relation classification, ensemble of CatBoost model with selected features and a linear SVM model provides the best score (macro F1 = 54.67 ± 0.38). We discover that most of the important features for rhetorical relation classification are related to discourse connectives derived from the connectives lexicon for Russian and from other sources.

2017

pdf bib
Rhetorical relations markers in Russian RST Treebank
Svetlana Toldova | Dina Pisarevskaya | Margarita Ananyeva | Maria Kobozeva | Alexander Nasedkin | Sofia Nikiforova | Irina Pavlova | Alexey Shelepov
Proceedings of the 6th Workshop on Recent Advances in RST and Related Formalisms