Neural Language Models vs Wordnet-based Semantically Enriched Representation in CST Relation Recognition

Arkadiusz Janz, Maciej Piasecki, Piotr Wątorski


Abstract
Neural language models, including transformer-based models pre-trained on very large corpora, have become a common way to represent text in various tasks, including the recognition of textual semantic relations, e.g. those of Cross-document Structure Theory (CST). Pre-trained models are usually fine-tuned to downstream tasks, and the obtained vectors are used as input for deep neural classifiers; no linguistic knowledge obtained from resources and tools is utilised. In this paper we compare such universal approaches with a combination of a rich, graph-based, linguistically motivated sentence representation and a typical neural network classifier, applied to the task of recognising CST relations in Polish. The representation describes selected levels of sentence structure, including a description of lexical meanings based on wordnet (plWordNet) synsets and the SUMO concepts linked to them. The obtained results show that, in the case of difficult relations and a medium-sized training corpus, the semantically enriched text representation leads to significantly better results.
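
As a purely illustrative sketch (not the paper's actual configuration), the transformer baseline described in the abstract amounts to fine-tuning a pre-trained language model as a sentence-pair classifier over CST relation labels. The checkpoint name, label inventory and example sentences below are assumptions chosen for illustration only.

```python
# Minimal sketch of a transformer-style baseline for CST relation recognition:
# a pre-trained encoder used as a sentence-pair classifier. The model name,
# label set and example are assumptions, not the paper's setup.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Hypothetical label set; the CST relation inventory used in the paper may differ.
CST_LABELS = ["Identity", "Elaboration", "Overlap", "Subsumption", "No-relation"]

# Any multilingual or Polish pre-trained encoder could be plugged in here.
MODEL_NAME = "bert-base-multilingual-cased"

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
# The classification head below is randomly initialised; in practice it would
# be fine-tuned on labelled CST sentence pairs before producing useful output.
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME, num_labels=len(CST_LABELS)
)

def classify_pair(sentence_a: str, sentence_b: str) -> str:
    """Predict a CST relation label for a pair of sentences."""
    inputs = tokenizer(sentence_a, sentence_b, truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    return CST_LABELS[int(logits.argmax(dim=-1))]

# Example: two sentences from different documents describing the same event.
print(classify_pair(
    "The company announced record profits for the third quarter.",
    "Record third-quarter profits were reported by the firm on Monday.",
))
```

The wordnet-based alternative compared in the paper instead builds a graph-based sentence representation enriched with plWordNet synsets and linked SUMO concepts before classification; that pipeline is not reproduced in the sketch above.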
Anthology ID: 2021.gwc-1.26
Volume: Proceedings of the 11th Global Wordnet Conference
Month: January
Year: 2021
Address: University of South Africa (UNISA)
Editors: Piek Vossen, Christiane Fellbaum
Venue: GWC
SIG: SIGLEX
Publisher: Global Wordnet Association
Pages: 223–233
URL: https://aclanthology.org/2021.gwc-1.26
Cite (ACL): Arkadiusz Janz, Maciej Piasecki, and Piotr Wątorski. 2021. Neural Language Models vs Wordnet-based Semantically Enriched Representation in CST Relation Recognition. In Proceedings of the 11th Global Wordnet Conference, pages 223–233, University of South Africa (UNISA). Global Wordnet Association.
Cite (Informal): Neural Language Models vs Wordnet-based Semantically Enriched Representation in CST Relation Recognition (Janz et al., GWC 2021)
PDF: https://aclanthology.org/2021.gwc-1.26.pdf