UWB at SemEval-2020 Task 1: Lexical Semantic Change Detection

Ondřej Pražák, Pavel Přibáň, Stephen Taylor, Jakub Sido


Abstract
In this paper, we describe our method for detection of lexical semantic change, i.e., word sense changes over time. We examine semantic differences between specific words in two corpora, chosen from different time periods, for English, German, Latin, and Swedish. Our method was created for the SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection. We ranked 1st in Sub-task 1: binary change detection, and 4th in Sub-task 2: ranked change detection. We present our method which is completely unsupervised and language independent. It consists of preparing a semantic vector space for each corpus, earlier and later; computing a linear transformation between earlier and later spaces, using Canonical Correlation Analysis and orthogonal transformation;and measuring the cosines between the transformed vector for the target word from the earlier corpus and the vector for the target word in the later corpus.
Anthology ID:
2020.semeval-1.30
Volume:
Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:
December
Year:
2020
Address:
Barcelona (online)
Editors:
Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:
SemEval
SIG:
SIGLEX
Publisher:
International Committee for Computational Linguistics
Note:
Pages:
246–254
Language:
URL:
https://aclanthology.org/2020.semeval-1.30
DOI:
10.18653/v1/2020.semeval-1.30
Bibkey:
Cite (ACL):
Ondřej Pražák, Pavel Přibáň, Stephen Taylor, and Jakub Sido. 2020. UWB at SemEval-2020 Task 1: Lexical Semantic Change Detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 246–254, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):
UWB at SemEval-2020 Task 1: Lexical Semantic Change Detection (Pražák et al., SemEval 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.semeval-1.30.pdf
Code
 pauli31/SemEval2020-task1