TUE at SemEval-2020 Task 1: Detecting Semantic Change by Clustering Contextual Word Embeddings

Anna Karnysheva, Pia Schwarz


Abstract
This paper describes our system for SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection. Target words of corpora from two different time periods are classified according to their semantic change. The languages covered are English, German, Latin, and Swedish. Our approach involves clustering ELMo embeddings using DBSCAN and K-means. For a more fine grained detection of semantic change we take the Jensen-Shannon Distance metric and rank the target words from strongest to weakest change. The results show that this is a valid approach for the classification subtask where we rank 13th out of 33 groups with an accuracy score of 61.2%. For the ranking subtask we score a Spearman’s rank-order correlation coefficient of 0.087 which places us on rank 29.
Anthology ID:
2020.semeval-1.28
Volume:
Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:
December
Year:
2020
Address:
Barcelona (online)
Editors:
Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:
SemEval
SIG:
SIGLEX
Publisher:
International Committee for Computational Linguistics
Note:
Pages:
232–238
Language:
URL:
https://aclanthology.org/2020.semeval-1.28
DOI:
10.18653/v1/2020.semeval-1.28
Bibkey:
Cite (ACL):
Anna Karnysheva and Pia Schwarz. 2020. TUE at SemEval-2020 Task 1: Detecting Semantic Change by Clustering Contextual Word Embeddings. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 232–238, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):
TUE at SemEval-2020 Task 1: Detecting Semantic Change by Clustering Contextual Word Embeddings (Karnysheva & Schwarz, SemEval 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.semeval-1.28.pdf