A comparative study of word embeddings and other features for lexical complexity detection in French

Aina Garí Soler, Marianna Apidianaki, Alexandre Allauzen


Abstract
Lexical complexity detection is an important step for automatic text simplification which serves to make informed lexical substitutions. In this study, we experiment with word embeddings for measuring the complexity of French words and combine them with other features that have been shown to be well-suited for complexity prediction. Our results on a synonym ranking task show that embeddings perform better than other features in isolation, but do not outperform frequency-based systems in this language.
Anthology ID:
2018.jeptalnrecital-court.34
Volume:
Actes de la Conférence TALN. Volume 1 - Articles longs, articles courts de TALN
Month:
5
Year:
2018
Address:
Rennes, France
Editors:
Pascale Sébillot, Vincent Claveau
Venue:
JEP/TALN/RECITAL
SIG:
Publisher:
ATALA
Note:
Pages:
499–508
Language:
URL:
https://aclanthology.org/2018.jeptalnrecital-court.34
DOI:
Bibkey:
Cite (ACL):
Aina Garí Soler, Marianna Apidianaki, and Alexandre Allauzen. 2018. A comparative study of word embeddings and other features for lexical complexity detection in French. In Actes de la Conférence TALN. Volume 1 - Articles longs, articles courts de TALN, pages 499–508, Rennes, France. ATALA.
Cite (Informal):
A comparative study of word embeddings and other features for lexical complexity detection in French (Garí Soler et al., JEP/TALN/RECITAL 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.jeptalnrecital-court.34.pdf