Coreference-Based Text Simplification

Rodrigo Wilkens, Bruno Oberle, Amalia Todirascu


Abstract
Text simplification aims at adapting documents to make them easier to read by a given audience. Usually, simplification systems consider only lexical and syntactic levels, and, moreover, are often evaluated at the sentence level. Thus, studies on the impact of simplification in text cohesion are lacking. Some works add coreference resolution in their pipeline to address this issue. In this paper, we move forward in this direction and present a rule-based system for automatic text simplification, aiming at adapting French texts for dyslexic children. The architecture of our system takes into account not only lexical and syntactic but also discourse information, based on coreference chains. Our system has been manually evaluated in terms of grammaticality and cohesion. We have also built and used an evaluation corpus containing multiple simplification references for each sentence. It has been annotated by experts following a set of simplification guidelines, and can be used to run automatic evaluation of other simplification systems. Both the system and the evaluation corpus are freely available.
Anthology ID:
2020.readi-1.14
Volume:
Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI)
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Núria Gala, Rodrigo Wilkens
Venue:
READI
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
93–100
Language:
English
URL:
https://aclanthology.org/2020.readi-1.14
DOI:
Bibkey:
Cite (ACL):
Rodrigo Wilkens, Bruno Oberle, and Amalia Todirascu. 2020. Coreference-Based Text Simplification. In Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI), pages 93–100, Marseille, France. European Language Resources Association.
Cite (Informal):
Coreference-Based Text Simplification (Wilkens et al., READI 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.readi-1.14.pdf