Negative language transfer in learner English: A new dataset

Leticia Farias Wanderley, Nicole Zhao, Carrie Demmans Epp


Abstract
Automatic personalized corrective feedback can help language learners from different backgrounds better acquire a new language. This paper introduces a learner English dataset in which learner errors are accompanied by information about possible error sources. This dataset contains manually annotated error causes for learner writing errors. These causes tie learner mistakes to structures from their first languages, when the rules in English and in the first language diverge. This new dataset will enable second language acquisition researchers to computationally analyze a large quantity of learner errors that are related to language transfer from the learners’ first language. The dataset can also be applied in personalizing grammatical error correction systems according to the learners’ first language and in providing feedback that is informed by the cause of an error.
Anthology ID:
2021.naacl-main.251
Volume:
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
June
Year:
2021
Address:
Online
Editors:
Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tur, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, Yichao Zhou
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3129–3142
Language:
URL:
https://aclanthology.org/2021.naacl-main.251
DOI:
10.18653/v1/2021.naacl-main.251
Bibkey:
Cite (ACL):
Leticia Farias Wanderley, Nicole Zhao, and Carrie Demmans Epp. 2021. Negative language transfer in learner English: A new dataset. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3129–3142, Online. Association for Computational Linguistics.
Cite (Informal):
Negative language transfer in learner English: A new dataset (Farias Wanderley et al., NAACL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.naacl-main.251.pdf
Video:
 https://aclanthology.org/2021.naacl-main.251.mp4
Code
 EdTeKLA/LanguageTransfer
Data
FCEPenn Treebank