Automatic Error Detection concerning the Definite and Indefinite Conjugation in the HunLearner Corpus

Veronika Vincze, János Zsibrita, Péter Durst, Martina Katalin Szabó


Abstract
In this paper we present the results of automatic error detection, concerning the definite and indefinite conjugation in the extended version of the HunLearner corpus, the learners’ corpus of the Hungarian language. We present the most typical structures that trigger definite or indefinite conjugation in Hungarian and we also discuss the most frequent types of errors made by language learners in the corpus texts. We also illustrate the error types with sentences taken from the corpus. Our results highlight grammatical structures that might pose problems for learners of Hungarian, which can be fruitfully applied in the teaching and practicing of such constructions from the language teacher’s or learners’ point of view. On the other hand, these results may be exploited in extending the functionalities of a grammar checker, concerning the definiteness of the verb. Our automatic system was able to achieve perfect recall, i.e. it could find all the mismatches between the type of the object and the conjugation of the verb, which is promising for future studies in this area.
Anthology ID:
L14-1301
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3958–3962
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/340_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Veronika Vincze, János Zsibrita, Péter Durst, and Martina Katalin Szabó. 2014. Automatic Error Detection concerning the Definite and Indefinite Conjugation in the HunLearner Corpus. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3958–3962, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Automatic Error Detection concerning the Definite and Indefinite Conjugation in the HunLearner Corpus (Vincze et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/340_Paper.pdf