Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala)

Jonas Kuhn, B’alam Mateo-Toledo


Abstract
This paper reports on a number of experiments in which we applied standard techniques from NLP in the context of documentation of endangered languages. We concentrated on the use of existing, freely available toolkits. Specifically, we explore the use of Finite-State Morphological Analysis, Maximum Entropy Part-of-Speech Tagging, and N-Gram Language Modeling.
Anthology ID:
L04-1240
Volume:
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Month:
May
Year:
2004
Address:
Lisbon, Portugal
Editors:
Maria Teresa Lino, Maria Francisca Xavier, Fátima Ferreira, Rute Costa, Raquel Silva
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2004/pdf/412.pdf
DOI:
Bibkey:
Cite (ACL):
Jonas Kuhn and B’alam Mateo-Toledo. 2004. Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala). In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04), Lisbon, Portugal. European Language Resources Association (ELRA).
Cite (Informal):
Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala) (Kuhn & Mateo-Toledo, LREC 2004)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2004/pdf/412.pdf