The MULINCO corpus and corpus platform

Bente Maegaard, Lene Offersgaard, Lina Henriksen, Hanne Jansen, Xavier Lepetit, Costanza Navarretta, Claus Povlsen


Abstract
The MULINCO project (MUltiLINgual Corpus of the University of Copenhagen) started in early 2005. The purpose of this cross-disciplinary project is to create a corpus platform for education and research in monolingual and translation studies. The project covers two main types of corpus texts: literary and non-literary. The platform is being developed using available tools as far as possible and integrating them in a very open architecture. In this paper we describe the current status and future development of both the text side and the tool side of the corpus platform, and we show some examples of student exercises that take advantage of tagged and aligned texts.
Anthology ID:
L06-1286
Volume:
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month:
May
Year:
2006
Address:
Genoa, Italy
Editors:
Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
URL:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/480_pdf.pdf
Cite (ACL):
Bente Maegaard, Lene Offersgaard, Lina Henriksen, Hanne Jansen, Xavier Lepetit, Costanza Navarretta, and Claus Povlsen. 2006. The MULINCO corpus and corpus platform. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal):
The MULINCO corpus and corpus platform (Maegaard et al., LREC 2006)