Corpora, datasets, lexicons: Difference between revisions

From ACL Wiki


== Corpora ==
=== Multilingual ===
(alphabetical order)
* [http://spraakbanken.gu.se/ Bank of Swedish]
* [http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
* [http://hnk.ffzg.hr/ Croatian National Corpus (HNK)]
* [http://ucnk.ff.cuni.cz/ Czech National Corpus (CNC)]
* [http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
* [http://korpus.pl/ IPI PAN Corpus of Polish]
* [http://www.corpusdoportugues.org/ Portuguese Corpus]
* [http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
* [http://korpus.juls.savba.sk/ Slovak National Corpus (SNK)]
* [http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
* [http://www.corpusdelespanol.org/ Spanish Corpus]
* [http://www.csse.monash.edu.au/~jwb/tanakacorpus.html Tanaka Corpus: Japanese-English sentence pairs]

Revision as of 20:10, 2 November 2006