Difference between revisions of "Corpora, datasets, lexicons"

From ACL Wiki
  
 
== Corpora ==
 
 
 
 
=== Multilingual ===
 
(alphabetical order)
 
* [http://spraakbanken.gu.se/ Bank of Swedish]
* [http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
* [http://hnk.ffzg.hr/ Croatian National Corpus (HNK)]
* [http://ucnk.ff.cuni.cz/ Czech National Corpus (CNC)]
* [http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
* [http://korpus.pl/ IPI PAN Corpus of Polish]
* [http://www.corpusdoportugues.org/ Portuguese Corpus]
* [http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
* [http://korpus.juls.savba.sk/ Slovak National Corpus (SNK)]
* [http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
* [http://www.corpusdelespanol.org/ Spanish Corpus]
* [http://www.csse.monash.edu.au/~jwb/tanakacorpus.html Tanaka Corpus: Japanese-English sentence pairs]
 

Revision as of 14:10, 2 November 2006