Resources for Russian: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
Kopotev (talk | contribs)
Kiwibird (talk | contribs)
Line 1: Line 1:
==Corpora==
==Corpora==
===Free open source===
* [http://www.euromatrixplus.net/multi-un/ MultiUN] "A Multilingual corpus from United Nation Documents", the Russian portion is 876 MB, the other languages in the multilingual corpus are: English/French/Spanish/Arabic/Chinese/German
===Unknown license===
<!-- Please keep this list in alphabetical order -->
<!-- Please keep this list in alphabetical order -->


* [http://www.helsinki.fi/venaja/english/e-material/hanco/index.htm HANCO: The Helsinki annotated corpus of Russian texts]
* [http://www.helsinki.fi/venaja/english/e-material/hanco/index.htm HANCO: The Helsinki annotated corpus of Russian texts] (searchable, no visible download links)
* [http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora (uni-tuebingen.de)]
* [http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora (uni-tuebingen.de)] (searchable, no visible download links)
* [http://corpus.leeds.ac.uk/ruscorpora.html Russian Internet Corpus]
* [http://corpus.leeds.ac.uk/ruscorpora.html Russian Internet Corpus]
* [http://www.ruscorpora.ru/ Russian National Corpus]
* [http://www.ruscorpora.ru/ Russian National Corpus]  
* [http://www.philol.msu.ru/~lex/corpus/ Russian Newspaper Corpus]
* [http://www.philol.msu.ru/~lex/corpus/ Russian Newspaper Corpus]
* [http://lib.ru/ Various texts in Russian (lib.ru)]
* [http://lib.ru/ Various texts in Russian (lib.ru)]

Revision as of 10:38, 18 April 2011

Corpora

Free open source

  • MultiUN "A Multilingual corpus from United Nation Documents", the Russian portion is 876 MB, the other languages in the multilingual corpus are: English/French/Spanish/Arabic/Chinese/German

Unknown license

POS taggers

Grammars

Various resources