Resources for Russian: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
Linas (talk | contribs)
Grammars: Link Grammar Parser, includes Russian dictionaries.
Zeman (talk | contribs)
HamleDT
Line 7: Line 7:
<!-- Please keep this list in alphabetical order -->
<!-- Please keep this list in alphabetical order -->


* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
* [http://www.helsinki.fi/venaja/english/e-material/hanco/index.htm HANCO: The Helsinki annotated corpus of Russian texts] (searchable, no visible download links)
* [http://www.helsinki.fi/venaja/english/e-material/hanco/index.htm HANCO: The Helsinki annotated corpus of Russian texts] (searchable, no visible download links)
* [http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora (uni-tuebingen.de)] (searchable, no visible download links)
* [http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora (uni-tuebingen.de)] (searchable, no visible download links)

Revision as of 15:51, 26 May 2014

Corpora

Free open source

  • MultiUN "A Multilingual corpus from United Nation Documents", the Russian portion is 876 MB, the other languages in the multilingual corpus are: English/French/Spanish/Arabic/Chinese/German
  • WMT corpora, including the Yandex 1M corpus, News Commentary, and News Crawl

Unknown license

POS taggers

Grammars

Various resources