Multilingual Corpora: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
move monolingual corpora to individual language's page, part 1
move monolingual corpora to individual language's pages, part 2
Line 17: Line 17:
*[http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
*[http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
*[http://korpus.pl/ IPI PAN Corpus of Polish]
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
*[http://corpora.informatik.uni-leipzig.de Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads]
*[http://corpora.informatik.uni-leipzig.de Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads]
Line 29: Line 27:
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
*[http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
*[http://www.corpusdoportugues.org/ Portuguese Corpus]
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
*[http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
*[http://www.ling.su.se/DaLi/research/smultron/index.htm SMULTRON Corpus] parallell treebank of English, German and Swedish
*[http://www.ling.su.se/DaLi/research/smultron/index.htm SMULTRON Corpus] parallell treebank of English, German and Swedish
*[http://www.corpusdelespanol.org/ Spanish Corpus]
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
*[http://conferences.unite.un.org/UNCorpus/ UN parallel corpora]
*[http://conferences.unite.un.org/UNCorpus/ UN parallel corpora]

Revision as of 14:56, 26 June 2016

This page lists multilingual corpora. For monolingual corpora, see List of resources by language.

See also Multilingual resources.