Difference between revisions of "Corpora for English"

From ACL Wiki
Jump to navigation Jump to search
(Added ANNIS search tool)
(+ICE)
Line 10: Line 10:
 
*[https://corpling.uis.georgetown.edu/gum/ GUM - Georgetown University Multilayer corpus], multiple parses, coreference, entities, sentence types and RST
 
*[https://corpling.uis.georgetown.edu/gum/ GUM - Georgetown University Multilayer corpus], multiple parses, coreference, entities, sentence types and RST
 
*[https://www.gutenberg.org Project Gutenberg]
 
*[https://www.gutenberg.org Project Gutenberg]
 +
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm International Corpus of English]
 
*[http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
*[http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
*[http://prize.hutter1.net/ Hutter Prize for Lossless Compression of Human Knowledge 100M sample of Wikipedia]
 
*[http://prize.hutter1.net/ Hutter Prize for Lossless Compression of Human Knowledge 100M sample of Wikipedia]

Revision as of 08:48, 26 June 2016

For languages other than English, see List of resources by language.

Free and Downloadable

Proprietary or Require Prior Permission


Link collections

Corpora tools