Resources for Croatian

From ACL Wiki
Revision as of 07:38, 8 January 2008 by Pdturney (talk | contribs) (Reverted edits by Edward518 (Talk); changed back to last version by Dcavar)
Jump to navigation Jump to search

General

Corpora

  • Croatian Language Corpus (continuously growing (currently approx. 100 mil. tokens) corpus of Croatian covering various genres and time periods, using Philologic for online search)

Free

  • Southeast European Times (paragraph aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — 9,678 paragraphs, 92,450— 122,912 words per language)