Resources for Finnish: Difference between revisions
→Corpora: +Europarl corpus
Line 1:
  ==Corpora==
+ * [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
  * [http://corpora.informatik.uni-leipzig.de/ Finnish plain text and Co-occurrences at LCC]
  * [http://www.csc.fi/english/research/sciences/linguistics/index_html CSC Kielipankki] Language Bank at the [http://www.csc.fi/ CSC] Scientific Computing Centre, including some 200 million word tokens of Finnish texts.
Revision as of 17:15, 12 October 2013
Corpora
- Europarl corpus, sentence aligned with English
- Finnish plain text and Co-occurrences at LCC
- CSC Kielipankki Language Bank at the CSC Scientific Computing Centre, including some 200 million word tokens of Finnish texts.
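Because the Europarl corpus is sentence aligned with English, one way to use it is to read the Finnish and English sides in parallel, one sentence pair per line. The sketch below assumes the corpus has been unpacked into two plain-text UTF-8 files whose lines correspond one to one; the file names and the helper `read_aligned_pairs` are illustrative assumptions, not part of the corpus distribution.

```python
import os
from itertools import zip_longest
from typing import Iterator, Tuple


def read_aligned_pairs(fi_path: str, en_path: str) -> Iterator[Tuple[str, str]]:
    """Yield (Finnish, English) sentence pairs from two line-aligned files.

    Assumes line i of fi_path is the translation of line i of en_path.
    """
    with open(fi_path, encoding="utf-8") as fi_file, \
            open(en_path, encoding="utf-8") as en_file:
        for fi_line, en_line in zip_longest(fi_file, en_file):
            # zip_longest pads the shorter file with None, which means
            # the two files are not aligned line by line.
            if fi_line is None or en_line is None:
                raise ValueError("files have different numbers of lines")
            fi_sent, en_sent = fi_line.strip(), en_line.strip()
            # Skip pairs where either side is empty.
            if fi_sent and en_sent:
                yield fi_sent, en_sent


if __name__ == "__main__":
    # Hypothetical file names; adjust to wherever the corpus was unpacked.
    fi_name, en_name = "europarl.fi-en.fi", "europarl.fi-en.en"
    if os.path.exists(fi_name) and os.path.exists(en_name):
        for i, (fi, en) in enumerate(read_aligned_pairs(fi_name, en_name)):
            print(f"{fi}\t{en}")
            if i >= 4:  # show only the first five pairs
                break
```

Reading lazily with a generator keeps memory use flat, which matters for a corpus of this size. Raising on a length mismatch is a deliberate choice here: a silently truncated file would otherwise shift every later pair out of alignment.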
Morphological analysers
Free software
- Omorfi (Open Morphology for Finnish), developed in association with the Voikko speller project. See also https://kitwiki.csc.fi/twiki/bin/view/KitWiki/OmorfiHFSTVersion for installation with HFST. (LGPL/GPL)