Resources for Finnish: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
distinguish free vs. non-free corpora; +corpus link; etc.
Mikahama (talk | contribs)
No edit summary
Line 10: Line 10:
* [http://www.kielipankki.fi CSC Kielipankki] Language Bank at the [http://www.csc.fi/ CSC] Scientific Computing Centre, including some 200 million word tokens of Finnish texts.
* [http://www.kielipankki.fi CSC Kielipankki] Language Bank at the [http://www.csc.fi/ CSC] Scientific Computing Centre, including some 200 million word tokens of Finnish texts.


==Morphological analysers==
==NLP Tools==
===Free software===
===Free software===
* [https://github.com/mikahama/uralicNLP UralicNLP] is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many Uralic languages including Finnish
* [https://gna.org/projects/omorfi/ Omorfi] is an Open Morphology for Finnish, in association with the [[voikko]] speller project, see also https://kitwiki.csc.fi/twiki/bin/view/KitWiki/OmorfiHFSTVersion for installing with [[HFST]]. (LGPL/GPL)
* [https://gna.org/projects/omorfi/ Omorfi] is an Open Morphology for Finnish, in association with the [[voikko]] speller project, see also https://kitwiki.csc.fi/twiki/bin/view/KitWiki/OmorfiHFSTVersion for installing with [[HFST]]. (LGPL/GPL)
* [https://github.com/mikahama/finmeter FinMeter] can be used to analyze Finnish poetry. The functionalities include ryhme, meter, metaphor interpretation and sentiment analysis.
* [https://github.com/mikahama/murre Murre] can normalize spoken or dialectal Finnsh text into the standard written norm. It can also generate dialectal forms from standard Finnish




[[Category:Resources by language|Finnish]]
[[Category:Resources by language|Finnish]]

Revision as of 10:28, 29 June 2020

Corpora

Free

Non-Free

  • Araneum Finnicum, Gigaword Finnish web corpus
  • CSC Kielipankki Language Bank at the CSC Scientific Computing Centre, including some 200 million word tokens of Finnish texts.

NLP Tools

Free software

  • UralicNLP is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many Uralic languages including Finnish
  • Omorfi is an Open Morphology for Finnish, in association with the voikko speller project, see also https://kitwiki.csc.fi/twiki/bin/view/KitWiki/OmorfiHFSTVersion for installing with HFST. (LGPL/GPL)
  • FinMeter can be used to analyze Finnish poetry. The functionalities include ryhme, meter, metaphor interpretation and sentiment analysis.
  • Murre can normalize spoken or dialectal Finnsh text into the standard written norm. It can also generate dialectal forms from standard Finnish