Resources for Macedonian: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
m -*
 
(4 intermediate revisions by the same user not shown)
Line 2: Line 2:


===Free software===
===Free software===
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-bg apertium-mk-bg] RBMT system between Macedonian and Bulgarian
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-en apertium-mk-en] RBMT system between Macedonian and English
===Proprietary===
==Morphological analysis==
===Free software===
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-bg/apertium-mk-bg.mk.dix Morphological analyser] 8,764 lemmata, ~92% coverage over SETimes


===Proprietary===
===Proprietary===
==Corpora==
===Free===
* [http://www.statmt.org/setimes/ Southeast European Times] (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)


==Bibliography==
==Bibliography==
Line 9: Line 25:
* Vojnovski, V., S. Džeroski, and Erjavec, T. (2005) "[http://kt.ijs.si/dunja/SiKDD2005/Papers/VojnovskiTaggingSiKDD2005.pdf Learning PoS tagging from a tagged Macedonian text corpus]". ''Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses), Ljubljana, Slovenia'', pp. 199-202.  
* Vojnovski, V., S. Džeroski, and Erjavec, T. (2005) "[http://kt.ijs.si/dunja/SiKDD2005/Papers/VojnovskiTaggingSiKDD2005.pdf Learning PoS tagging from a tagged Macedonian text corpus]". ''Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses), Ljubljana, Slovenia'', pp. 199-202.  
::A POS tagger for Macedonian is trained on the Macedonian of George Orwells ''Nineteen Eighty-Four''
::A POS tagger for Macedonian is trained on the Macedonian of George Orwells ''Nineteen Eighty-Four''
* Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". ''Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198
::Gives a machine learning approach to learning Macedonian nouns.


==External links==
==External links==

Latest revision as of 23:04, 7 October 2010

Machine translation systems

Free software

Proprietary

Morphological analysis

Free software

Proprietary

Corpora

Free

  • Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)

Bibliography

A POS tagger for Macedonian is trained on the Macedonian of George Orwells Nineteen Eighty-Four
  • Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198
Gives a machine learning approach to learning Macedonian nouns.

External links