Difference between revisions of "Resources for Macedonian"

From ACL Wiki
Jump to: navigation, search
(Corpora)
(Corpora)
Line 5: Line 5:
 
===Proprietary===
 
===Proprietary===
  
 +
 +
==Morphological analysis==
 +
 +
===Free software===
 +
 +
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-bg/apertium-mk-bg.mk.dix Morphological analyser] 8,764 lemmata, ~92% coverage over SETimes
 +
 +
===Proprietary===
  
 
==Corpora==
 
==Corpora==

Revision as of 16:01, 7 October 2010

Machine translation systems

Free software

Proprietary

Morphological analysis

Free software

Proprietary

Corpora

Free

  • Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)

Bibliography

A POS tagger for Macedonian is trained on the Macedonian of George Orwells Nineteen Eighty-Four
  • Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198
Gives a machine learning approach to learning Macedonian nouns.

External links