Difference between revisions of "Resources for Macedonian"

From ACL Wiki
Jump to navigation Jump to search
m (-*)
 
(4 intermediate revisions by the same user not shown)
Line 2: Line 2:
  
 
===Free software===
 
===Free software===
 +
 +
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-bg apertium-mk-bg] RBMT system between Macedonian and Bulgarian
 +
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-en apertium-mk-en] RBMT system between Macedonian and English
 +
 +
===Proprietary===
 +
 +
==Morphological analysis==
 +
 +
===Free software===
 +
 +
* [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-bg/apertium-mk-bg.mk.dix Morphological analyser] 8,764 lemmata, ~92% coverage over SETimes
  
 
===Proprietary===
 
===Proprietary===
 +
 +
==Corpora==
 +
 +
===Free===
 +
* [http://www.statmt.org/setimes/ Southeast European Times] (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)
  
 
==Bibliography==
 
==Bibliography==
Line 9: Line 25:
 
* Vojnovski, V., S. Džeroski, and Erjavec, T. (2005) "[http://kt.ijs.si/dunja/SiKDD2005/Papers/VojnovskiTaggingSiKDD2005.pdf Learning PoS tagging from a tagged Macedonian text corpus]". ''Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses), Ljubljana, Slovenia'', pp. 199-202.  
 
* Vojnovski, V., S. Džeroski, and Erjavec, T. (2005) "[http://kt.ijs.si/dunja/SiKDD2005/Papers/VojnovskiTaggingSiKDD2005.pdf Learning PoS tagging from a tagged Macedonian text corpus]". ''Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses), Ljubljana, Slovenia'', pp. 199-202.  
 
::A POS tagger for Macedonian is trained on the Macedonian of George Orwells ''Nineteen Eighty-Four''
 
::A POS tagger for Macedonian is trained on the Macedonian of George Orwells ''Nineteen Eighty-Four''
 +
* Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". ''Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198
 +
::Gives a machine learning approach to learning Macedonian nouns.
  
 
==External links==
 
==External links==

Latest revision as of 16:04, 7 October 2010

Machine translation systems

Free software

Proprietary

Morphological analysis

Free software

Proprietary

Corpora

Free

  • Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)

Bibliography

A POS tagger for Macedonian is trained on the Macedonian of George Orwells Nineteen Eighty-Four
  • Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198
Gives a machine learning approach to learning Macedonian nouns.

External links