Difference between revisions of "Resources for Macedonian"

From ACL Wiki

Jump to navigation Jump to search

Revision as of 16:01, 7 October 2010

Machine translation systems

Free software

Proprietary

Morphological analysis

Free software

Morphological analyser 8,764 lemmata, ~92% coverage over SETimes

Proprietary

Corpora

Free

Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)

Bibliography

Vojnovski, V., S. Džeroski, and Erjavec, T. (2005) "Learning PoS tagging from a tagged Macedonian text corpus". Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses), Ljubljana, Slovenia, pp. 199-202.

A POS tagger for Macedonian is trained on the Macedonian of George Orwells Nineteen Eighty-Four

Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198

Gives a machine learning approach to learning Macedonian nouns.

External links

Retrieved from "https://aclweb.org/aclwiki/index.php?title=Resources_for_Macedonian&oldid=8210"

Resources by language