Resources for Macedonian
Jump to navigation
Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.
Machine translation systems
Free software
- apertium-mk-bg RBMT system between Macedonian and Bulgarian
- apertium-mk-en RBMT system between Macedonian and English
Proprietary
Morphological analysis
Free software
- Morphological analyser 8,764 lemmata, ~92% coverage over SETimes
Proprietary
Corpora
Free
- Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)
Bibliography
- Vojnovski, V., S. Džeroski, and Erjavec, T. (2005) "Learning PoS tagging from a tagged Macedonian text corpus". Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses), Ljubljana, Slovenia, pp. 199-202.
- A POS tagger for Macedonian is trained on the Macedonian of George Orwells Nineteen Eighty-Four
- Ivanovska, A., Zdravkova, K., Džeroski, S., Erjavec, T. (2005) "Learning Rules for Morphological Analysis and Synthesis of Macedonian Nouns". Proceedings of IS 2005, the 8th International Multiconference on the Information Society, 11-17 October 2005, Ljubljana. pp. 195-198
- Gives a machine learning approach to learning Macedonian nouns.