Difference between revisions of "Resources for Indonesian"
(→Tools: Link grammar parser has bare-bones prototype)
Revision as of 20:21, 16 December 2015
Corpora
- Kompas and Tempo Online Collection for evaluation purposes.
- 500,000 Word Bahasa Indonesia Corpus and Parallel English Translation (Attribution-NonCommercial-ShareAlike 3.0 licence)
- 500,000 Word Bahasa Indonesia Parallel Corpus with Penn Treebank (Attribution-NonCommercial-ShareAlike 3.0 licence)
- One Million POS Tagged Corpus of Bahasa Indonesia (Attribution-NonCommercial-ShareAlike 3.0 licence)
Tools
- Part of Speech Tagger for Bahasa Indonesia (GPL licence)
- Rule-based Indonesian-Malay Machine Translation by Septina Dian Larasati. It can potentially also be used for morphological tagging.
- Link Grammar Parser, which includes prototype Indonesian dictionaries.