Difference between revisions of "Resources for Indonesian"

From ACL Wiki
Jump to navigation Jump to search
(Created page with "==Machine translation== ===Free software=== * [https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms apertium-id-ms] – rule-based Indonesian<->Malay ma...")
 
(Added Indra and the Wordnet Bahasa)
 
(2 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==Machine translation==
+
==Corpora==
===Free software===
+
* [http://ilps.science.uva.nl/resources/bahasa Kompas and Tempo Online Collection] for evaluation purposes.
* [https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms apertium-id-ms] – rule-based Indonesian<->Malay machine translation engine, documented in ''Septina Dian Larasati and Vladislav Kuboň (2010) "[http://dl.dropbox.com/u/537350/paper/MALINDO-2010-final.pdf A Study of Indonesian-to-Malaysian MT System]". MALINDO 2010: Proceedings of the 4th International MALINDO Workshop. Jakarta, Indonesia, 2 August 2010''. Possible to use for morphological tagging.
+
* [http://www.panl10n.net/english/OutputsIndonesia2.htm 500,000 Word Bahasa Indonesia Corpus and Parallel English Translation] (A-NC-SA 3.0 licence)
 +
* [http://www.panl10n.net/english/OutputsIndonesia2.htm 500,000 Word Bahasa Indonesia Parallel Corpus with Penn Treebank] (A-NC-SA 3.0 licence)
 +
* [http://www.panl10n.net/english/OutputsIndonesia2.htm One Million POS Tagged Corpus of Bahasa Indonesia] (A-NC-SA 3.0 licence)
  
 +
==Tools==
 +
* [http://www.panl10n.net/english/OutputsIndonesia2.htm Part of Speech Tagger for Bahasa Indonesia] (GPL licence)
 +
* [https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms Rule-based Indonesian-Malay Machine Translation] by [http://dl.dropbox.com/u/537350/paper/MALINDO-2010-final.pdf Septina Dian Larasati]. Possible to use for morphological tagging.
 +
* [http://abisource.com/projects/link-grammar/ Link Grammar Parser], includes prototype Indonesian dictionaries.
 +
 +
==Grammars==
 +
 +
* [http://moin.delph-in.net/IndraTop Broad-coverage Indonesian Resource Grammar (INDRA)] based on HPSG, using the DELPH-IN infrastructure.
 +
 +
==Lexicons==
 +
 +
* [http://wn-msa.sourceforge.net/ Wordnet Bahasa] Semantic lexicon for Indonesian and Malay, linked to the Open Multilingual Wordnet.
  
  
 
[[Category:Resources by language|Indonesian]]
 
[[Category:Resources by language|Indonesian]]

Latest revision as of 22:28, 14 November 2018

Corpora

Tools

Grammars

Lexicons

  • Wordnet Bahasa Semantic lexicon for Indonesian and Malay, linked to the Open Multilingual Wordnet.