Difference between revisions of "Resources for Japanese"

From ACL Wiki
Jump to navigation Jump to search
Line 21: Line 21:
 
====Monolingual====
 
====Monolingual====
 
* [http://www-lab25.kuee.kyoto-u.ac.jp/NLP_Portal/lr-cat-e.html#jp:knb_corpus Kyoto University and NTT Blog Corpus]
 
* [http://www-lab25.kuee.kyoto-u.ac.jp/NLP_Portal/lr-cat-e.html#jp:knb_corpus Kyoto University and NTT Blog Corpus]
 +
* [http://www.edrdg.org/~jwb/compv/ Compilation of 64,776 potential Japanese compound verbs]
  
 
== Grammars ==
 
== Grammars ==

Revision as of 19:09, 11 October 2017

There is a very good list at JAIST: Catalogue of Language Resources and Tools in Japan

Corpora

Proprietary

Free/Open Licence

Multilingual

Monolingual

Grammars

Free/Open Licence

Unknown licence

Dictionaries

Free/Open Licence

  • JMdict/EDICT Japanese-English and Japanese-Multilanguage dictionary in text and XML formats, by EDRDG (Electronic Dictionary R&D Group) - 170,000 entries, (CC-BY-SA 3.0 licence)
  • ENAMDICT/JMnedict proper name dictionary in text and XML formats - 740,000 entries, by EDRDG (Electronic Dictionary R&D Group), (CC-BY-SA 3.0 licence)
  • Japanese version of WordNet by NICT, (WordNet license, like BSD)
  • Kanjidic/Kanjidic2 Kanji dictionaries in text and XML formats covering about 13,000 characters, by EDRDG (Electronic Dictionary R&D Group), (CC-BY-SA 3.0 licence)

Unknown licence