Difference between revisions of "Resources for Japanese"

From ACL Wiki
(→‎Multilingual: added Japanese basic JEC data)
Line 9:
 
====Multilingual====
 
* [http://www.edrdg.org/projects/tanaka/tanakacorpus.html Tanaka Corpus] by Jim Breen, under a CC-BY-SA 3.0 licence
** [http://tatoeba.org/eng/home Tatoeba] Updated version of the Tanaka Corpus;  ≈150,000 sentence pairs  (CC-BY)
+
* [http://www.thai-sbobet.com sbo] Updated version of the Tanaka Corpus;  ≈150,000 sentence pairs  (CC-BY)
 
* [http://alaginrc.nict.go.jp/WikiCorpus/index_E.html Japanese-English Bilingual Corpus of Wikipedia's Kyoto Articles]  ≈500,000 pairs of manually-translated sentences (CC-BY 3.0)
 
* [http://id.ndl.go.jp/auth/ndlsh National Diet Library Subject Headers]  Japanese Subject Headers, with paraphrases including English Translations([http://id.ndl.go.jp/auth/docs/about-ndlsh#03 non-commercial attribution])

Revision as of 02:13, 25 June 2012

There is a very good list at Kyoto University: Catalogue of Language Resources and Tools in Japan

Corpora

Proprietary

Free/Open Licence

Multilingual

Monolingual

Grammars

Free/Open Licence

Unknown licence

Dictionaries

Free/Open Licence

Unknown licence