Resources for Japanese

From ACLWiki
(Difference between revisions)
Jump to: navigation, search
(Multilingual)
 
(One intermediate revision by one user not shown)
Line 1: Line 1:
There is a very good list at Kyoto University: [http://www-lab25.kuee.kyoto-u.ac.jp/NLP_Portal/lr-cat-e.html Catalogue of Language Resources and Tools in Japan]
+
There is a very good list at JAIST: [http://www.jaist.ac.jp/project/NLP_Portal/doc/LR/lr-cat-e.html Catalogue of Language Resources and Tools in Japan]
  
 
==Corpora==
 
==Corpora==
Line 12: Line 12:
 
* [http://alaginrc.nict.go.jp/WikiCorpus/index_E.html Japanese-English Bilingual Corpus of Wikipedia's Kyoto Articles]  ≈500,000 pairs of manually-translated sentences (CC-BY 3.0)
 
* [http://alaginrc.nict.go.jp/WikiCorpus/index_E.html Japanese-English Bilingual Corpus of Wikipedia's Kyoto Articles]  ≈500,000 pairs of manually-translated sentences (CC-BY 3.0)
 
* [http://id.ndl.go.jp/auth/ndlsh National Diet Library Subject Headers]  Japanese Subject Headers, with paraphrases including English Translations ([http://id.ndl.go.jp/auth/docs/about-ndlsh#03 non-commercial attribution])
 
* [http://id.ndl.go.jp/auth/ndlsh National Diet Library Subject Headers]  Japanese Subject Headers, with paraphrases including English Translations ([http://id.ndl.go.jp/auth/docs/about-ndlsh#03 non-commercial attribution])
* [http://mastarpj.nict.go.jp/~mutiyama/align/index.html English-Japanese Translation Alignment Data]  aligned by [http://mastarpj.nict.go.jp/~mutiyama/ Masao Utiyama] (GFDL, CC-by-nc 1.0)
+
* [http://www2.nict.go.jp/univ-com/multi_trans/member/mutiyama/ English-Japanese Translation Alignment Data]  aligned by [http://mastarpj.nict.go.jp/~mutiyama/ Masao Utiyama] (GFDL, CC-by-nc 1.0)
 
* [http://nlpwww.nict.go.jp/wn-ja/index.en.html WordNet Definitions and Glosses]  ≈180,000 sentence/phrase pairs from the [http://nlpwww.nict.go.jp/wn-ja/index.en.html Japanese Wordnet] (WordNet license, similar to BSD)
 
* [http://nlpwww.nict.go.jp/wn-ja/index.en.html WordNet Definitions and Glosses]  ≈180,000 sentence/phrase pairs from the [http://nlpwww.nict.go.jp/wn-ja/index.en.html Japanese Wordnet] (WordNet license, similar to BSD)
 
* [http://nlpwww.nict.go.jp/wn-ja/eng/downloads.html#jsemcor Japanese Translation of SemCor] ≈14,000 sentences from the [http://nlpwww.nict.go.jp/wn-ja/index.en.html Japanese Wordnet], easily aligned to the [http://www.cse.unt.edu/~rada/downloads.html#semcor English source]  (WordNet license, similar to BSD)
 
* [http://nlpwww.nict.go.jp/wn-ja/eng/downloads.html#jsemcor Japanese Translation of SemCor] ≈14,000 sentences from the [http://nlpwww.nict.go.jp/wn-ja/index.en.html Japanese Wordnet], easily aligned to the [http://www.cse.unt.edu/~rada/downloads.html#semcor English source]  (WordNet license, similar to BSD)

Latest revision as of 04:27, 10 November 2013

There is a very good list at JAIST: Catalogue of Language Resources and Tools in Japan

Contents

Corpora

Proprietary

Free/Open Licence

Multilingual

Monolingual

Grammars

Free/Open Licence

Unknown licence

Dictionaries

Free/Open Licence

Unknown licence

Personal tools