Difference between revisions of "Resources for Russian"

From ACL Wiki
Jump to: navigation, search
(Corpora)
(Free open source: +WMT corpora)
Line 2: Line 2:
 
===Free open source===
 
===Free open source===
 
* [http://www.euromatrixplus.net/multi-un/ MultiUN] "A Multilingual corpus from United Nation Documents", the Russian portion is 876 MB, the other languages in the multilingual corpus are: English/French/Spanish/Arabic/Chinese/German
 
* [http://www.euromatrixplus.net/multi-un/ MultiUN] "A Multilingual corpus from United Nation Documents", the Russian portion is 876 MB, the other languages in the multilingual corpus are: English/French/Spanish/Arabic/Chinese/German
 +
* [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl
  
 
===Unknown license===
 
===Unknown license===

Revision as of 11:05, 12 October 2013

Corpora

Free open source

  • MultiUN "A Multilingual corpus from United Nation Documents", the Russian portion is 876 MB, the other languages in the multilingual corpus are: English/French/Spanish/Arabic/Chinese/German
  • WMT corpora, including Europarl, News Commentary, and News Crawl

Unknown license

POS taggers

Grammars

Various resources