Difference between revisions of "Resources for Spanish"
Jump to navigation
Jump to search
(+MultiUN corpora) |
(Added: Araneum) |
||
(One intermediate revision by one other user not shown) | |||
Line 1: | Line 1: | ||
==Corpora== | ==Corpora== | ||
− | *[http://www.corpusdelespanol.org/ Corpus del Español] (website only) | + | * [http://ucts.uniba.sk/aranea_about/ Araneum Hispanicum], Gigaword Spanish web corpus |
+ | * [http://www.corpusdelespanol.org/ Corpus del Español] (website only) | ||
* [http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Española contemporanea: corpus oral peninsular] | * [http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Española contemporanea: corpus oral peninsular] | ||
+ | * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style. | ||
* [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl | * [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl | ||
* [http://www.euromatrixplus.net/multi-un/ UN parallel corpora] | * [http://www.euromatrixplus.net/multi-un/ UN parallel corpora] |
Revision as of 13:29, 8 March 2015
Corpora
- Araneum Hispanicum, Gigaword Spanish web corpus
- Corpus del Español (website only)
- Corpus de referencia de la lengua Española contemporanea: corpus oral peninsular
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- WMT corpora, including Europarl, News Commentary, and News Crawl
- UN parallel corpora