Difference between revisions of "Resources for Spanish"
Jump to navigation
Jump to search
Line 11: | Line 11: | ||
== Datasets == | == Datasets == | ||
− | * [http://lcl.uniroma1.it/similarity-datasets/ Spanish word similarity dataset] based on [http://www.aclweb.org/aclwiki/index.php?title=RG-65_Test_Collection_%28State_of_the_art%29 RG-65] | + | * [http://lcl.uniroma1.it/similarity-datasets/ Spanish word similarity dataset] based on [http://www.aclweb.org/aclwiki/index.php?title=RG-65_Test_Collection_%28State_of_the_art%29 RG-65]. |
[[Category:Resources by language|Spanish]] | [[Category:Resources by language|Spanish]] |
Revision as of 03:09, 30 June 2015
Corpora
- Araneum Hispanicum, Gigaword Spanish web corpus
- Corpus del Español (website only)
- Corpus de referencia de la lengua Española contemporanea: corpus oral peninsular
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- WMT corpora, including Europarl, News Commentary, and News Crawl
- UN parallel corpora
Grammars
Datasets
- Spanish word similarity dataset based on RG-65.