Difference between revisions of "Resources for Spanish"

From ACL Wiki
Jump to navigation Jump to search
(HamleDT)
 
(4 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
==Corpora==
 
==Corpora==
*[http://www.corpusdelespanol.org/ Corpus del Español] (website only)
+
* [http://ucts.uniba.sk/aranea_about/ Araneum Hispanicum], Gigaword Spanish web corpus
 +
* [http://www.corpusdelespanol.org/ Corpus del Español] (website only)
 
* [http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Española contemporanea: corpus oral peninsular]
 
* [http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Española contemporanea: corpus oral peninsular]
 
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
* [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl
 
* [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl
 
* [http://www.euromatrixplus.net/multi-un/ UN parallel corpora]
 
* [http://www.euromatrixplus.net/multi-un/ UN parallel corpora]
 +
* [https://dev.termwatch.es/~fresa/CORPUS/MSF2/ The Portuguese/Spanish corpus of Multi-Sentence Fusion]
 +
* [https://www.kaggle.com/mikahama/the-best-sarcasm-annotated-dataset-in-spanish Sarcasm annotated dataset]
  
 
== Grammars ==
 
== Grammars ==
 
* [[Generation grammars|KPML generation grammar]]
 
* [[Generation grammars|KPML generation grammar]]
 +
 +
== Datasets ==
 +
* [http://lcl.uniroma1.it/similarity-datasets/ Spanish word similarity dataset] based on [http://www.aclweb.org/aclwiki/index.php?title=RG-65_Test_Collection_%28State_of_the_art%29 RG-65].
 +
  
  
 
[[Category:Resources by language|Spanish]]
 
[[Category:Resources by language|Spanish]]

Latest revision as of 04:40, 29 June 2020

Corpora

Grammars

Datasets