Difference between revisions of "Resources for Portugese"

From ACL Wiki
Jump to navigation Jump to search
(added sections; +Europarl corpus)
(HamleDT)
Line 2: Line 2:
 
==Corpora==
 
==Corpora==
 
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 
+
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
  
 
==Software==
 
==Software==

Revision as of 08:50, 26 May 2014

Corpora

  • Europarl corpus, sentence aligned with English
  • HamleDT, harmonized dependency treebanks of many languages, common annotation style.

Software

  • CEPRIL - Portugese Segmenter
  • Corpógrafo - a Web-based environment for corpora research