Difference between revisions of "Resources for Portugese"

From ACL Wiki
Jump to navigation Jump to search
(New page: * [http://www.linguateca.pt/corpografo CorpÛgrafo])
 
(HamleDT)
(3 intermediate revisions by 2 users not shown)
Line 1: Line 1:
* [http://www.linguateca.pt/corpografo CorpÛgrafo]
+
 
 +
==Corpora==
 +
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 +
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 +
 
 +
==Software==
 +
* [http://lael.pucsp.br/corpora/segmentador/ CEPRIL] -  Portugese Segmenter
 +
* [http://www.linguateca.pt/corpografo Corpógrafo] - a Web-based environment for corpora research
 +
 
 +
 
 +
[[Category:Resources by language|Portugese]]

Revision as of 09:50, 26 May 2014

Corpora

  • Europarl corpus, sentence aligned with English
  • HamleDT, harmonized dependency treebanks of many languages, common annotation style.

Software

  • CEPRIL - Portugese Segmenter
  • Corpógrafo - a Web-based environment for corpora research