Resources for Portugese
Jump to navigation
Jump to search
Corpora
- Colonia, corpus of historical Portuguese.
- Europarl corpus, sentence aligned with English
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
Software
- CEPRIL - Portugese Segmenter
- Corpógrafo - a Web-based environment for corpora research
Wordlists
- P-AWL - the Portuguese academic wordlist compiled as described in Baptista et al. (2010)