Resources for Dutch
Revision as of 19:04, 5 September 2019 by Sean Bethard (talk | contribs) (Move *[http://www.let.rug.nl/~vannoord/alp/ Algorithms for Linguistic Processing] from Uncategorized resources to Resources for Dutch)
Corpora
- Araneum Nederlandicum, Gigaword Dutch web corpus
- Dutch Plain text and Co-occurrences at LCC
- Europarl corpus - sentence-aligned with English
- CLiPS Stylometry Investigation (CSI) corpus - multi-purpose text corpus, main use in stylometry
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
Tools
- Dutch HPSG-based parser Includes the Alpino treebank (7137 sentences, newspaper, manually corrected)
- Algorithms for Linguistic Processing