Difference between revisions of "Resources for Dutch"
Verhoevenben
Sean Bethard m (Move *[http://www.let.rug.nl/~vannoord/alp/ Algorithms for Linguistic Processing] from Uncategorized resources to Resources for Dutch)
(2 intermediate revisions by 2 users not shown)
Line 1:
== Corpora ==
+ * [http://ucts.uniba.sk/aranea_about/ Araneum Nederlandicum], Gigaword Dutch web corpus
* [http://corpora.informatik.uni-leipzig.de/ Dutch Plain text and Co-occurrences at LCC]
* [http://www.statmt.org/europarl Europarl corpus] - sentence-aligned with English
* [http://www.clips.uantwerpen.be/datasets/csi-corpus CLiPS Stylometry Investigation (CSI) corpus] - multi-purpose text corpus, main use in stylometry
+ * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
== Tools ==
* [http://www.let.rug.nl/~vannoord/alp/Alpino/ Dutch HPSG-based parser] Includes the Alpino treebank (7137 sentences, newspaper, manually corrected)
+ *[http://www.let.rug.nl/~vannoord/alp/ Algorithms for Linguistic Processing]
== Grammars ==
* [[Generation grammars|KPML generation grammar]]
−
−
[[Category:Resources by language|Dutch]]
Latest revision as of 19:04, 5 September 2019
Corpora
- Araneum Nederlandicum, Gigaword Dutch web corpus
- Dutch Plain text and Co-occurrences at LCC
- Europarl corpus - sentence-aligned with English
- CLiPS Stylometry Investigation (CSI) corpus - multi-purpose text corpus, main use in stylometry
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
Tools
- Dutch HPSG-based parser. Includes the Alpino treebank (7137 sentences, newspaper text, manually corrected)
- Algorithms for Linguistic Processing