Difference between revisions of "Resources for German"
Line 11:
  * [http://www.sfs.uni-tuebingen.de/en_tuebads.shtml Tübingen Treebank of Spoken German (TüBa-D/S, aka Verbmobil treebank)]
  * [http://www.sfs.uni-tuebingen.de/en_tuepp.shtml Tübingen Partially Parsed Corpus of Written German (TüPP-D/Z)]
+ * [http://www.coli.uni-saarland.de/~gparis/LMD-TAZ_corpus/ Le Monde Diplomatique-Tageszeitung Parallel Corpus]
  ==Evaluation datasets==
Revision as of 00:44, 10 June 2010
Corpora
- Bavarian Archive for Speech Signals Corpora
- COSMAS II
- Experimental Corpus Query System (University of Stuttgart, Germany)
- German plain text and Co-occurrences at LCC
- NEGRA Corpus
- TIGER treebank
- Tübingen Treebank of Written German (TüBa-D/Z)
- Tübingen Treebank of Spoken German (TüBa-D/S, aka Verbmobil treebank)
- Tübingen Partially Parsed Corpus of Written German (TüPP-D/Z)
- Le Monde Diplomatique-Tageszeitung Parallel Corpus
Evaluation datasets
Grammars
Lexicons
Free software
- DING - German-English Dictionary with approximately 253,000 entries (GPL 2 or later).
Proprietary/gratis
- Lexical information for German ("The data is freely available for education, research and other non-commercial purposes.")
- Canoo.net - German Dictionaries and Grammars
Unknown license
- IMSLex German Lexicon (no license information; only a "sample" is available for download)
- mOlif morphological analyzer (broken link)