Difference between revisions of "Resources for German"

From ACL Wiki
Jump to navigation Jump to search
(HamleDT)
m (→‎Free license: update WMT link)
(One intermediate revision by one other user not shown)
Line 3: Line 3:
 
* [http://www.computing.dcu.ie/~ygraham/software.html RIA Open Source Rule Induction Tool] includes an LFG-parsed German-English phrase-aligned parallel corpus, a subset of the EuroParl corpus (4000 sentences for each language, the tool at least is LGPL)
 
* [http://www.computing.dcu.ie/~ygraham/software.html RIA Open Source Rule Induction Tool] includes an LFG-parsed German-English phrase-aligned parallel corpus, a subset of the EuroParl corpus (4000 sentences for each language, the tool at least is LGPL)
 
* [http://www.euromatrixplus.net/multi-un/ UN parallel corpora]
 
* [http://www.euromatrixplus.net/multi-un/ UN parallel corpora]
* [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl
+
* [http://www.statmt.org/wmt15/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl
  
 
===Unknown license===
 
===Unknown license===
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
  
 +
* [http://ucts.uniba.sk/aranea_about/ Araneum Germanicum], Gigaword German web corpus
 
* [http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]
 
* [http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]
 
* [http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]
 
* [http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]

Revision as of 08:57, 17 June 2015

Corpora

Free license

Unknown license

Evaluation datasets

Grammars

Morphological analysis

Free software

  • Morphisto, based on SMOR, is an SFST-based analyser and generator for German. (The morphology is GPLv2, but the lexicon is proprietary/non-commercial: CC-BY-SA-NC v3)
  • German morphology data, based on Morhpy, licensed under CC-BY-SA 3.0

Lexicons

Free software

  • DING - German-English Dictionary with approximately 253,000 entries (GPL 2 or later).
  • OpenThesaurus - German synonyms and associated terms (LGPL)

Proprietary/gratis

Unknown license

Resource Access

Timeline Analysis