Difference between revisions of "Resources for German"

From ACL Wiki
Jump to navigation Jump to search
(→‎Free software: revise section title, since corpora aren't software)
(10 intermediate revisions by 6 users not shown)
Line 1: Line 1:
 
==Corpora==
 
==Corpora==
===Free software===
+
===Free license===
* [http://www.computing.dcu.ie/~ygraham/software.html RIA Open Source Rule Induction Tool] includes an LFG-parsed German-English phrase-aligned parallel corpus (8000 sentences for each language, the tool at least is LGPL)
+
* [http://www.computing.dcu.ie/~ygraham/software.html RIA Open Source Rule Induction Tool] includes an LFG-parsed German-English phrase-aligned parallel corpus, a subset of the EuroParl corpus (4000 sentences for each language, the tool at least is LGPL)
 +
* [http://www.statmt.org/wmt13/translation-task.html#download WMT corpora], including [http://en.wikipedia.org/wiki/Europarl_corpus Europarl], News Commentary, and News Crawl
  
 
===Unknown license===
 
===Unknown license===
Line 22: Line 23:
 
== Grammars ==
 
== Grammars ==
 
* [[Generation grammars|KPML generation grammar]]
 
* [[Generation grammars|KPML generation grammar]]
 +
 +
== Morphological analysis ==
 +
=== Free software ===
 +
* [https://code.google.com/p/morphisto/ Morphisto], based on [[SMOR]], is an [[SFST]]-based analyser and generator for German. (The morphology is GPLv2, but the lexicon is proprietary/non-commercial: CC-BY-SA-NC v3)
 +
* [http://www.danielnaber.de/morphologie/index_en.html German morphology data], based on [http://www.wolfganglezius.de/doku.php?id=cl:morphy Morhpy], licensed under CC-BY-SA 3.0
  
 
==Lexicons==
 
==Lexicons==
 
===Free software===
 
===Free software===
 
* [http://www-user.tu-chemnitz.de/~fri/ding/ DING] - German-English Dictionary with approximately 253,000 entries (GPL 2 or later).
 
* [http://www-user.tu-chemnitz.de/~fri/ding/ DING] - German-English Dictionary with approximately 253,000 entries (GPL 2 or later).
 +
* [http://www.openthesaurus.de/ OpenThesaurus] - German synonyms and associated terms (LGPL)
  
 
===Proprietary/gratis===
 
===Proprietary/gratis===

Revision as of 11:00, 12 October 2013

Corpora

Free license

Unknown license

Evaluation datasets

Grammars

Morphological analysis

Free software

  • Morphisto, based on SMOR, is an SFST-based analyser and generator for German. (The morphology is GPLv2, but the lexicon is proprietary/non-commercial: CC-BY-SA-NC v3)
  • German morphology data, based on Morhpy, licensed under CC-BY-SA 3.0

Lexicons

Free software

  • DING - German-English Dictionary with approximately 253,000 entries (GPL 2 or later).
  • OpenThesaurus - German synonyms and associated terms (LGPL)

Proprietary/gratis

Unknown license

Resource Access

Timeline Analysis