Difference between revisions of "Resources for Danish"
Jump to navigation
Jump to search
(→Free) |
(HamleDT) |
||
Line 19: | Line 19: | ||
* [http://talkbank.org/SamtaleBank/ SamtaleBank] -- transcribed text with original audio, part of the [http://talkbank.org/ TalkBank] (GPL) | * [http://talkbank.org/SamtaleBank/ SamtaleBank] -- transcribed text with original audio, part of the [http://talkbank.org/ TalkBank] (GPL) | ||
* [http://www.isv.cbs.dk/~mbk/treebank/ PAROLE Corpus (SGML format)] (GPL) | * [http://www.isv.cbs.dk/~mbk/treebank/ PAROLE Corpus (SGML format)] (GPL) | ||
+ | * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style. | ||
* [http://korpus.dsl.dk/korpus2000/indgang.php Danish news corpus] | * [http://korpus.dsl.dk/korpus2000/indgang.php Danish news corpus] | ||
* [http://www.statmt.org/europarl/ EuroParl] -- parallel corpus Danish-English, 164 MB gzipped, "no known copyright restrictions" | * [http://www.statmt.org/europarl/ EuroParl] -- parallel corpus Danish-English, 164 MB gzipped, "no known copyright restrictions" |
Latest revision as of 08:38, 26 May 2014
Machine translation systems
Free software
- apertium-sv-da Swedish<->Danish machine translation system (morphological analysis/generation, bilingual dictionary with PoS information, HMM-based disambiguator). Demo of Swedish->Danish. (GPL)
Proprietary
- GramTrans Danish <-> {Catalan,Norwegian,Esperanto,Galician,German,Spanish,English}, based on Constraint Grammar
Lexical resources
Free software
Corpora
Free
- SamtaleBank -- transcribed text with original audio, part of the TalkBank (GPL)
- PAROLE Corpus (SGML format) (GPL)
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- Danish news corpus
- EuroParl -- parallel corpus Danish-English, 164 MB gzipped, "no known copyright restrictions"