Difference between revisions of "Resources for Swedish"
Jump to navigation
Jump to search
(→Corpora: +Europarl corpus) |
|||
(One intermediate revision by one other user not shown) | |||
Line 7: | Line 7: | ||
===Proprietary=== | ===Proprietary=== | ||
+ | ==Morphology== | ||
+ | ===Free software=== | ||
+ | * [https://github.com/mikahama/uralicNLP UralicNLP] is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many languages including Swedish | ||
==Lexical resources== | ==Lexical resources== | ||
Line 22: | Line 25: | ||
===Proprietary=== | ===Proprietary=== | ||
+ | * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style. | ||
* [http://www.ling.su.se/staff/sofia/suc/suc.html Stockholm Umeå Corpus] (Tagged Corpus, freely available for research purposes) | * [http://www.ling.su.se/staff/sofia/suc/suc.html Stockholm Umeå Corpus] (Tagged Corpus, freely available for research purposes) | ||
Latest revision as of 04:38, 29 June 2020
Machine translation systems
Free software
- Apertium Pre-alpha Swedish<->Danish material is available from CVS.
Proprietary
Morphology
Free software
- UralicNLP is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many languages including Swedish
Lexical resources
Free software
- Den stora svenska ordlistan, 8.8 MB plaintext, License: CC-BY-SA
Proprietary
- Talbanken05 (Dependency Treebank, freely available for research purposes)
Corpora
Free
- Europarl corpus, sentence aligned with English
- Swedish plain text and Co-occurrences at LCC
Proprietary
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- Stockholm Umeå Corpus (Tagged Corpus, freely available for research purposes)