Difference between revisions of "Resources for Swedish"
Jump to navigation
Jump to search
(5 intermediate revisions by 4 users not shown) | |||
Line 7: | Line 7: | ||
===Proprietary=== | ===Proprietary=== | ||
+ | ==Morphology== | ||
+ | ===Free software=== | ||
+ | * [https://github.com/mikahama/uralicNLP UralicNLP] is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many languages including Swedish | ||
==Lexical resources== | ==Lexical resources== | ||
+ | ===Free software=== | ||
+ | * [http://www.dsso.se/ Den stora svenska ordlistan], 8.8 MB plaintext, License: [[CC-BY-SA]] | ||
===Proprietary=== | ===Proprietary=== | ||
Line 15: | Line 20: | ||
==Corpora== | ==Corpora== | ||
+ | ===Free=== | ||
+ | * [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English | ||
+ | * [http://corpora.informatik.uni-leipzig.de/ Swedish plain text and Co-occurrences at LCC] | ||
===Proprietary=== | ===Proprietary=== | ||
− | + | * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style. | |
* [http://www.ling.su.se/staff/sofia/suc/suc.html Stockholm Umeå Corpus] (Tagged Corpus, freely available for research purposes) | * [http://www.ling.su.se/staff/sofia/suc/suc.html Stockholm Umeå Corpus] (Tagged Corpus, freely available for research purposes) | ||
Latest revision as of 04:38, 29 June 2020
Machine translation systems
Free software
- Apertium Pre-alpha Swedish<->Danish material is available from CVS.
Proprietary
Morphology
Free software
- UralicNLP is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many languages including Swedish
Lexical resources
Free software
- Den stora svenska ordlistan, 8.8 MB plaintext, License: CC-BY-SA
Proprietary
- Talbanken05 (Dependency Treebank, freely available for research purposes)
Corpora
Free
- Europarl corpus, sentence aligned with English
- Swedish plain text and Co-occurrences at LCC
Proprietary
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- Stockholm Umeå Corpus (Tagged Corpus, freely available for research purposes)