Difference between revisions of "Resources for Swedish"

From ACL Wiki
Jump to navigation Jump to search
 
(3 intermediate revisions by 3 users not shown)
Line 7: Line 7:
 
===Proprietary===
 
===Proprietary===
  
 +
==Morphology==
 +
===Free software===
 +
* [https://github.com/mikahama/uralicNLP UralicNLP] is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many languages including Swedish
  
 
==Lexical resources==
 
==Lexical resources==
 
===Free software===
 
===Free software===
* [http://www.dsso.se/ Den stora svenska ordlistan], 8,8 MB plaintext, License: [[CC-BY-SA]]
+
* [http://www.dsso.se/ Den stora svenska ordlistan], 8.8 MB plaintext, License: [[CC-BY-SA]]
  
 
===Proprietary===
 
===Proprietary===
Line 17: Line 20:
  
 
==Corpora==
 
==Corpora==
 +
===Free===
 +
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 
* [http://corpora.informatik.uni-leipzig.de/ Swedish plain text and Co-occurrences at LCC]
 
* [http://corpora.informatik.uni-leipzig.de/ Swedish plain text and Co-occurrences at LCC]
  
 
===Proprietary===
 
===Proprietary===
 
+
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
* [http://www.ling.su.se/staff/sofia/suc/suc.html Stockholm Umeå Corpus] (Tagged Corpus, freely available for research purposes)
 
* [http://www.ling.su.se/staff/sofia/suc/suc.html Stockholm Umeå Corpus] (Tagged Corpus, freely available for research purposes)
  

Latest revision as of 05:38, 29 June 2020

Machine translation systems

Free software

  • Apertium Pre-alpha Swedish<->Danish material is available from CVS.

Proprietary

Morphology

Free software

  • UralicNLP is a Python library that provides morphological tagging, generation, lemmatization and disambiguation in many languages including Swedish

Lexical resources

Free software

Proprietary

  • Talbanken05 (Dependency Treebank, freely available for research purposes)

Corpora

Free

Proprietary

  • HamleDT, harmonized dependency treebanks of many languages, common annotation style.
  • Stockholm Umeå Corpus (Tagged Corpus, freely available for research purposes)

Bibliography

External links