Difference between revisions of "Resources for Finnish"

From ACL Wiki
(HamleDT)
(4 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
==Corpora==
 
+
* [http://www.statmt.org/europarl Europarl corpus], sentence-aligned with English
 
* [http://corpora.informatik.uni-leipzig.de/ Finnish plain text and Co-occurrences at LCC]
 
* [http://www.csc.fi/english/research/sciences/linguistics/index_html CSC Kielipankki] Language Bank at the [http://www.csc.fi/ CSC] Scientific Computing Centre, including some 200 million word tokens of Finnish texts.
 
 +
* [http://ufal.mff.cuni.cz/hamledt HamleDT], dependency treebanks of many languages, harmonized into a common annotation style.
 +
 +
==Morphological analysers==
 +
===Free software===
 +
* [https://gna.org/projects/omorfi/ Omorfi] is an Open Morphology for Finnish, associated with the [[voikko]] speller project. See also https://kitwiki.csc.fi/twiki/bin/view/KitWiki/OmorfiHFSTVersion for installing with [[HFST]]. (LGPL/GPL)
 +
 +
 +
[[Category:Resources by language|Finnish]]

Revision as of 09:42, 26 May 2014
