Difference between revisions of "Resources for Icelandic"
Jump to navigation
Jump to search
(9 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
− | == | + | ==Language Processing Toolkit== |
===Free software=== | ===Free software=== | ||
− | * [http:// | + | * [http://sourceforge.net/projects/icenlp/ IceNLP] (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL |
+ | |||
+ | ==Morphological analysis== | ||
− | === | + | ===Free software=== |
− | * [http:// | + | * [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/incubator/apertium-fo-is.is.dix Icelandic analyser] for [[lttoolbox]] (~4,300 lemmata, ~143,000 surface forms) -- GPL (online [http://xixona.dlsi.ua.es/~fran/icelandic here]) |
==Corpora== | ==Corpora== | ||
+ | ===Free software=== | ||
+ | * [http://linguist.is/icelandic_treebank/Download IcePaHC] - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). [[LGPL]] license. | ||
− | ==Proprietary software== | + | ===Proprietary software=== |
* [http://corpora.informatik.uni-leipzig.de/ Icelandic plain text and Co-occurrences at LCC] | * [http://corpora.informatik.uni-leipzig.de/ Icelandic plain text and Co-occurrences at LCC] |
Latest revision as of 00:09, 15 April 2011
Language Processing Toolkit
Free software
- IceNLP (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL
Morphological analysis
Free software
- Icelandic analyser for lttoolbox (~4,300 lemmata, ~143,000 surface forms) -- GPL (online here)
Corpora
Free software
- IcePaHC - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). LGPL license.