Difference between revisions of "Resources for Icelandic"
Jump to navigation
Jump to search
(4 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
− | == | + | ==Language Processing Toolkit== |
===Free software=== | ===Free software=== | ||
* [http://sourceforge.net/projects/icenlp/ IceNLP] (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL | * [http://sourceforge.net/projects/icenlp/ IceNLP] (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL | ||
− | |||
− | == | + | ==Morphological analysis== |
− | * [http:// | + | ===Free software=== |
+ | |||
+ | * [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/incubator/apertium-fo-is.is.dix Icelandic analyser] for [[lttoolbox]] (~4,300 lemmata, ~143,000 surface forms) -- GPL (online [http://xixona.dlsi.ua.es/~fran/icelandic here]) | ||
==Corpora== | ==Corpora== | ||
+ | ===Free software=== | ||
+ | * [http://linguist.is/icelandic_treebank/Download IcePaHC] - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). [[LGPL]] license. | ||
===Proprietary software=== | ===Proprietary software=== |
Latest revision as of 00:09, 15 April 2011
Language Processing Toolkit
Free software
- IceNLP (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL
Morphological analysis
Free software
- Icelandic analyser for lttoolbox (~4,300 lemmata, ~143,000 surface forms) -- GPL (online here)
Corpora
Free software
- IcePaHC - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). LGPL license.