Difference between revisions of "Resources for Icelandic"
Jump to navigation
Jump to search
Line 12: | Line 12: | ||
==Corpora== | ==Corpora== | ||
+ | ===Free software=== | ||
+ | * [http://linguist.is/icelandic_treebank/Download IcePaHC] - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). [[LGPL]] license. | ||
===Proprietary software=== | ===Proprietary software=== |
Revision as of 23:40, 13 April 2011
Language Processing Toolkit
Free software
- IceNLP (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL
Morphological analysis
Free software
- Icelandic analyser for lttoolbox (~4,300 lemmata, ~143,000 surface forms) -- GPL (online here)
Corpora
Free software
- IcePaHC - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). LGPL license.