Difference between revisions of "Resources for Icelandic"

From ACL Wiki
Jump to navigation Jump to search
 
(9 intermediate revisions by 3 users not shown)
Line 1: Line 1:
==Morphological analysis==
+
==Language Processing Toolkit==
  
 
===Free software===
 
===Free software===
  
* [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/incubator/apertium-fo-is.is.dix Icelandic analyser] for [http://wiki.apertium.org/wiki/lttoolbox lttoolbox] (~2,300 lemmata, ~95,000 surface forms) -- GPL
+
* [http://sourceforge.net/projects/icenlp/ IceNLP] (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL
 +
 
 +
==Morphological analysis==
  
===Proprietary software===
+
===Free software===
  
* [http://nlp.ru.is/projects.htm IceNLP]
+
* [http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/incubator/apertium-fo-is.is.dix Icelandic analyser] for [[lttoolbox]] (~4,300 lemmata, ~143,000 surface forms) -- GPL (online [http://xixona.dlsi.ua.es/~fran/icelandic here])
  
 
==Corpora==
 
==Corpora==
 +
===Free software===
 +
* [http://linguist.is/icelandic_treebank/Download IcePaHC] - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). [[LGPL]] license.
  
==Proprietary software==
+
===Proprietary software===
  
 
* [http://corpora.informatik.uni-leipzig.de/ Icelandic plain text and Co-occurrences at LCC]
 
* [http://corpora.informatik.uni-leipzig.de/ Icelandic plain text and Co-occurrences at LCC]

Latest revision as of 00:09, 15 April 2011

Language Processing Toolkit

Free software

  • IceNLP (Part-of-Speech tagger, Shallow Parser, Lemmatizer, Segmentizer, Tokenizer, etc.) -- LGPL

Morphological analysis

Free software

Corpora

Free software

  • IcePaHC - the Icelandic Parsed Historical Corpus. 440000 words (12th-19th century texts, phrase structure + PoS + lemma annotation). LGPL license.

Proprietary software