Morphology software for English

From ACL Wiki
Revision as of 19:47, 5 February 2009 by Jonsafari (talk | contribs) (+cat)
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Software - Morphology and part of speech tagging

For languages other than English, see List of resources by language.

Morphology

Free software

  • Catvar 2.0 - The Categorial Variation Database for English (OSL)
  • Flemmv3.1 - inflectional morphology parser for French -- perl scripts, GPL license
  • lttoolbox -- lexical processing tools for building morphological analysers/generators with XML specification files (GPL)
  • MAP - Cambridge/Edinburgh Morphological Analyzer and Dictionary System (freeware)
  • Morph-It! version 0.31 - a free morphological resource for the Italian language
  • SFST - Stuttgart Finite State Transducer Tools (GPL)

Proprietary software

  • CELEX database - Dutch, English, and German word forms
  • FONOL - Phonological Programming Language (non-commercial only)
  • German Morphology Browser
  • Hebrew Morphological Parser
  • MORLEX - A lexical database for French
  • morpha and morphg - fast and robust morphological analysis and generation for English, from John A. Carroll (non-commercial only)
  • MORFOGEN - a Morphology Grammar Builder and Dictionary Interface Tool
  • NOMLEX - a dictionary of English nominalizations
  • PC-KIMMO - a Two-level Processor for Morphological Analysis, including KGEN, KTEXT, and Englex
  • TULIP - a two level phonological formalism
  • Xerox/PARC - finite-state morphological analysis/generation using xfst, lexc, twolc

Part of speech tagging

Free software

  • ACOPOST - A Collection Of PoS Taggers Maximum Entropy Tagger, Trigram Tagger, Transformation-based Tagger, Example-based tagger
  • LBJ POS Tagger - Uses averaged perceptron based sequential model. Java API, Free, open source license.
  • NLTK - Natural Language Toolkit Regexp Tagger, N-Gram Tagger, Brill Tagger, HMM Tagger, plus a freely downloadable book with a chapter on tagging
  • RelEx - provides English-language part-of-speech tagging, entity tagging, as well as other types of tags (gender, date, money ...), after performing a deep parse, so that tags agree with parse. Also provides resulting stems. Apache 2.0 License.
  • Spejd - Shallow Parsing and Disambiguation Engine a GPL tool for simultaneous rule-based morphosyntactic disambiguation and partial parsing
  • Tagger training on the Apertium Wiki (HMM + constraint based)
  • VISL Constraint Grammar rule based disambiguation (GPL)

Proprietary software

Combined morphology and tagging

Free software

Proprietary software