Difference between revisions of "Morphology software for English"
Jump to navigation
Jump to search
(→Free software: move the Morph It link to the Resources for Italian page) |
(→Proprietary software: Move Flemm to Resources for French -- it is GPL license, btw, not proprietrary) |
||
Line 17: | Line 17: | ||
*[http://www.ru.nl/celex/ CELEX database] - Dutch, English, and German word forms | *[http://www.ru.nl/celex/ CELEX database] - Dutch, English, and German word forms | ||
− | + | ||
*[http://www.cs.cmu.edu/afs/cs.cmu.edu/project/ai-repository/ai/areas/nlp/morph/fonol/0.html FONOL] - Phonological Programming Language (non-commercial only) | *[http://www.cs.cmu.edu/afs/cs.cmu.edu/project/ai-repository/ai/areas/nlp/morph/fonol/0.html FONOL] - Phonological Programming Language (non-commercial only) | ||
*[http://services.canoo.com/MorphologyBrowser.html German Morphology Browser] | *[http://services.canoo.com/MorphologyBrowser.html German Morphology Browser] |
Revision as of 17:48, 12 November 2008
Software - Morphology and part of speech tagging
For languages other than English, see List of resources by language.
Morphology
Free software
- Catvar 2.0 - The Categorial Variation Database for English (OSL)
- lttoolbox -- lexical processing tools for building morphological analysers/generators with XML specification files (GPL)
- SFST - Stuttgart Finite State Transducer Tools (GPL)
Proprietary software
- CELEX database - Dutch, English, and German word forms
- FONOL - Phonological Programming Language (non-commercial only)
- German Morphology Browser
- Hebrew Morphological Parser
- MAP - Cambridge/Edinburgh Morphological Analyzer and Dictionary System (freeware)
- MORLEX - A lexical database for French
- morpha and morphg - fast and robust morphological analysis and generation for English, from John A. Carroll (non-commercial only)
- MORFOGEN - a Morphology Grammar Builder and Dictionary Interface Tool
- NOMLEX - a dictionary of English nominalizations
- PC-KIMMO - a Two-level Processor for Morphological Analysis, including KGEN, KTEXT, and Englex
- TULIP - a two level phonological formalism
- Xerox/PARC - finite-state morphological analysis/generation using xfst, lexc, twolc
Part of speech tagging
Free software
- ACOPOST - A Collection Of PoS Taggers Maximum Entropy Tagger, Trigram Tagger, Transformation-based Tagger, Example-based tagger
- Tagger training on the Apertium Wiki (HMM + constraint based)
- NLTK - Natural Language Toolkit Regexp Tagger, N-Gram Tagger, Brill Tagger, HMM Tagger, plus a freely downloadable book with a chapter on tagging
- RelEx - provides English-language part-of-speech tagging, entity tagging, as well as other types of tags (gender, date, money ...), after performing a deep parse, so that tags agree with parse. Also provides resulting stems. Apache 2.0 License.
- Spejd - Shallow Parsing and Disambiguation Engine a GPL tool for simultaneous rule-based morphosyntactic disambiguation and partial parsing
- VISL Constraint Grammar rule based disambiguation (GPL)
Proprietary software
Combined morphology and tagging
Free software
- XTAG - tools for parsing and grammar development, including morphological analysis and tagging, as described in XTAG System - A Wide Coverage Grammar for English and A Freely Available Wide Coverage Morphological Analyzer for English
Proprietary software
- Korean morphological analyzer and part-of-speech tagger
- NEUCSP - a tool for Chinese Word Segmentation and POS tagging