Multilingual Tools and Software
Revision as of 09:47, 14 October 2013 by Tristan Miller (talk | contribs) (Remove dead/uniligual links; add several more tools; update links; reformat and sort list)
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.
For individual languages, see List of resources by language.
- CSniper
- A search-based annotation tool to help distributed annotation teams finding infrequent linguistic phenomena in large corpora
- Dictionaries for International Ispell
- Dictionaries and affix files for various languages
- DKPro Core
- A collection of software components for NLP based on the Apache UIMA framework
- DKPro Lab
- A lightweight framework for parameter sweeping experiments
- DKPro LSR
- A unified API for several lexical-semantic resources, including GermaNet, OpenThesaurus, Wikipedia, Wiktionary, and WordNet
- DKPro Similarity
- An open source software package for developing text similarity algorithms
- DKPro Spelling
- A collection of software components for spelling correction, especially for correcting real-word spelling errors
- DKPro Statistics
- A collection of statistical tools, currently including correlation and inter-rater agreement methods
- DKPro Text Classification
- A UIMA-based text classification framework
- DKPro WSD
- A modular, extensible Java framework for word sense disambiguation
- Heart of Gold
- XML-based middleware for the integration of (deep and shallow) NLP components
- JoBimText
- A software solution for automatic text expansion using contextualized distributional similarity
- JOWKL
- A Java-based API for OmegaWiki
- JWKTL
- A Java API for the free multilingual online dictionary Wiktionary
- JWPL
- A Java API for Wikipedia
- Kirrkirr 4.0 Dictionary Program
- Software for the exploration of indigenous language dictionaries
- Morfette
- A tool for supervised learning of inflectional morphology
- MtRecode
- Character conversion program
- MtScript
- The Multext multi-lingual text editor
- RIA Open Source Rule Induction Tool
- A tool for automatic induction of transfer rules for Transfer-Based Statistical Machine Translation using dependency structures (LFG f-structures)
- SProUT
- Shallow Processing with Unification and Typed Feature Structures
- TIGERSearch
- Tools for linguistic text exploration
- UBY
- A network of lexical resources interlinked at the sense level and a project on semantic integration of lexical resources for NLP applications
- WebAnno
- A general purpose web-based annotation tool for a wide range of linguistic annotations.
- vislcg3
- A tool that parses Constraint Grammar rules, commonly used for rule-based morphological disambiguation, syntactic function labelling and dependency annotation
See also Multilingual resources.