Multilingual Tools and Software

From ACL Wiki

For individual languages, see List of resources by language.


CSniper
A search-based annotation tool that helps distributed annotation teams find infrequent linguistic phenomena in large corpora
Dictionaries for International Ispell
Dictionaries and affix files for various languages
DKPro Core
A collection of software components for NLP based on the Apache UIMA framework
DKPro Lab
A lightweight framework for parameter sweeping experiments
DKPro LSR
A unified API for several lexical-semantic resources, including GermaNet, OpenThesaurus, Wikipedia, Wiktionary, and WordNet
DKPro Similarity
An open source software package for developing text similarity algorithms
DKPro Spelling
A collection of software components for spelling correction, especially for correcting real-word spelling errors
DKPro Statistics
A collection of statistical tools, currently including correlation and inter-rater agreement methods
DKPro Text Classification
A UIMA-based text classification framework
DKPro WSD
A modular, extensible Java framework for word sense disambiguation
Heart of Gold
XML-based middleware for the integration of (deep and shallow) NLP components
JoBimText
A software solution for automatic text expansion using contextualized distributional similarity
JOWKL
A Java-based API for OmegaWiki
JWKTL
A Java API for the free multilingual online dictionary Wiktionary
JWPL
A Java API for Wikipedia
Kirrkirr 4.0 Dictionary Program
Software for the exploration of indigenous language dictionaries
Morfette
A tool for supervised learning of inflectional morphology
MtRecode
Character conversion program
MtScript
The Multext multilingual text editor
RIA Open Source Rule Induction Tool
A tool for automatic induction of transfer rules for Transfer-Based Statistical Machine Translation using dependency structures (LFG f-structures)
SProUT
Shallow Processing with Unification and Typed Feature Structures
TIGERSearch
Tools for linguistic text exploration
UBY
A network of lexical resources interlinked at the sense level and a project on semantic integration of lexical resources for NLP applications
UralicNLP
A Python library providing lemmatization, morphological tagging and generation, and disambiguation in many Uralic languages (Finnish, Skolt Sami, Erzya...) and a growing number of non-Uralic languages (Arabic, Swedish, Russian...)
WebAnno
A general-purpose web-based annotation tool for a wide range of linguistic annotations
vislcg3
A tool that parses Constraint Grammar rules, commonly used for rule-based morphological disambiguation, syntactic function labelling and dependency annotation

See also Multilingual resources.