Multilingual Tools and Software

From ACL Wiki
Jump to navigation Jump to search

For individual languages, see List of resources by language.


CSniper
A search-based annotation tool to help distributed annotation teams finding infrequent linguistic phenomena in large corpora
Dictionaries for International Ispell
Dictionaries and affix files for various languages
DKPro Core
A collection of software components for NLP based on the Apache UIMA framework
DKPro Lab
A lightweight framework for parameter sweeping experiments
DKPro LSR
A unified API for several lexical-semantic resources, including GermaNet, OpenThesaurus, Wikipedia, Wiktionary, and WordNet
DKPro Similarity
An open source software package for developing text similarity algorithms
DKPro Spelling
A collection of software components for spelling correction, especially for correcting real-word spelling errors
DKPro Statistics
A collection of statistical tools, currently including correlation and inter-rater agreement methods
DKPro Text Classification
A UIMA-based text classification framework
DKPro WSD
A modular, extensible Java framework for word sense disambiguation
Heart of Gold
XML-based middleware for the integration of (deep and shallow) NLP components
JoBimText
A software solution for automatic text expansion using contextualized distributional similarity
JOWKL
A Java-based API for OmegaWiki
JWKTL
A Java API for the free multilingual online dictionary Wiktionary
JWPL
A Java API for Wikipedia
Kirrkirr 4.0 Dictionary Program
Software for the exploration of indigenous language dictionaries
Morfette
A tool for supervised learning of inflectional morphology
MtRecode
Character conversion program
MtScript
The Multext multi-lingual text editor
RIA Open Source Rule Induction Tool
A tool for automatic induction of transfer rules for Transfer-Based Statistical Machine Translation using dependency structures (LFG f-structures)
SProUT
Shallow Processing with Unification and Typed Feature Structures
TIGERSearch
Tools for linguistic text exploration
UBY
A network of lexical resources interlinked at the sense level and a project on semantic integration of lexical resources for NLP applications
UralicNLP
A Python library providing lemmatization, morphological tagging and generation, and disambiguation in many Uralic languages (Finnish, Skolt Sami, Erzya...) and a growing number of non-Uralic languages (Arabic, Swedish, Russian...)
WebAnno
A general purpose web-based annotation tool for a wide range of linguistic annotations.
vislcg3
A tool that parses Constraint Grammar rules, commonly used for rule-based morphological disambiguation, syntactic function labelling and dependency annotation

See also Multilingual resources.