Difference between revisions of "Multilingual Tools and Software"

From ACL Wiki
Jump to navigation Jump to search
 
(2 intermediate revisions by 2 users not shown)
Line 3: Line 3:
  
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
*[http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html#Spanish-dicts Dictionaries for International Ispell]
+
;[https://code.google.com/p/csniper/ CSniper]
*[http://heartofgold.dfki.de Heart of Gold] - XML-based middleware for the integration of (deep and shallow) NLP components
+
:A search-based annotation tool to help distributed annotation teams finding infrequent linguistic phenomena in large corpora
*[http://www.ukp.tu-darmstadt.de/software/jwktl/ JWKTL] - Java Wiktionary Library
+
;[http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html Dictionaries for International Ispell]
*[http://nlp.stanford.edu/kirrkirr/ Kirrkirr 4.0 Dictionary Program] - software for the exploration of indigenous language dictionaries
+
:Dictionaries and affix files for various languages
*[http://www.lpl.univ-aix.fr/projects/multext/MtRecode/ MtRecode] - Character conversion program
+
;[https://code.google.com/p/dkpro-core-asl/ DKPro Core]
*[http://www.lpl.univ-aix.fr/projects/multext/MtScript/ MtScript] - The Multext multi-lingual text editor
+
:A collection of software components for NLP based on the Apache UIMA framework
*[http://bach.arts.kuleuven.ac.be/~piet/fr_nlp.html Natural Language Processing for French]
+
;[https://code.google.com/p/dkpro-lab/ DKPro Lab]
* [http://www.computing.dcu.ie/~ygraham/software.html RIA Open Source Rule Induction Tool] is a tool for automatic induction of transfer rules for Transfer-Based Statistical Machine Translation using dependency structures ([[LFG]] f-structures). RIA is licensed under the [[LGPL]].
+
:A lightweight framework for parameter sweeping experiments
*[http://sprout.dfki.de SProUT] - Shallow Processing with Unification and Typed Feature Structures
+
;[https://code.google.com/p/dkpro-lsr/ DKPro LSR]
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/ TIGERSearch] - tools for linguistic text exploration
+
:A unified API for several lexical-semantic resources, including GermaNet, OpenThesaurus, Wikipedia, Wiktionary, and WordNet
*[http://beta.visl.sdu.dk/cg3.html vislcg3] is a GPL-licensed tool that parses [[Constraint Grammar]] rules, commonly used for rule-based morphological disambiguation, syntactic function labelling and dependency annotation.
+
;[https://code.google.com/p/dkpro-similarity-asl/ DKPro Similarity]
 +
:An open source software package for developing text similarity algorithms
 +
;[https://code.google.com/p/dkpro-spelling-asl/ DKPro Spelling]
 +
:A collection of software components for spelling correction, especially for correcting real-word spelling errors
 +
;[https://code.google.com/p/dkpro-statistics/ DKPro Statistics]
 +
:A collection of statistical tools, currently including correlation and inter-rater agreement methods
 +
;[https://code.google.com/p/dkpro-tc/ DKPro Text Classification]
 +
:A UIMA-based text classification framework
 +
;[https://code.google.com/p/dkpro-wsd/ DKPro WSD]
 +
:A  modular, extensible Java framework for word sense disambiguation
 +
;[http://heartofgold.dfki.de Heart of Gold]
 +
:XML-based middleware for the integration of (deep and shallow) NLP components
 +
;[http://www.ukp.tu-darmstadt.de/software/jobimtext/ JoBimText]
 +
:A software solution for automatic text expansion using contextualized distributional similarity
 +
;[https://code.google.com/p/jowkl/ JOWKL]
 +
:A Java-based API for OmegaWiki
 +
;[https://code.google.com/p/jwktl/ JWKTL]
 +
:A Java API for the free multilingual online dictionary Wiktionary
 +
;[https://code.google.com/p/jwpl/ JWPL]
 +
:A Java API for Wikipedia
 +
;[http://nlp.stanford.edu/kirrkirr/ Kirrkirr 4.0 Dictionary Program]
 +
:Software for the exploration of indigenous language dictionaries
 +
;[https://sites.google.com/site/morfetteweb/ Morfette]
 +
:A tool for supervised learning of inflectional morphology
 +
;[http://www.lpl.univ-aix.fr/projects/multext/MtRecode/ MtRecode]
 +
:Character conversion program
 +
;[http://www.lpl.univ-aix.fr/projects/multext/MtScript/ MtScript]
 +
:The Multext multi-lingual text editor
 +
;[http://www.computing.dcu.ie/~ygraham/software.html RIA Open Source Rule Induction Tool]
 +
:A tool for automatic induction of transfer rules for Transfer-Based Statistical Machine Translation using dependency structures ([[LFG]] f-structures)
 +
;[http://sprout.dfki.de SProUT]
 +
:Shallow Processing with Unification and Typed Feature Structures
 +
;[http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/ TIGERSearch]
 +
:Tools for linguistic text exploration
 +
;[https://code.google.com/p/uby/ UBY]
 +
:A network of lexical resources interlinked at the sense level and a project on semantic integration of lexical resources for NLP applications
 +
;[https://github.com/mikahama/uralicNLP UralicNLP]
 +
:A Python library providing lemmatization, morphological tagging and generation, and disambiguation in many Uralic languages (Finnish, Skolt Sami, Erzya...) and a growing number of non-Uralic languages (Arabic, Swedish, Russian...)
 +
;[https://code.google.com/p/webanno/ WebAnno]
 +
:A general purpose web-based annotation tool for a wide range of linguistic annotations.
 +
;[http://beta.visl.sdu.dk/cg3.html vislcg3]
 +
:A tool that parses [[Constraint Grammar]] rules, commonly used for rule-based morphological disambiguation, syntactic function labelling and dependency annotation
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
 
 
  
 
See also [[Multilingual resources]].
 
See also [[Multilingual resources]].

Latest revision as of 04:43, 29 June 2020

For individual languages, see List of resources by language.


CSniper
A search-based annotation tool to help distributed annotation teams finding infrequent linguistic phenomena in large corpora
Dictionaries for International Ispell
Dictionaries and affix files for various languages
DKPro Core
A collection of software components for NLP based on the Apache UIMA framework
DKPro Lab
A lightweight framework for parameter sweeping experiments
DKPro LSR
A unified API for several lexical-semantic resources, including GermaNet, OpenThesaurus, Wikipedia, Wiktionary, and WordNet
DKPro Similarity
An open source software package for developing text similarity algorithms
DKPro Spelling
A collection of software components for spelling correction, especially for correcting real-word spelling errors
DKPro Statistics
A collection of statistical tools, currently including correlation and inter-rater agreement methods
DKPro Text Classification
A UIMA-based text classification framework
DKPro WSD
A modular, extensible Java framework for word sense disambiguation
Heart of Gold
XML-based middleware for the integration of (deep and shallow) NLP components
JoBimText
A software solution for automatic text expansion using contextualized distributional similarity
JOWKL
A Java-based API for OmegaWiki
JWKTL
A Java API for the free multilingual online dictionary Wiktionary
JWPL
A Java API for Wikipedia
Kirrkirr 4.0 Dictionary Program
Software for the exploration of indigenous language dictionaries
Morfette
A tool for supervised learning of inflectional morphology
MtRecode
Character conversion program
MtScript
The Multext multi-lingual text editor
RIA Open Source Rule Induction Tool
A tool for automatic induction of transfer rules for Transfer-Based Statistical Machine Translation using dependency structures (LFG f-structures)
SProUT
Shallow Processing with Unification and Typed Feature Structures
TIGERSearch
Tools for linguistic text exploration
UBY
A network of lexical resources interlinked at the sense level and a project on semantic integration of lexical resources for NLP applications
UralicNLP
A Python library providing lemmatization, morphological tagging and generation, and disambiguation in many Uralic languages (Finnish, Skolt Sami, Erzya...) and a growing number of non-Uralic languages (Arabic, Swedish, Russian...)
WebAnno
A general purpose web-based annotation tool for a wide range of linguistic annotations.
vislcg3
A tool that parses Constraint Grammar rules, commonly used for rule-based morphological disambiguation, syntactic function labelling and dependency annotation

See also Multilingual resources.