Difference between revisions of "Uncategorized software"
Jump to navigation
Jump to search
m (Software - Uncategorized moved to Uncategorized software: More orthodox naming) |
(remove entries already categorized) |
||
(43 intermediate revisions by 13 users not shown) | |||
Line 1: | Line 1: | ||
− | '''[[Software]] - Uncategorized and miscellaneous | + | <div class="usermessage"> |
+ | * Please ''do not add anything new to this list''; we would like to eventually eliminate this list. | ||
+ | * Please help us by moving links into categories; add new categories to [[Tools and Software]] if needed. | ||
+ | * Please add new items to the [[List of resources by language]] where appropriate. | ||
+ | </div> | ||
+ | |||
+ | ==[[Software]] - Uncategorized and miscellaneous== | ||
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/allen/0.html Code from James Allen's "Natural Language Understanding" (code at CMU)] | *[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/allen/0.html Code from James Allen's "Natural Language Understanding" (code at CMU)] | ||
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/nlp_pp/0.html Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)] | *[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/nlp_pp/0.html Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)] | ||
*[http://www.dtreg.com/ DTREG decision tree generator] | *[http://www.dtreg.com/ DTREG decision tree generator] | ||
− | |||
− | |||
*[http://homepage.mac.com/bncweb/home.html BNCweb: A Web-Based Interface to the British National Corpus] | *[http://homepage.mac.com/bncweb/home.html BNCweb: A Web-Based Interface to the British National Corpus] | ||
*[http://nlg18.csie.ntu.edu.tw:8080/opinion/index.html Chinese sentiment dictionary NTUSD] | *[http://nlg18.csie.ntu.edu.tw:8080/opinion/index.html Chinese sentiment dictionary NTUSD] | ||
Line 14: | Line 18: | ||
*[http://www.comp.nus.edu.sg/~qiul/NLPTools/JavaRAP.html JavaRAP] | *[http://www.comp.nus.edu.sg/~qiul/NLPTools/JavaRAP.html JavaRAP] | ||
*[https://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library)] | *[https://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library)] | ||
+ | *[http://www.linguastream.org LinguaStream] | ||
*[http://xlex.uni-muenster.de/ MTP Xlex/www] | *[http://xlex.uni-muenster.de/ MTP Xlex/www] | ||
*[http://www.langsoft.ch Natural Language Processing software] | *[http://www.langsoft.ch Natural Language Processing software] | ||
− | *[http://www. | + | *http://opennlp.sf.net OpenNLP] |
+ | *[http://www.dcs.shef.ac.uk/~francois/personality/recognizer.html Personality Recognizer from Text] | ||
*[http://www.nzdl.org/ELKB/ Roget's Thesaurus as an Electronic Lexical Knowledge Base] | *[http://www.nzdl.org/ELKB/ Roget's Thesaurus as an Electronic Lexical Knowledge Base] | ||
*[http://www.chass.utoronto.ca/tact/ Text Analysis Computing Tools (TACT)] | *[http://www.chass.utoronto.ca/tact/ Text Analysis Computing Tools (TACT)] | ||
Line 22: | Line 28: | ||
*[http://igm.univ-mlv.fr/~unitex/ Unitex] | *[http://igm.univ-mlv.fr/~unitex/ Unitex] | ||
*[http://www.mith2.umd.edu/products/ver-mach/ Versioning Machine 2.0] | *[http://www.mith2.umd.edu/products/ver-mach/ Versioning Machine 2.0] | ||
− | *[http://www. | + | |
− | *[http:// | + | ==Applications== |
− | *[http:// | + | <!-- Please keep this list in alphabetical order --> |
+ | |||
+ | |||
+ | *[http://webdeptos.uma.es/filifa/personal/amoreno/indexer/ BNC Indexer] | ||
+ | *[http://www.brainhat.com/ Brainhat Natural Language Processing] | ||
+ | *[http://www.chilibot.net/ Chilibot: NLP based miner for gene/protein/keyword relationships] | ||
+ | *[http://www.bultreebank.org/clark CLaRK System] | ||
+ | *[http://delphesintl.com/ Delphes Technologies International] | ||
+ | *[http://www.dtreg.com DTREG 2.0 decision trees with TreeBoost] | ||
+ | *[http://www.dfki.de/lt/registry/apps/korek21.html KOREKTOR 2.0 (at the DFKI NLP archive)] | ||
+ | *[http://www.xs4all.nl/~bsarempt/linguistics/index.html KURA 1.0] | ||
+ | *[http://ngram.sourceforge.net Ngram Statistics Package], identify collocations | ||
+ | *[http://www.ccs.neu.edu/home/futrelle/bionlp/commercial/opus.html Opus, a commercial biology text mining system] | ||
+ | *[http://sourceforge.net/projects/pytalk/ Project: Pytalk] | ||
+ | *[http://www.wagsoft.com/RSTTool/ Release of RSTTool: RSTTool 2.7] | ||
+ | *[http://senseclusters.sourceforge.net SenseClusters], cluster similar contexts | ||
+ | *[http://www.softissimo.com/ SOFTISSIMO] | ||
+ | *[http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html TreeTagger] | ||
+ | |||
+ | ==Tools== | ||
+ | |||
+ | *[http://www.ldc.upenn.edu/Projects/ACE/Tools/ Automatic Content Extraction (ACE): Annotation Tools] | ||
+ | *[http://www.norvig.com/paip/grammar.lisp a simple grammar of English] | ||
+ | *[http://sourceforge.net/projects/acopost/ ACOPOST] | ||
+ | *[http://acdc.linguateca.pt/example_alignment.html Alignment of bilingual corpora performed with EasyAlign] | ||
+ | *[http://www.lsi.upc.es/~lambert/software/AlignmentSet.html Alignment Set Toolkit] | ||
+ | *[http://lucene.apache.org/java/docs/ Apache Lucene] | ||
+ | *[http://www.arabeyes.org/ Arabeyes Project] | ||
+ | *[http://misshoover.si.umich.edu/~zzheng/sentence/ Automatic English Sentence Segmentation] | ||
+ | *[http://www.clg.wlv.ac.uk/projects/CAST/demos.php Automatic Summarization Demos] | ||
+ | *[http://www.r.dl.itc.u-tokyo.ac.jp/~nakagawa/resource/termext/atr-e.html Automatic Term Extraction System] | ||
+ | *[http://lael.pucsp.br/corpora/ Bancos de dados e Ferramentas de an`alise] | ||
+ | *[http://www.ai.mit.edu/~murphyk/Software/BNT/bnt.html Bayes Net Toolbox for Matlab] | ||
+ | *[http://bndev.sourceforge.net/ Bayesian Network tools in Java (BNJ)] | ||
+ | *[http://sslmit.unibo.it/~baroni/bootcat.html BootCaT: Simple Utilities to Bootstrap Corpora and Terms from the Web] | ||
+ | *[http://callisto.mitre.org/ Callisto Annotation Tool] | ||
+ | *[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2005T13 CCGBank] | ||
+ | *[http://lael.pucsp.br/corpora/alinhador/ CEPRIL aligner] | ||
+ | *[http://pie.usna.edu/explorec.html Chargrams Database from British National Corpus] | ||
+ | *[http://www.bultreebank.org/clark/index.html CLaRK System] | ||
+ | *[http://dlt4.mit.edu/~dr/COALS/ COALS: Correlated Occurrence Analogue to Lexical Semantics] | ||
+ | *[ftp://cs.nyu.edu/pub/html/comlex.html/ Comlex] | ||
+ | *[http://www.ai.mit.edu/projects/iiip/doc/cl-http/home-page.html Common Lisp Hypermedia Server] | ||
+ | *[http://www.cpan.org/ Comprehensive Perl Archive Network] | ||
+ | *[http://clg.wlv.ac.uk/projects/CAST/ Computer Aided Summarisation Tool (CAST)] | ||
+ | *[http://infomap.stanford.edu/webdemo Concept Search Engine Information Mapping Demo (Center for the Study of Language and Information, Stanford University)] | ||
+ | *[https://sourceforge.net/projects/concollate/ Concollate] | ||
+ | *[http://borel.slu.edu/crubadan/ Corpus building for minority languages] | ||
+ | *[http://montev.isi.edu:8000/align-tool/?CORPUS=de-news-morphix&AFILE=full-model1-50-50.gz Corpus De-News-Morphix Alignment Tool] | ||
+ | *[http://search.cpan.org/dist/SuffixTree/ CPAN Suffix Tree Module] | ||
+ | *[http://www.ucl.ac.uk/english-usage/diachronic/index.htm Creating a Parsed and Searchable Diachronic Corpus of Present-Day Spoken English] | ||
+ | *[http://www.cis.upenn.edu/~dbikel/software.html#wn Dan Bikel's Java WordNet Library] | ||
+ | *[http://www.dataharmony.com/ Data Harmony, Document Management Software] | ||
+ | *[http://www.cs.ualberta.ca/~lindek/demos.htm Demos of dependency database, parser, and other tools] | ||
+ | *[http://fuzzy.cs.uni-magdeburg.de/~borgelt/dtree.html Dtree - Decision and Regression Tree Induction] | ||
+ | *[http://www.foreignword.com/dictionary/truespel/transpel.htm English-Truespel (USA Accent) Text Conversion Tool] | ||
+ | *[http://www.cs.jhu.edu/~brill/ Eric Brill's Part of Speech Tagger] | ||
+ | *[http://odur.let.rug.nl/~vannoord/Fsa/Manual/node1.html Finite State Automata Utilities v6] | ||
+ | *[http://www.jaist.ac.jp/~hieuxuan/flexcrfs/flexcrfs.html FlexCRFs: Flexible Conditional Random Fields] | ||
+ | *[http://garraf.epsevg.upc.es/freeling/ FreeLing 1.2] | ||
+ | *[http://grid.let.rug.nl/~vannoord/Fsa/fsa.html FSA6.2xx: Finite State Automata Utilities] | ||
+ | *[http://gate.ac.uk/ GATE (General Architecture for Text Engineering)] | ||
+ | *[http://www.clsp.jhu.edu/ws2005/groups/statistical/GenPar.html GenPar Toolkit for Generalized Parsing] | ||
+ | *[http://www.parc.xerox.com/istl/groups/nltt/medley/ Grammar Writer's Workbench for Lexical Functional Grammar] | ||
+ | *[http://heartofgold.dfki.de Heart of Gold - XML-based middleware for the integration of (deep and shallow) NLP components] | ||
+ | *[http://htk.eng.cam.ac.uk Hidden Markov Model Toolkit] | ||
+ | *[http://www.ida.liu.se/~nlplab/ILink/ I*Link] | ||
+ | *[http://www.kbsim.com/ifind.html iFind KBSim.com - Knowledge-Based Simulations, Inc.] | ||
+ | *[http://www.infogistics.com/posdemo.htm Infogistics: NLProcessor Interactive Demo] | ||
+ | *[http://www.isi.edu/~marcu/software.html ISI's version of the RSTTool] | ||
+ | *[http://www-2.cs.cmu.edu/~javabayes/Home/ JavaBayes - v0.346] | ||
+ | *[http://www.dcs.shef.ac.uk/~francois/jmrc/index.html jMRC - MRC Psycholinguistic Database Java Interface] | ||
+ | *[http://www.comp.leeds.ac.uk/andyr/software/jTokeniser/ jTokeniser] | ||
+ | *[http://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library)] | ||
+ | *[http://sslmit.unibo.it/%7ebaroni/welcome_to_knorpora.html Knorpora 1.0] | ||
+ | *[http://miniappolis.com/KWiCFinder/KWiCFinderHome.html KWiCFinder] | ||
+ | *[http://www.kwicfinder.com/KWiCFinder.html Kwicfinder] | ||
+ | *[http://odur.let.rug.nl/~vannoord/TextCat/competitors.html Language Identification Tools] | ||
+ | *[http://www-2.cs.cmu.edu/~lemur/download.html Lemur Toolkit Download] | ||
+ | *[http://www.lemurproject.org/ Lemur Toolkit Website] | ||
+ | *[http://www.leximancer.com/ Leximancer] | ||
+ | *[http://www.csie.ntu.edu.tw/~cjlin/libsvm/ LIBSVM: A Library for Support Vector Machines] | ||
+ | *[http://search.cpan.org/~lgoddard/Lingua-Syllable-0.03/Syllable.pm Lingua-Syllable] | ||
+ | *[http://listserv.linguistlist.org/cgi-bin/wa?A2=ind0109&L=corpora&P=R729 list of POS taggers] | ||
+ | *[http://ucrel.lancs.ac.uk/llwizard.html Log-likelihood calculator] | ||
+ | *[ftp://ftp.ncbi.nlm.nih.gov/pub/lsmith/MedPost/medpost.tar.gz MedPost: A Part-of-Speech Tagger for BioMedical text] | ||
+ | *[http://www.lexically.net/wordsmith/version4/index.htm Mike Scott's Web - Wordsmith Tools] | ||
+ | *[http://mmax.eml-research.de MMAX Annotation Tool] | ||
+ | *[http://www.dcs.shef.ac.uk/research/ilash/Moby/ Moby Database] | ||
+ | *[http://search.cpan.org/author/SHLOMOY/Lingua-EN-Sentence-0.25/lib/Lingua/EN/Sentence.pm Module for splitting text into sentences] | ||
+ | *[http://www.cs.berkeley.edu/~aiken/moss.html Moss: A System for Detecting Software Plagiarism] | ||
+ | *[http://www.natlantech.com/lingbench_ide.html Natlanco] | ||
+ | *[http://www.cs.jhu.edu/~brill/code.html Natural Language Processing Systems] | ||
+ | *[http://www.ltg.ed.ac.uk/NITE/ NITE XML Toolkit] | ||
+ | *[http://nltk.sourceforge.net NLTK - Natural Language Toolkit] | ||
+ | *[http://crl.nmsu.edu/Tools/Software/ NMSU Natural Language Processing Tools] | ||
+ | *[http://annotation.semanticweb.org/ontomat/index.html Ontomat Homepage] | ||
+ | *[http://teach-computers.org/word-expert.html Open Mind] | ||
+ | *[http://davinci.cs.ucdavis.edu/ OpenRCT Home] | ||
+ | *[http://www.oriel.org/homonym.htm ORIEL -- Online Research Information Environment for the Life Sciences] | ||
+ | *[http://clg.wlv.ac.uk/projects/PALinkA/ PALinkA: A Resource Annotation Tool] | ||
+ | *[http://www.sil.org/ PC-KIMMO, Englex, PC-PATR, and PC-PARSE] | ||
+ | *[http://wall.jussieu.fr/dyn/Context2 perl concordancer] | ||
+ | *[http://www.tartarus.org/~martin/PorterStemmer/index.html Porter Stemming Algorithm] | ||
+ | *[http://www.sciences.univ-nantes.fr/info/perso/permanents/enguehard/recherche/CoRRecT/CoRRecT_gb.htm Project CoRRecT: Reference Corpus for the Recognition of Terms] | ||
+ | *[http://protege.stanford.edu/ Protege Project] | ||
+ | *[http://www.lingsoft.fi/cgi-pub/engcg Publically available POS tagger] | ||
+ | *[http://corpus.leeds.ac.uk/query-zh.html Query to Chinese Corpora] | ||
+ | *[http://www-rali.iro.umontreal.ca/Reacc/ Réacc - reaccenting software] | ||
+ | *[http://rdues.uce.ac.uk/acronym.shtml RDUES ACRONYM (Automatic Collocational Retrieval of NYMs) Project] | ||
+ | *[http://www.comp.nus.edu.sg/~rpnlpir/daemonCollins/ README for the daemonized version of Collins' Parser] | ||
+ | *[http://www.research-lab.com/ Research-lab.com] | ||
+ | *[http://www.reitter-it-media.de/compling/index.html RST LaTeX (Reitter IT and Media)] | ||
+ | *[http://herzberg.ca.sandia.gov/jess/index.shtml Rule Engine for the Java Platform] | ||
+ | *[http://elib.cs.berkeley.edu/src/satz/ SATZ--Adaptive Sentence Boundary Detector] | ||
+ | *[http://ixa.si.ehu.es/Ixa/resources/selprefs Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)] | ||
+ | *[http://ilk.uvt.nl/~sabine/chunklink/ Software - The chunklink script, by Sabine Buchholz] | ||
+ | *[http://people.csail.mit.edu/people/mcollins/code.html Software and Data Sets for Collins Natural Language Parser] | ||
+ | *[http://senta.di.ubi.pt Software for the Extraction of N-ary Textual Associations (SENTA)] | ||
+ | *[http://www-a2k.is.tokushima-u.ac.jp/member/kita/NLP/nlp_tools.html Software Tools for NLP] | ||
+ | *[http://sprout.dfki.de SProUT - Shallow Processing with Unification and Typed Feature Structures] | ||
+ | *[http://www-nlp.stanford.edu/software/lex-parser.shtml Stanford Parser] | ||
+ | *[http://www.lsi.upc.edu/%7Enlp/SVMTool/ SVMTool] | ||
+ | *[http://swesum.nada.kth.se/index-eng.html SweSum - Automatic Text Summarizer (with PRM)] | ||
+ | *[http://www.wagsoft.com/Coder/ Systemic Coder -- a Text Markup Tool (Version 4.5)] | ||
+ | *[http://www-2.cs.cmu.edu/~lenzo/t2p/ t2p: Text-to-Phoneme Converter Builder] | ||
+ | *[http://www.d.umn.edu/~tpederse/parallel.html Ted Pedersen - Tools for Parallel Text] | ||
+ | *[http://lsi.research.telcordia.com/ Telcordia Latent Semantic Indexing Demo Machine] | ||
+ | *[http://www.tei-c.org/Software/index.html Text Encoding Initiative --Tools] | ||
+ | *[http://www.comp.lancs.ac.uk/computing/research/ucrel/claws/tagservice.html The CLAWS tagging service] | ||
+ | *[http://www.clsp.jhu.edu/ws99/projects/mt/toolkit/ The EGYPT Statistical Machine Translation Toolkit] | ||
+ | *[http://www.ims.uni-stuttgart.de/CorpusToolbox/ The IMS Corpus Toolbox Webpage] | ||
+ | *[http://jazzy.sourceforge.net/ The Java Open Source Spell Checker] | ||
+ | *[http://www.findingnames.net/ The Naming Company] | ||
+ | *[http://timex2.mitre.org/taggers/timex2_taggers.html TIMEX2 Taggers] | ||
+ | *[http://www.coli.uni-sb.de/~thorsten/tnt/ TnT - Statistical Part-of-Speech Tagger] | ||
+ | *[http://www.cs.columbia.edu/nlp/tools.html Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)] | ||
+ | *[http://www.torch.ch Torch3] | ||
+ | *[http://main.amu.edu.pl/~sipkadan/lingo.htm Turbo Lingo] | ||
+ | *[http://stp.ling.uu.se/cgi-bin/joerg/Uplug Uplug] | ||
+ | *[http://wordlist.sourceforge.net/varcon-readme VarCon (Variant Conversion Info)] | ||
+ | *[http://www.edict.com.hk/concordance/ Virtual Language Centre's Web Concordancer] | ||
+ | *[http://www.textanalysis.com/ VisualText] | ||
+ | *[http://sourceforge.net/projects/wordfreak Wordfreak] | ||
+ | *[http://www.d.umn.edu/~tpederse/wsdshell.html WSD Shell] | ||
+ | *[http://www.xml-ces.org/ XCES: Corpus Encoding Standard for XML] | ||
+ | |||
+ | == WordNet stuff (placeholder) == | ||
+ | |||
+ | * [http://search.cpan.org/dist/WordNet-SenseRelate Word-Net SenseRelate] | ||
+ | * [http://search.cpan.org/dist/WordNet-Similarity Word-Net Similarity] | ||
+ | * [http://www.clres.com/WordNet.html alphabetic version of WordNet 2.0] | ||
+ | * [http://www.ai.mit.edu/~jrennie/WordNet/ Perl interface to WordNet] | ||
+ | |||
+ | == QA systems (placeholder, needs to go somewhere else) == | ||
+ | |||
+ | * [http://www.answerbus.com/index.shtml Answerbus -- Automatic Language Detection Software ] | ||
+ | * [http://demos.inf.ed.ac.uk:8080/qualim/ QuALiM Question Answering System - Searches Wikipedia] | ||
+ | * [http://start.csail.mit.edu/ START Natural Language Question Answering System] | ||
+ | * [http://www.umiacs.umd.edu/~jimmylin/downloads/index.html Aranea Question Answering System] | ||
+ | * [http://webglimpse.net/ Webglimpse] | ||
+ | * [http://www.openchannelsoftware.org/projects/Qanda Qanda: Open source question answering system] | ||
+ | |||
+ | ==Miscellaneous== | ||
+ | |||
+ | * [http://homepage.mac.com/bncweb/home.html BNCweb: A Web-Based Interface to the British National Corpus ] | ||
+ | * [http://lingo.stanford.edu/ CSLI LinGO Lab (Stanford) ] | ||
+ | * [http://nlg18.csie.ntu.edu.tw:8080/opinion/index.html Chinese sentiment dictionary NTUSD ] | ||
+ | * [http://www.athel.com/colloc.html Collocate ] | ||
+ | * [http://www.lsi.upc.es/~nlp/freeling/ FreeLing 1.1 ] | ||
+ | * [http://www.webir.org/resources.html IR and IE on the web ] | ||
+ | * [https://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library) ] | ||
+ | * [http://www.comp.nus.edu.sg/~qiul/NLPTools/JavaRAP.html JavaRAP ] | ||
+ | * [http://xlex.uni-muenster.de/ MTP Xlex/www ] | ||
+ | * [http://www.langsoft.ch Natural Language Processing software ] | ||
+ | * [http://www.dfki.de/lt/registry/draft.html Natural Language Software Registry (at DFKI) ] | ||
+ | * [http://www.nzdl.org/ELKB/ Roget's Thesaurus as an Electronic Lexical Knowledge Base ] | ||
+ | * [http://www.chass.utoronto.ca/tact/ Text Analysis Computing Tools (TACT) ] | ||
+ | * [http://odur.let.rug.nl/~vannoord/TextCat/ TextCat ] | ||
+ | * [http://igm.univ-mlv.fr/~unitex/ Unitex ] | ||
+ | |||
+ | ==See also== | ||
+ | |||
+ | * [[Named entity recognizers]] | ||
[[Category:Software]] | [[Category:Software]] |
Latest revision as of 00:55, 16 October 2013
Software - Uncategorized and miscellaneous
- Code from James Allen's "Natural Language Understanding" (code at CMU)
- Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)
- DTREG decision tree generator
- BNCweb: A Web-Based Interface to the British National Corpus
- Chinese sentiment dictionary NTUSD
- Collocate
- CSLI LinGO Lab (Stanford)
- FreeLing 1.1
- IR and IE on the web
- JavaRAP
- JWNL (Java WordNet Library)
- LinguaStream
- MTP Xlex/www
- Natural Language Processing software
- http://opennlp.sf.net OpenNLP]
- Personality Recognizer from Text
- Roget's Thesaurus as an Electronic Lexical Knowledge Base
- Text Analysis Computing Tools (TACT)
- TextCat
- Unitex
- Versioning Machine 2.0
Applications
- BNC Indexer
- Brainhat Natural Language Processing
- Chilibot: NLP based miner for gene/protein/keyword relationships
- CLaRK System
- Delphes Technologies International
- DTREG 2.0 decision trees with TreeBoost
- KOREKTOR 2.0 (at the DFKI NLP archive)
- KURA 1.0
- Ngram Statistics Package, identify collocations
- Opus, a commercial biology text mining system
- Project: Pytalk
- Release of RSTTool: RSTTool 2.7
- SenseClusters, cluster similar contexts
- SOFTISSIMO
- TreeTagger
Tools
- Automatic Content Extraction (ACE): Annotation Tools
- a simple grammar of English
- ACOPOST
- Alignment of bilingual corpora performed with EasyAlign
- Alignment Set Toolkit
- Apache Lucene
- Arabeyes Project
- Automatic English Sentence Segmentation
- Automatic Summarization Demos
- Automatic Term Extraction System
- Bancos de dados e Ferramentas de an`alise
- Bayes Net Toolbox for Matlab
- Bayesian Network tools in Java (BNJ)
- BootCaT: Simple Utilities to Bootstrap Corpora and Terms from the Web
- Callisto Annotation Tool
- CCGBank
- CEPRIL aligner
- Chargrams Database from British National Corpus
- CLaRK System
- COALS: Correlated Occurrence Analogue to Lexical Semantics
- Comlex
- Common Lisp Hypermedia Server
- Comprehensive Perl Archive Network
- Computer Aided Summarisation Tool (CAST)
- Concept Search Engine Information Mapping Demo (Center for the Study of Language and Information, Stanford University)
- Concollate
- Corpus building for minority languages
- Corpus De-News-Morphix Alignment Tool
- CPAN Suffix Tree Module
- Creating a Parsed and Searchable Diachronic Corpus of Present-Day Spoken English
- Dan Bikel's Java WordNet Library
- Data Harmony, Document Management Software
- Demos of dependency database, parser, and other tools
- Dtree - Decision and Regression Tree Induction
- English-Truespel (USA Accent) Text Conversion Tool
- Eric Brill's Part of Speech Tagger
- Finite State Automata Utilities v6
- FlexCRFs: Flexible Conditional Random Fields
- FreeLing 1.2
- FSA6.2xx: Finite State Automata Utilities
- GATE (General Architecture for Text Engineering)
- GenPar Toolkit for Generalized Parsing
- Grammar Writer's Workbench for Lexical Functional Grammar
- Heart of Gold - XML-based middleware for the integration of (deep and shallow) NLP components
- Hidden Markov Model Toolkit
- I*Link
- iFind KBSim.com - Knowledge-Based Simulations, Inc.
- Infogistics: NLProcessor Interactive Demo
- ISI's version of the RSTTool
- JavaBayes - v0.346
- jMRC - MRC Psycholinguistic Database Java Interface
- jTokeniser
- JWNL (Java WordNet Library)
- Knorpora 1.0
- KWiCFinder
- Kwicfinder
- Language Identification Tools
- Lemur Toolkit Download
- Lemur Toolkit Website
- Leximancer
- LIBSVM: A Library for Support Vector Machines
- Lingua-Syllable
- list of POS taggers
- Log-likelihood calculator
- MedPost: A Part-of-Speech Tagger for BioMedical text
- Mike Scott's Web - Wordsmith Tools
- MMAX Annotation Tool
- Moby Database
- Module for splitting text into sentences
- Moss: A System for Detecting Software Plagiarism
- Natlanco
- Natural Language Processing Systems
- NITE XML Toolkit
- NLTK - Natural Language Toolkit
- NMSU Natural Language Processing Tools
- Ontomat Homepage
- Open Mind
- OpenRCT Home
- ORIEL -- Online Research Information Environment for the Life Sciences
- PALinkA: A Resource Annotation Tool
- PC-KIMMO, Englex, PC-PATR, and PC-PARSE
- perl concordancer
- Porter Stemming Algorithm
- Project CoRRecT: Reference Corpus for the Recognition of Terms
- Protege Project
- Publically available POS tagger
- Query to Chinese Corpora
- Réacc - reaccenting software
- RDUES ACRONYM (Automatic Collocational Retrieval of NYMs) Project
- README for the daemonized version of Collins' Parser
- Research-lab.com
- RST LaTeX (Reitter IT and Media)
- Rule Engine for the Java Platform
- SATZ--Adaptive Sentence Boundary Detector
- Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)
- Software - The chunklink script, by Sabine Buchholz
- Software and Data Sets for Collins Natural Language Parser
- Software for the Extraction of N-ary Textual Associations (SENTA)
- Software Tools for NLP
- SProUT - Shallow Processing with Unification and Typed Feature Structures
- Stanford Parser
- SVMTool
- SweSum - Automatic Text Summarizer (with PRM)
- Systemic Coder -- a Text Markup Tool (Version 4.5)
- t2p: Text-to-Phoneme Converter Builder
- Ted Pedersen - Tools for Parallel Text
- Telcordia Latent Semantic Indexing Demo Machine
- Text Encoding Initiative --Tools
- The CLAWS tagging service
- The EGYPT Statistical Machine Translation Toolkit
- The IMS Corpus Toolbox Webpage
- The Java Open Source Spell Checker
- The Naming Company
- TIMEX2 Taggers
- TnT - Statistical Part-of-Speech Tagger
- Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)
- Torch3
- Turbo Lingo
- Uplug
- VarCon (Variant Conversion Info)
- Virtual Language Centre's Web Concordancer
- VisualText
- Wordfreak
- WSD Shell
- XCES: Corpus Encoding Standard for XML
WordNet stuff (placeholder)
- Word-Net SenseRelate
- Word-Net Similarity
- alphabetic version of WordNet 2.0
- Perl interface to WordNet
QA systems (placeholder, needs to go somewhere else)
- Answerbus -- Automatic Language Detection Software
- QuALiM Question Answering System - Searches Wikipedia
- START Natural Language Question Answering System
- Aranea Question Answering System
- Webglimpse
- Qanda: Open source question answering system
Miscellaneous
- BNCweb: A Web-Based Interface to the British National Corpus
- CSLI LinGO Lab (Stanford)
- Chinese sentiment dictionary NTUSD
- Collocate
- FreeLing 1.1
- IR and IE on the web
- JWNL (Java WordNet Library)
- JavaRAP
- MTP Xlex/www
- Natural Language Processing software
- Natural Language Software Registry (at DFKI)
- Roget's Thesaurus as an Electronic Lexical Knowledge Base
- Text Analysis Computing Tools (TACT)
- TextCat
- Unitex