Difference between revisions of "Uncategorized software"
Jump to navigation
Jump to search
(→Tools) |
(→Tools) |
||
Line 121: | Line 121: | ||
*[http://mmax.eml-research.de MMAX Annotation Tool] | *[http://mmax.eml-research.de MMAX Annotation Tool] | ||
*[http://www.dcs.shef.ac.uk/research/ilash/Moby/ Moby Database] | *[http://www.dcs.shef.ac.uk/research/ilash/Moby/ Moby Database] | ||
− | |||
*[http://search.cpan.org/author/SHLOMOY/Lingua-EN-Sentence-0.25/lib/Lingua/EN/Sentence.pm Module for splitting text into sentences] | *[http://search.cpan.org/author/SHLOMOY/Lingua-EN-Sentence-0.25/lib/Lingua/EN/Sentence.pm Module for splitting text into sentences] | ||
*[http://www.cs.berkeley.edu/~aiken/moss.html Moss: A System for Detecting Software Plagiarism] | *[http://www.cs.berkeley.edu/~aiken/moss.html Moss: A System for Detecting Software Plagiarism] | ||
Line 148: | Line 147: | ||
*[http://herzberg.ca.sandia.gov/jess/index.shtml Rule Engine for the Java Platform] | *[http://herzberg.ca.sandia.gov/jess/index.shtml Rule Engine for the Java Platform] | ||
*[http://elib.cs.berkeley.edu/src/satz/ SATZ--Adaptive Sentence Boundary Detector] | *[http://elib.cs.berkeley.edu/src/satz/ SATZ--Adaptive Sentence Boundary Detector] | ||
+ | *[http://www.thai-sbobet.com sbobet] | ||
*[http://ixa.si.ehu.es/Ixa/resources/selprefs Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)] | *[http://ixa.si.ehu.es/Ixa/resources/selprefs Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)] | ||
*[http://ilk.uvt.nl/~sabine/chunklink/ Software - The chunklink script, by Sabine Buchholz] | *[http://ilk.uvt.nl/~sabine/chunklink/ Software - The chunklink script, by Sabine Buchholz] | ||
Line 155: | Line 155: | ||
*[http://sprout.dfki.de SProUT - Shallow Processing with Unification and Typed Feature Structures] | *[http://sprout.dfki.de SProUT - Shallow Processing with Unification and Typed Feature Structures] | ||
*[http://www-nlp.stanford.edu/software/lex-parser.shtml Stanford Parser] | *[http://www-nlp.stanford.edu/software/lex-parser.shtml Stanford Parser] | ||
+ | *[http://www.thai-sbobet.com sbo] | ||
*[http://www.lsi.upc.edu/%7Enlp/SVMTool/ SVMTool] | *[http://www.lsi.upc.edu/%7Enlp/SVMTool/ SVMTool] | ||
*[http://swesum.nada.kth.se/index-eng.html SweSum - Automatic Text Summarizer (with PRM)] | *[http://swesum.nada.kth.se/index-eng.html SweSum - Automatic Text Summarizer (with PRM)] | ||
Line 162: | Line 163: | ||
*[http://lsi.research.telcordia.com/ Telcordia Latent Semantic Indexing Demo Machine] | *[http://lsi.research.telcordia.com/ Telcordia Latent Semantic Indexing Demo Machine] | ||
*[http://www.tei-c.org/Software/index.html Text Encoding Initiative --Tools] | *[http://www.tei-c.org/Software/index.html Text Encoding Initiative --Tools] | ||
− | |||
*[http://www.comp.lancs.ac.uk/computing/research/ucrel/claws/tagservice.html The CLAWS tagging service] | *[http://www.comp.lancs.ac.uk/computing/research/ucrel/claws/tagservice.html The CLAWS tagging service] | ||
*[http://www.clsp.jhu.edu/ws99/projects/mt/toolkit/ The EGYPT Statistical Machine Translation Toolkit] | *[http://www.clsp.jhu.edu/ws99/projects/mt/toolkit/ The EGYPT Statistical Machine Translation Toolkit] |
Revision as of 03:25, 25 June 2012
Software - Uncategorized and miscellaneous
- Code from James Allen's "Natural Language Understanding" (code at CMU)
- Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)
- DTREG decision tree generator
- BNCweb: A Web-Based Interface to the British National Corpus
- Chinese sentiment dictionary NTUSD
- Collocate
- CSLI LinGO Lab (Stanford)
- FreeLing 1.1
- IR and IE on the web
- JavaRAP
- JWNL (Java WordNet Library)
- JWPL (Java Wikipedia Library)
- LinguaStream
- MTP Xlex/www
- Natural Language Processing software
- http://opennlp.sf.net OpenNLP]
- Personality Recognizer from Text
- Roget's Thesaurus as an Electronic Lexical Knowledge Base
- Text Analysis Computing Tools (TACT)
- TextCat
- Unitex
- Versioning Machine 2.0
Applications
- BNC Indexer
- Brainhat Natural Language Processing
- Chilibot: NLP based miner for gene/protein/keyword relationships
- CLaRK System
- Delphes Technologies International
- DTREG 2.0 decision trees with TreeBoost
- KOREKTOR 2.0 (at the DFKI NLP archive)
- KURA 1.0
- Ngram Statistics Package, identify collocations
- Opus, a commercial biology text mining system
- Project: Pytalk
- Release of RSTTool: RSTTool 2.7
- SenseClusters, cluster similar contexts
- SOFTISSIMO
- TreeTagger
Tools
- Automatic Content Extraction (ACE): Annotation Tools
- a simple grammar of English
- ACOPOST
- Alignment of bilingual corpora performed with EasyAlign
- Alignment Set Toolkit
- Apache Lucene
- Arabeyes Project
- Automatic English Sentence Segmentation
- Automatic Summarization Demos
- Automatic Term Extraction System
- Bancos de dados e Ferramentas de an`alise
- Bayes Net Toolbox for Matlab
- Bayesian Network tools in Java (BNJ)
- BootCaT: Simple Utilities to Bootstrap Corpora and Terms from the Web
- Callisto Annotation Tool
- CCGBank
- CEPRIL aligner
- Chargrams Database from British National Corpus
- CLaRK System
- COALS: Correlated Occurrence Analogue to Lexical Semantics
- Comlex
- Common Lisp Hypermedia Server
- Comprehensive Perl Archive Network
- Computer Aided Summarisation Tool (CAST)
- Concept Search Engine Information Mapping Demo (Center for the Study of Language and Information, Stanford University)
- Concollate
- Corpus building for minority languages
- Corpus De-News-Morphix Alignment Tool
- CPAN Suffix Tree Module
- Creating a Parsed and Searchable Diachronic Corpus of Present-Day Spoken English
- Dan Bikel's Java WordNet Library
- Data Harmony, Document Management Software
- Demos of dependency database, parser, and other tools
- Dtree - Decision and Regression Tree Induction
- English-Truespel (USA Accent) Text Conversion Tool
- Eric Brill's Part of Speech Tagger
- Finite State Automata Utilities v6
- FlexCRFs: Flexible Conditional Random Fields
- FreeLing 1.2
- FSA6.2xx: Finite State Automata Utilities
- GATE (General Architecture for Text Engineering)
- GenPar Toolkit for Generalized Parsing
- Grammar Writer's Workbench for Lexical Functional Grammar
- Heart of Gold - XML-based middleware for the integration of (deep and shallow) NLP components
- Hidden Markov Model Toolkit
- I*Link
- iFind KBSim.com - Knowledge-Based Simulations, Inc.
- Infogistics: NLProcessor Interactive Demo
- ISI's version of the RSTTool
- JavaBayes - v0.346
- jMRC - MRC Psycholinguistic Database Java Interface
- jTokeniser
- JWNL (Java WordNet Library)
- JWPL (Java Wikipedia Library)
- Knorpora 1.0
- KWiCFinder
- Kwicfinder
- Language Identification Tools
- Lemur Toolkit Download
- Lemur Toolkit Website
- Leximancer
- LIBSVM: A Library for Support Vector Machines
- Lingua-Syllable
- list of POS taggers
- Log-likelihood calculator
- MedPost: A Part-of-Speech Tagger for BioMedical text
- Mike Scott's Web - Wordsmith Tools
- MMAX Annotation Tool
- Moby Database
- Module for splitting text into sentences
- Moss: A System for Detecting Software Plagiarism
- Natlanco
- Natural Language Processing Systems
- NITE XML Toolkit
- NLTK - Natural Language Toolkit
- NMSU Natural Language Processing Tools
- Ontomat Homepage
- Open Mind
- OpenRCT Home
- ORIEL -- Online Research Information Environment for the Life Sciences
- PALinkA: A Resource Annotation Tool
- PC-KIMMO, Englex, PC-PATR, and PC-PARSE
- perl concordancer
- Porter Stemming Algorithm
- Project CoRRecT: Reference Corpus for the Recognition of Terms
- Protege Project
- Publically available POS tagger
- Query to Chinese Corpora
- Réacc - reaccenting software
- RDUES ACRONYM (Automatic Collocational Retrieval of NYMs) Project
- README for the daemonized version of Collins' Parser
- Research-lab.com
- RST LaTeX (Reitter IT and Media)
- Rule Engine for the Java Platform
- SATZ--Adaptive Sentence Boundary Detector
- sbobet
- Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)
- Software - The chunklink script, by Sabine Buchholz
- Software and Data Sets for Collins Natural Language Parser
- Software for the Extraction of N-ary Textual Associations (SENTA)
- Software Tools for NLP
- SProUT - Shallow Processing with Unification and Typed Feature Structures
- Stanford Parser
- sbo
- SVMTool
- SweSum - Automatic Text Summarizer (with PRM)
- Systemic Coder -- a Text Markup Tool (Version 4.5)
- t2p: Text-to-Phoneme Converter Builder
- Ted Pedersen - Tools for Parallel Text
- Telcordia Latent Semantic Indexing Demo Machine
- Text Encoding Initiative --Tools
- The CLAWS tagging service
- The EGYPT Statistical Machine Translation Toolkit
- The IMS Corpus Toolbox Webpage
- The Java Open Source Spell Checker
- The Naming Company
- TIMEX2 Taggers
- TnT - Statistical Part-of-Speech Tagger
- Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)
- Torch3
- Turbo Lingo
- Uplug
- VarCon (Variant Conversion Info)
- Virtual Language Centre's Web Concordancer
- VisualText
- Wordfreak
- WSD Shell
- XCES: Corpus Encoding Standard for XML
WordNet stuff (placeholder)
- Word-Net SenseRelate
- Word-Net Similarity
- alphabetic version of WordNet 2.0
- Perl interface to WordNet
QA systems (placeholder, needs to go somewhere else)
- Answerbus -- Automatic Language Detection Software
- QuALiM Question Answering System - Searches Wikipedia
- START Natural Language Question Answering System
- Aranea Question Answering System
- Webglimpse
- Qanda: Open source question answering system
Miscellaneous
- BNCweb: A Web-Based Interface to the British National Corpus
- CSLI LinGO Lab (Stanford)
- Chinese sentiment dictionary NTUSD
- Collocate
- FreeLing 1.1
- IR and IE on the web
- JWNL (Java WordNet Library)
- JavaRAP
- MTP Xlex/www
- Natural Language Processing software
- Natural Language Software Registry (at DFKI)
- Roget's Thesaurus as an Electronic Lexical Knowledge Base
- Text Analysis Computing Tools (TACT)
- TextCat
- Unitex