Difference between revisions of "Resources for English"

From ACL Wiki
Jump to navigation Jump to search
Line 232: Line 232:
  
  
===SOFTWARE - TOOLS===
 
*[http://www.ldc.upenn.edu/Projects/ACE/Tools/  Automatic Content Extraction (ACE): Annotation Tools]
 
  
*[http://www.norvig.com/paip/grammar.lisp a simple grammar of English]
 
 
*[http://sourceforge.net/projects/acopost/ ACOPOST]
 
 
*[http://acdc.linguateca.pt/example_alignment.html Alignment of bilingual corpora performed with EasyAlign]
 
 
*[http://www.lsi.upc.es/~lambert/software/AlignmentSet.html Alignment Set Toolkit]
 
 
*[http://www.clres.com/WordNet.html alphabetic version of WordNet 2.0]
 
 
*[http://lucene.apache.org/java/docs/ Apache Lucene]
 
 
*[http://www.arabeyes.org/ Arabeyes Project]
 
 
*[http://members.aol.com/gnhbos/ocr.htm Aramedia]
 
 
*[http://www.umiacs.umd.edu/~jimmylin/downloads/index.html Aranea Question Answering System]
 
 
*[http://www.ltg.ed.ac.uk/~jo/interarbora/ Arbora Tree Delivery Service]
 
 
*[http://misshoover.si.umich.edu/~zzheng/sentence/ Automatic English Sentence Segmentation]
 
 
*[http://www.clg.wlv.ac.uk/projects/CAST/demos.php Automatic Summarization Demos]
 
 
*[http://www.r.dl.itc.u-tokyo.ac.jp/~nakagawa/resource/termext/atr-e.html Automatic Term Extraction System]
 
 
*[http://lael.pucsp.br/corpora/ Bancos de dados e Ferramentas de an`alise]
 
 
*[http://www.ai.mit.edu/~murphyk/Software/BNT/bnt.html Bayes Net Toolbox for Matlab]
 
 
*[http://bndev.sourceforge.net/ Bayesian Network tools in Java (BNJ)]
 
 
*[http://sslmit.unibo.it/~baroni/bootcat.html BootCaT: Simple Utilities to Bootstrap Corpora and Terms from the Web]
 
 
*[http://callisto.mitre.org/ Callisto Annotation Tool]
 
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2005T13 CCGBank]
 
 
*[http://lael.pucsp.br/corpora/alinhador/ CEPRIL aligner]
 
 
*[http://pie.usna.edu/explorec.html Chargrams Database from British National Corpus]
 
 
*[http://www.bultreebank.org/clark/index.html CLaRK System]
 
 
*[http://dlt4.mit.edu/~dr/COALS/ COALS: Correlated Occurrence Analogue to Lexical Semantics]
 
 
*[ftp://cs.nyu.edu/pub/html/comlex.html/ Comlex]
 
 
*[http://www.ai.mit.edu/projects/iiip/doc/cl-http/home-page.html Common Lisp Hypermedia Server]
 
 
*[http://www.cpan.org/ Comprehensive Perl Archive Network]
 
 
*[http://clg.wlv.ac.uk/projects/CAST/ Computer Aided Summarisation Tool (CAST)]
 
 
*[http://infomap.stanford.edu/webdemo Concept Search Engine Information Mapping Demo (Center for the Study of Language and Information, Stanford University)]
 
 
*[https://sourceforge.net/projects/concollate/ Concollate]
 
 
*[http://borel.slu.edu/crubadan/ Corpus building for minority languages]
 
 
*[http://montev.isi.edu:8000/align-tool/?CORPUS=de-news-morphix&AFILE=full-model1-50-50.gz Corpus De-News-Morphix Alignment Tool]
 
 
*[http://search.cpan.org/dist/SuffixTree/ CPAN Suffix Tree Module]
 
 
*[http://www.ucl.ac.uk/english-usage/diachronic/index.htm Creating a Parsed and Searchable Diachronic Corpus of Present-Day Spoken English]
 
 
*[http://www.cis.upenn.edu/~dbikel/software.html#wn Dan Bikel's Java WordNet Library]
 
 
*[http://www.dataharmony.com/ Data Harmony, Document Management Software]
 
 
*[http://www.cs.ualberta.ca/~lindek/demos.htm Demos of dependency database, parser, and other tools]
 
 
*[http://fuzzy.cs.uni-magdeburg.de/~borgelt/dtree.html Dtree - Decision and Regression Tree Induction]
 
 
*[http://www.foreignword.com/dictionary/truespel/transpel.htm English-Truespel (USA Accent) Text Conversion Tool]
 
 
*[http://www.cs.jhu.edu/~brill/ Eric Brill's Part of Speech Tagger]
 
 
*[http://odur.let.rug.nl/~vannoord/Fsa/Manual/node1.html Finite State Automata Utilities v6]
 
 
*[http://www.jaist.ac.jp/~hieuxuan/flexcrfs/flexcrfs.html FlexCRFs: Flexible Conditional Random Fields]
 
 
*[http://garraf.epsevg.upc.es/freeling/ FreeLing 1.2]
 
 
*[http://grid.let.rug.nl/~vannoord/Fsa/fsa.html FSA6.2xx: Finite State Automata Utilities]
 
 
*[http://gate.ac.uk/ GATE (General Architecture for Text Engineering)]
 
 
*[http://www.clsp.jhu.edu/ws2005/groups/statistical/GenPar.html GenPar Toolkit for Generalized Parsing]
 
 
*[http://www.parc.xerox.com/istl/groups/nltt/medley/ Grammar Writer's Workbench for Lexical Functional Grammar]
 
 
*[http://htk.eng.cam.ac.uk Hidden Markov Model Toolkit]
 
 
*[http://www.ida.liu.se/~nlplab/ILink/ I*Link]
 
 
*[http://www.kbsim.com/ifind.html iFind KBSim.com - Knowledge-Based Simulations, Inc.]
 
 
*[http://www.infogistics.com/posdemo.htm Infogistics: NLProcessor Interactive Demo]
 
 
*[http://www.isi.edu/~marcu/software.html ISI's version of the RSTTool]
 
 
*[http://www-2.cs.cmu.edu/~javabayes/Home/ JavaBayes - v0.346]
 
 
*[http://www.comp.leeds.ac.uk/andyr/software/jTokeniser/ jTokeniser]
 
 
*[http://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library)]
 
 
*[http://sslmit.unibo.it/%7ebaroni/welcome_to_knorpora.html Knorpora 1.0]
 
 
*[http://miniappolis.com/KWiCFinder/KWiCFinderHome.html KWiCFinder]
 
 
*[http://www.kwicfinder.com/KWiCFinder.html Kwicfinder]
 
 
*[http://odur.let.rug.nl/~vannoord/TextCat/competitors.html Language Identification Tools]
 
 
*[http://www-2.cs.cmu.edu/~lemur/download.html Lemur Toolkit Download]
 
 
*[http://www.lemurproject.org/ Lemur Toolkit Website]
 
 
*[http://www.leximancer.com/ Leximancer]
 
 
*[http://www.csie.ntu.edu.tw/~cjlin/libsvm/ LIBSVM: A Library for Support Vector Machines]
 
 
*[http://www.alias-i.com/lingpipe/ LingPipe]
 
 
*[http://search.cpan.org/~lgoddard/Lingua-Syllable-0.03/Syllable.pm Lingua-Syllable]
 
 
*[http://listserv.linguistlist.org/cgi-bin/wa?A2=ind0109&L=corpora&P=R729 list of POS taggers]
 
 
*[http://ucrel.lancs.ac.uk/llwizard.html Log-likelihood calculator]
 
 
*[ftp://ftp.ncbi.nlm.nih.gov/pub/lsmith/MedPost/medpost.tar.gz MedPost: A Part-of-Speech Tagger for BioMedical text]
 
 
*[http://www.lexically.net/wordsmith/version4/index.htm Mike Scott's Web - Wordsmith Tools]
 
 
*[http://mmax.eml-research.de MMAX Annotation Tool]
 
 
*[http://www.dcs.shef.ac.uk/research/ilash/Moby/ Moby Database]
 
 
*[http://search.cpan.org/author/SHLOMOY/Lingua-EN-Sentence-0.25/lib/Lingua/EN/Sentence.pm Module for splitting text into sentences]
 
 
*[http://www.cs.berkeley.edu/~aiken/moss.html Moss: A System for Detecting Software Plagiarism]
 
 
*[http://www.clsp.jhu.edu/ws2005/groups/statistical/mtv.html Multitree Viewer (MTV)]
 
 
*[http://www.natlantech.com/lingbench_ide.html Natlanco]
 
 
*[http://www.cs.jhu.edu/~brill/code.html Natural Language Processing Systems]
 
 
*[http://www.ltg.ed.ac.uk/NITE/ NITE XML Toolkit]
 
 
*[http://nltk.sourceforge.net NLTK - Natural Language Toolkit]
 
 
*[http://crl.nmsu.edu/Tools/Software/ NMSU Natural Language Processing Tools]
 
 
*[http://annotation.semanticweb.org/ontomat/index.html Ontomat Homepage]
 
 
*[http://teach-computers.org/word-expert.html Open Mind]
 
 
*[http://davinci.cs.ucdavis.edu/ OpenRCT Home]
 
 
*[http://www.oriel.org/homonym.htm ORIEL -- Online Research Information Environment for the Life Sciences]
 
 
*[http://clg.wlv.ac.uk/projects/PALinkA/ PALinkA: A Resource Annotation Tool]
 
 
*[http://www.sil.org/ PC-KIMMO, Englex, PC-PATR, and PC-PARSE]
 
 
*[http://www.bluem.net/downloads/pdftotext_en/ PDF to Text]
 
 
*[http://wall.jussieu.fr/dyn/Context2 perl concordancer]
 
 
*[http://www.ai.mit.edu/~jrennie/WordNet/ Perl interface to WordNet]
 
 
*[http://www.tartarus.org/~martin/PorterStemmer/index.html Porter Stemming Algorithm]
 
 
*[http://www.sciences.univ-nantes.fr/info/perso/permanents/enguehard/recherche/CoRRecT/CoRRecT_gb.htm Project CoRRecT: Reference Corpus for the Recognition of Terms]
 
 
*[http://protege.stanford.edu/ Protege Project]
 
 
*[http://www.lingsoft.fi/cgi-pub/engcg Publically available POS tagger]
 
 
*[http://www.openchannelsoftware.org/projects/Qanda Qanda: Open source question answering system]
 
 
*[http://corpus.leeds.ac.uk/query-zh.html Query to Chinese Corpora]
 
 
*[http://www-rali.iro.umontreal.ca/Reacc/ Réacc - reaccenting software]
 
 
*[http://rdues.uce.ac.uk/acronym.shtml RDUES ACRONYM (Automatic Collocational Retrieval of NYMs) Project]
 
 
*[http://www.comp.nus.edu.sg/~rpnlpir/daemonCollins/ README for the daemonized version of Collins' Parser]
 
 
*[http://www.research-lab.com/ Research-lab.com]
 
 
*[http://hacks.oreilly.com/pub/h/1011 Robot Karaoke]
 
 
*[http://www.reitter-it-media.de/compling/index.html RST LaTeX  (Reitter IT and Media)]
 
 
*[http://herzberg.ca.sandia.gov/jess/index.shtml Rule Engine for the Java Platform]
 
 
*[http://elib.cs.berkeley.edu/src/satz/ SATZ--Adaptive Sentence Boundary Detector]
 
 
*[http://ixa.si.ehu.es/Ixa/resources/selprefs Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)]
 
 
*[http://ilk.uvt.nl/~sabine/chunklink/ Software - The chunklink script, by Sabine Buchholz]
 
 
*[http://people.csail.mit.edu/people/mcollins/code.html Software and Data Sets for Collins Natural Language Parser]
 
 
*[http://senta.di.ubi.pt Software for the Extraction of N-ary Textual Associations (SENTA)]
 
 
*[http://www-a2k.is.tokushima-u.ac.jp/member/kita/NLP/nlp_tools.html Software Tools for NLP]
 
 
*[http://www-nlp.stanford.edu/software/lex-parser.shtml Stanford Parser]
 
 
*[http://start.csail.mit.edu/ START Natural Language Question Answering System]
 
 
*[http://www.lsi.upc.edu/%7Enlp/SVMTool/ SVMTool]
 
 
*[http://swesum.nada.kth.se/index-eng.html SweSum - Automatic Text Summarizer (with PRM)]
 
 
*[http://www.wagsoft.com/Coder/ Systemic Coder -- a Text Markup Tool (Version 4.5)]
 
 
*[http://www-2.cs.cmu.edu/~lenzo/t2p/ t2p: Text-to-Phoneme Converter Builder]
 
 
*[http://www.d.umn.edu/~tpederse/parallel.html Ted Pedersen - Tools for Parallel Text]
 
 
*[http://lsi.research.telcordia.com/ Telcordia Latent Semantic Indexing  Demo Machine]
 
 
*[http://www.tei-c.org/Software/index.html Text Encoding Initiative --Tools]
 
 
*[http://www.comp.lancs.ac.uk/computing/research/ucrel/claws/tagservice.html The CLAWS tagging service]
 
 
*[http://www.clsp.jhu.edu/ws99/projects/mt/toolkit/ The EGYPT Statistical Machine Translation Toolkit]
 
 
*[http://www.ims.uni-stuttgart.de/CorpusToolbox/ The IMS Corpus Toolbox Webpage]
 
 
*[http://jazzy.sourceforge.net/ The Java Open Source Spell Checker]
 
 
*[http://www.findingnames.net/ The Naming Company]
 
 
*[http://stp.ling.uu.se/~corpora/plug/pwa/ The PLUG Word Aligner - PWA]
 
 
*[http://timex2.mitre.org/taggers/timex2_taggers.html TIMEX2 Taggers]
 
 
*[http://www.coli.uni-sb.de/~thorsten/tnt/ TnT - Statistical Part-of-Speech Tagger]
 
 
*[ftp://ftp.ids-mannheim.de/kt/CSSCCb-4.0.tar.bz2 Tool for Identifying Duplicates in a Large Document Collection]
 
 
*[http://www.cs.columbia.edu/nlp/tools.html Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)]
 
 
*[http://www.torch.ch Torch3]
 
 
*[http://main.amu.edu.pl/~sipkadan/lingo.htm Turbo Lingo]
 
 
*[http://stp.ling.uu.se/cgi-bin/joerg/Uplug Uplug]
 
 
*[http://wordlist.sourceforge.net/varcon-readme VarCon (Variant Conversion Info)]
 
 
*[http://www.edict.com.hk/concordance/ Virtual Language Centre's Web Concordancer]
 
 
*[http://www.textanalysis.com/help/help.htm Visual Text - reference documentation]
 
 
*[http://www.textanalysis.com/ VisualText]
 
 
*[http://webglimpse.net/ Webglimpse]
 
 
*[http://search.cpan.org/dist/WordNet-SenseRelate Word-Net SenseRelate]
 
 
*[http://search.cpan.org/dist/WordNet-Similarity Word-Net Similarity]
 
 
*[http://sourceforge.net/projects/wordfreak Wordfreak]
 
 
*[http://www.cogsci.princeton.edu/~wn/ Wordnet]
 
 
*[http://www.d.umn.edu/~tpederse/similarity.html WordNet-Similarity PERL Module]
 
 
*[http://www.d.umn.edu/~tpederse/wsdshell.html WSD Shell]
 
 
*[http://www.xml-ces.org/ XCES: Corpus Encoding Standard for XML]
 
  
 
==TOOLS==
 
==TOOLS==

Revision as of 13:46, 2 November 2006

Most of the early additions have been moved here from the ACL NLP/CL Universe.


LANGUAGE


MAILING

ONLINE

PAPERS

SOFTWARE

TOOLS

UNCATEGORIZED