Resources for English

X2MORF (at the DFKI NLP archive)
AV parser (at the DFKI NLP archive)
CFG parser (at the DFKI NLP archive)
CHARON (at the DFKI NLP archive)
DISCO chart parser (at the DFKI NLP archive)
ENGCG (at the DFKI NLP archive)
ETL parser (at the DFKI NLP archive)
GPSG tools (at the DFKI NLP archive)
GPSG parser (at the DFKI NLP archive)
GRAMTSY (at the DFKI NLP archive)
Hdrug (at the DFKI NLP archive)
JIM III (at the DFKI NLP archive)
JPSG parser and CU-prolog (at the DFKI NLP archive)
LFG parser for Turkish (at the DFKI NLP archive)
Linguistic Kernel Processor (LKP) (at the DFKI NLP archive)
MCHART (at the DFKI NLP archive)
PAPPI (at the DFKI NLP archive)
PAULA (at the DFKI NLP archive)
PC-Translator (at the DFKI NLP archive)
PlayMoBild (at the DFKI NLP archive)
PLEUK (at the DFKI NLP archive)
SLG (at the DFKI NLP archive)
Syntactica (at the DFKI NLP archive)
UBS -- UnifikationsBasierte Sprache (at the DFKI NLP archive)
Xerox Part-of-Speech Tagger (XPOST) (at the DFKI NLP archive)
Zebu (at the DFKI NLP archive)
CUF (at the DFKI NLP archive)
GULP -- Graph Unification Logic Programming (at the DFKI NLP archive)
TFS (Typed Feature Structure) system (at the DFKI NLP archive)
TUG (at the DFKI NLP archive)
UBS (at the DFKI NLP archive)
Alvey Natural Language Tools (at the DFKI NLP archive)
Context Feature Structure System (at the DFKI NLP archive)
MODALYS (at the DFKI NLP archive)
NLL (at the DFKI NLP archive)
SLG (at the DFKI NLP archive)
System for evaluation of anaphoric relations (at the DFKI NLP archive)
AL FRESCO Interactive System (at the DFKI NLP archive)
CAT2(at the DFKI NLP archive)
CHARON (at the DFKI NLP archive)
DECtalk (at the DFKI NLP archive)
ELU (at the DFKI NLP archive)
FUF and SURGE (at the DFKI NLP archive)
GPSG--tools (at the DFKI NLP archive)
Linguistic Kernel Processor (LKP) (at the DFKI NLP archive)
NAUDA generation component (at the DFKI NLP archive)
PlayMoBild (at the DFKI NLP archive)
konnektionistisches Sprachproduktionsmodell (at the DFKI NLP archive)
STEMMA (at the DFKI NLP archive)
TAG--GEN (at the DFKI NLP archive)
TECHDOC (at the DFKI NLP archive)
UBS -- UnifikationsBasierte Sprache (at the DFKI NLP archive)
KRIS -- Knowledge Representation and Inference System (at the DFKI NLP archive)
QDATR (at the DFKI NLP archive)
RHET (at the DFKI NLP archive)
Type description language (at the DFKI NLP archive)
TEMPOS (at the DFKI NLP archive)
ALE -- Attribute Logic Engine (at the DFKI NLP archive)
ALEP (at the DFKI NLP archive)
AL FRESCO Interactive System (at the DFKI NLP archive)
Alvey Natural Language Tools (at the DFKI NLP archive)
BIM LOQUI (at the DFKI NLP archive)
CAT2 (at the DFKI NLP archive)
Context Feature Structure System (at the DFKI NLP archive)
ELU (at the DFKI NLP archive)
EVAR, ERNEST (at the DFKI NLP archive)
Experimental machine translation system (at the DFKI NLP archive)
Geta-run (at the DFKI NLP archive)
GILENA: A Natural-Language Interfaces Generator (at the DFKI NLP archive)
InterBASE (at the DFKI NLP archive) (at the DFKI NLP archive)
JAPE (Joke Analysis and Production Engine) (at the DFKI NLP archive)
KOMET (at the DFKI NLP archive)
Logos Translation Software and LogosClient (at the DFKI NLP archive)
Natural Language (TM) (at the DFKI NLP archive)
NL Builder 5.0 (TM) (at the DFKI NLP archive)
NUGGET (R) (at the DFKI NLP archive)
PAKTUS (at the DFKI NLP archive)
Pangloss (at the DFKI NLP archive)
PARLANCE / Learner (at the DFKI NLP archive)
PENMAN (at the DFKI NLP archive)
PLAIN+ (at the DFKI NLP archive)
POPEL (at the DFKI NLP archive)
PROFGLOT (at the DFKI NLP archive)
Pulavan (at the DFKI NLP archive)
Pundit (at the DFKI NLP archive)
QPATR (at the DFKI NLP archive)
SCISOR / NLToolset (at the DFKI NLP archive)
SNePS (at the DFKI NLP archive)
SNOOP (at the DFKI NLP archive)
SUNDIAL (at the DFKI NLP archive)
Tamil Part-of-speech tagger (at the DFKI NLP archive)
WordFan Conjugation (at the DFKI NLP archive)
XTRA (at the DFKI NLP archive)
YAKR (at the DFKI NLP archive)
ALEP (at the DFKI NLP archive)
COGNATE (at the DFKI NLP archive)
COMPULEXIS (at the DFKI NLP archive)
DCG workbench (at the DFKI NLP archive)
Dictionary Maintenance Utilities (at the DFKI NLP archive)
Dictionary Maintenance Programs (at the DFKI NLP archive)
DITO -- DIagnostic TOol for german syntax (at the DFKI NLP archive)
EGG -- editor for GPS-grammars (at the DFKI NLP archive)
GLOTTO (at the DFKI NLP archive)
Grammar Workbench (at the DFKI NLP archive)
GTU -- Grammatik-Test-Umgebung (at the DFKI NLP archive)
Linguistic DataBase (at the DFKI NLP archive)
LINGUIST (at the DFKI NLP archive)
MONKEY (at the DFKI NLP archive)
P--TRA (at the DFKI NLP archive)
Phono (at the DFKI NLP archive)
SEMBLEX (at the DFKI NLP archive)
Term Rewrite System for non-confluent TRS\'s (at the DFKI NLP archive)
British English Example Pronunciations (BEEP) (at the DFKI NLP archive)
ENGCG (at the DFKI NLP archive)
FJGH--grammar (at the DFKI NLP archive)
Base form reduction and search form production (at the DFKI NLP archive)
Magic (at the DFKI NLP archive)
PC--KIMMO definition files for turkish morphology (at the DFKI NLP archive)
ESTEAM (ESPRIT 316) (at the DFKI NLP archive)
How to Use IT (Mac) (at the DFKI NLP archive)
How to Use IT (MS-DOS) (at the DFKI NLP archive)
Hyphenation and Spell-checking (at the DFKI NLP archive)
ILA multilingual toolkit (at the DFKI NLP archive)
KOREKTOR 2.0 (at the DFKI NLP archive)
ORFO (at the DFKI NLP archive)
Parser (at the DFKI NLP archive)
STAMP (at the DFKI NLP archive)
STEMMA (at the DFKI NLP archive)
WORDSURV (at the DFKI NLP archive)
Bibliographic Search Page, Univ. of Essex
Code from James Allen\'s "Natural Language Understanding" (code at CMU)
Code from Michael Covington\'s "NLP for Prolog Programmers" (code at CMU)
Survey of Electronic Corpora (by Jane A. Edwards, file at CMU)
Yahoo! index on Natural Language Processing
Linguistics resources list at Princeton University
List of resources (at University of Toronto)
List of resources (at University of Stuttgart)
Experimental Corpus Query System (University of Stuttgart, Germany)
Natural Language Computing: An English Generative Grammar in Prolog
Latin Home Page
Comlex Syntax (Syntactic Dictionary of English)
HPSG Mailing List
IR list
The RELATOR language resources server
Language, Journal of the Linguistic Society of America
Collections of texts and corpora
List of stop words
Information Retrieval: Data Structures and Algorithms
Cascadilla Press
Alternative dictionaries
Cognition, a journal from Elsevier Science
Journal of Natural Language Engineering
Russell and Norvig - AI
On-line books at CMU
IMS Corpus Toolbox, Univ. of Stuttgart
Universal Grammar in Prolog
Swedish to English
Swedish to Finnish
English to Estonian
CELEX - The Dutch Center for Lexical Information
Bibliography for Phonetics/Speech Technology
ARIES Natural Language Tools
Bibliography to the book "Artificial Intelligence: A Modern Approach by Russell and Norvig
Survey of English Usage, University College, London
Dictionary site, Bucknell University
The Omicron Inforium
ECHO - Eurodicautom (multilingual technical dictionary)
Survey of the State of the Art of Human Language Technology
Multilingual PC software
MtScript - The Multext multi-lingual text editor
Ergane
MtRecode - Character conversion program
MtStr - Multilingual string library
Publically available POS tagger
Terminology for more than 15 languages
List of on-line dictionaries (from lai.com)
WWW Information on Computational Linguistics and Language Technology
Resource for high-quality tools supporting multi-lingual communication
Resource for professional-quality language translation tools.
Linguistic Data Consortium, University of Pennsylvania
NMSU Natural Language Processing Tools
Kluwer Academic Publishers
Knowledge Representation for Natural Language Processing in Implemented Systems
James Allen - Natural Language Understanding (source code)
Addison Wesley Longman higher education
Course in Corpus Linguistics, Tony McEnery & Andrew Wilson
Syntactic dependency parser for English
Hansards Corpus - Searchable
Annotated list of resources on statistical NLP and corpus-based CL
Corpus of spoken Bulgarian
11-761 Language and Statistics, course at CMU, Spring 1997
Common Lisp Hypermedia Server
BNC Online Service
Moby Database
Texas A&M University Linguistics Course Listings
Machine Learning Journal Special Issue on Natural Language Learning
Le corpus BAF (French and English)
Lexical FreeNet
ETAI - Electronic Transactions on Artificial Intelligence
KPML
Miscellaneous Word Lists from Oxford University
Name lists from US census
Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)
The IMS Corpus Toolbox Webpage
Grammar Writer\'s Workbench for Lexical Functional Grammar
Réacc - reaccenting software
Probability Theory: The Logic Of Science (by E. T. Jaynes, Washington University, Saint Louis)
Data Harmony, Document Management Software
Brainhat Natural Language Processing
Cranfield collection
Medlars collection
1963 Time Magazine corpus
The Moby Corpus
Saarland University, Computational Linguistics
Saarland University, Computational Linguistics
The NLP Dictionary
Speech and Language Processing, by Daniel Jurafsky and James Martin
Korean morphological analyzer and part-of-speech tagger
Russian Corpora
ISI\'s version of the RSTTool
Arbora Tree Delivery Service
Journal of Intelligent Information Systems
Information Retrieval (Journal)
Journal of the American Society for Information Science
ACM Transactions on Information Systems
Information Processing and Management
Kluwer series in Text, Speech and Language Technology
Perl interface to WordNet
Computational Linguistics
The Childes Corpus - Children\'s language
The Negra Corpus - German Syntax annotated
Links to Linguistic and Related Information (University of Passau)
Foundations of Computational Linguistics by Roland Hausser
EuroWordNet
Managing Gigabytes, by Witten, Moffat, and Bell
Syntactic Theory: A Formal Introduction by Ivan Sag and Thomas Wasow
Instructor\'s Manual for Syntactic Theory: A Formal Introduction
Treebank tokenization scheme
CS674: Natural Language Processing (Cornell U., Spring 2000)
Delphes Technologies International, natural language processing.
Delphes Technologies International
German Morphology Browser
ELSNET: Paper and Electronic Publications
The ALPAC Report
WEB-SLS: The European Student Journal of Language and Speech
Machine Translation
SIG-IRList Archives
Brill Tagger (Supervised, Trainable)
CMU Sphinx Group: Open Source Speech Recognition Engines
Natural Language Processing for French
MORLEX - A lexical database for French
VERTEX - A chart parser for unification grammars (French)
Artificial Intelligence NV (Ai)
Brain and Language
Journal of Memory and Language
Journal of Phonetics
Statistical Natural Language Processing: Models and Methods
The Naming Company
IntraText - The missing link between text and hypertext (TM)
VisualText
Opus, a commercial biology text mining system
Sequence learning: Paradigms, Algorithms and Applications
KURA 1.0
The Mariano Silva y Aceves Series
Agglutination on the Basis of Corpus Information
The Language of Word Meaning
REAL WORLD LINGUISTICS 101
The Cross-Language Evaluation Forum
COMPUTER SPEECH AND LANGUAGE
Bigram Statistics Package
POLYSEMY: Theoretical and Computational Approaches
Arabic Newswire Part 1
COPERNIC 2000
KWiCFinder
IMS Corpus Workbench (CWB)
BRITISH NATIONAL CORPUS - WORLD EDITION
LANGUAGE LEARNING CENTER - ACADEMIC CORPUS
WEBCORP
BNC Indexer
The Internet Timelines Project
Project: Pytalk
Release of RSTTool: RSTTool 2.7
The Rosetta PrOject
Russian Phonetics on the Web
NLP: User Modeling 2001
Computer Speech and Language
TransSearch
ICOPOST
Evolutionary Web Development
The Ninth Text REtrieval Conference (TREC 9) Conference Proceedings
TreeTagger - a language independent part-of-speech tagger
The John Bateman and Michael Zock\'s list of Natural Language Generation Systems
Embedded MT Systems: Leveraging for Real World Applications
Linguistic Interpretation of a German Corpus
WEB-CONC
English-Chinese Chinese-English Dictionary of Computer Terms
The XTAG Project
Natural Language Processing Systems
Special Interest Group on Computational Semantics
Workshop on Web-Based Language Documentation and Description Papers
SOFTISSIMO
Slovene-English Parallel Corpus
Czech National Corpus
Lecture Notes in Computer Science Vol. 1835
Lecture Notes in Computer Science Vol. 1835
Information Extraction Towards Scalable, Adaptable Systems
Envisioning Machine Translation in the Information Future 4th Conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 10-14, 2000 Proceedings
Text, Speech and Dialogue Third International Workshop, TSD 2000 Brno, Czech Republic, September 13-16, 2000 Proceedings
CORPUS DEL ESPANOL
Author/Institution Self-Archiving
Software Tools for NLP
Software Tools for NLP
Multilingual Text Tools and Corpora
The PLUG Word Aligner - PWA
Ramon Piero Center for Research
IR resources
Reuters Corpus
TELRI Research Archive of Computational Tools and Resources
ROBUSTNESS IN LANGUAGE AND SPEECH TECHNOLOGY
2000 NIST Speaker Recognition Evaluation Corpus
HCRC Map Task Corpus XML annotations
Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio
Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio
The CLAWS tagging service
A Syntactically Annotated Corpus of German Newspaper Texts
CLaRK System
Resources for Text, Speech and Language Processing
LEXICOGRAPHY AND THE OED: Pioneers in the Untrodden Forest
HANDBOOK OF AUSTRALIAN LANGUAGES
LFG Database: List of Names
LINK GRAMMAR PARSER
ONLINE LINGUISTICS JOURNAL
Architectures and Mechanisms for Language Processing
A Survey of Open Language Archives
The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English
The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English
Bilingual Speech: A Typology of Code-Mixing
An Empirical Grammar of the English Verb System
LANGUAGE LINKS
LANGUAGE LINKS
The HALogen Natural Language Generation system
BNCweb a web-based interface to the British National Corpus
Bookmarks for Corpus-based Linguists
Kiel University\'s Institute on Phonetics and Speech Procesing
Susanne: Annotated American English Corpus
Telcordia Latent Semantic Indexing Demo Machine
Search Tools for Web Sites and Intranets
Automatic English Sentence Segmentation
Module for splitting text into sentences
SATZ--Adaptive Sentence Boundary Detector
[1]
a simple grammar of English
Nexing Corpus
Internet Grammar of English
UCREL Semantic Analysis System
The Java Open Source Spell Checker
Aramedia
Dictionaries for International Ispell
Text Encoding Initiative --Tools
The Java Open Source Spell Checker
Canoo.net - German Dictionaries and Grammars
Valencianlanguage.com
Hebrew Morphological Parser
Computational Linguistics, James Pustejovsky, Brandeis University
Perl: extending your pos-tagger using regular expressions, Dan Jurafsky
Hidden Markov Model Toolkit
The Bank of Swedish - A Linguistic Reference Database of Göteborg University
Alpino Treebank
MT List
Dialogue Diversity Corpus
IJECE
Language Identification Tools
Turbo Lingo
Moss -A System for Detecting Software Plagiarism
Russian Newspaper Corpus
GENIA Project Home Page
LIBSVM -- A Library for Support Vector Machines
JavaBayes - v0.346
Bayes Net Toolbox for Matlab
Bayesian Network tools in Java (BNJ)
Computer Speech, Text and Internet Technology
List of English stopwords
COSMAS II
Newspapers on the Internet
Danish news corpus
The Oslo Corpus of Bosnian Texts
Finnish text bank
Oxford Text Archive Corpus of Italian Newspapers
GlossaNet
perl concordancer
UNITEX
EuroWordNet
Moss: A System for Detecting Software Plagiarism
Russian Newspaper Corpus
GENIA corpus version 3.0p
LIBSVM: A Library for Support Vector Machines
[www.torch.ch Torch3]
JavaBayes - version 0.346
Bayes Net Toolbox for Matlab
Bayesian Network tools in Java (BNJ)
English stop words (from SMART)
OpenRCT Home
Lemur Toolkit Download
iFind KBSim.com - Knowledge-Based Simulations, Inc.
Collective (Chaotic - Emergent) Language
Infogistics: NLProcessor Interactive Demo
VISL - Visual Interactive Syntax Learning
RST LaTeX (Reitter IT and Media)
English Intonation in the British Isles -The IViE Corpus
Electronic Text Center -- University of Virginia
YourDictionary
list of systems in multiple languages
Alignment of bilingual corpora performed with EasyAlign
Infogistic - NLProcessor Interactive Demo
Kwicfinder
Demos, University of Alberta, Canada
Hansard French-English parallel corpus
Learner Behaviour on the Internet
Demos of dependency database, parser, and other tools
Automatic Term Extraction System
Mike Scott\'s Web - Wordsmith Tools
Software for the Extraction of N-ary Textual Associations (SENTA)
Studies in Language and Linguistics
St. Jerome Publishing
Leximancer
Concollate
The BNC Index (for the BNCWorld Edition)
Rule Engine for the Java Platform
HAITIAN CREOLE ELECTRONIC TEXTS
Haitian Creole corpus -Teknoloji pou lang kreyol
Natural Language Engineering
Bancos de dados e Ferramentas de an\`alise
CEPRIL aligner
CEPRIL - Portugese Segmenter
OPUS - an open source parallel corpus
NLTK - Natural Language Toolkit
Finite State Automata Utilities v6
Protege Project
Public registry of the Council of the EU
Corpus of Spoken Professional English
Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS
alphabetic version of WordNet 2.0
"Word Frequencies in Written and Spoken English: based on the British National Corpus."
AMALGAM project
books on computational semantics
list of Japanese transitive - intransitive verb pairs
Debian free software community
French Foreign Ministry\'s magazine
Multi-Paradigm Programming in Oz for NLP
COMPARA corpus
Natlanco
Natlanco
Research-lab.com
Visual Text - reference documentation
UN declaration of human rights in multiple languages
Web IR \& IE
Log-likelihood calculator
The Dialogue Diversity Corpus
Mapping WordNet Versions 1.6 and 2.0
AMERICAN NATIONAL CORPUS FIRST RELEASE
Multiword Expression Resources
LingPipe
kfNgram
Exploring Words and Phrases from the British National Corpus
Useful links about parallel corpora, by Olivier Kraif
The LUCY Corpus - Documentation
TreeTalk: Memory - Based Grapheme - Phoneme Conversion Demo
NeXTeNS - Dutch Extension for Text to Speech
English-Truespel (USA Accent) Text Conversion Tool
README for the daemonized version of Collins\' Parser
The Lexical Semantics of a Machine Translation Interlingua
Educational Research Abstracts
Sanskrit Library
Software - The chunklink script, by Sabine Buchholz
Comprehensive Perl Archive Network
Prosogram
CREA
Verbot preview 4.0
Phrases in English
Chilibot: NLP based miner for gene/protein/keyword relationships
NEGRA Corpus
Corpus del Espanol
American English SpeechDat-Car
Freelangonline - many on-line dictionaries + more
TIGERSearch - tools for linguistic text exploration
Lexical information for German
ISI rewrite decoder
Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
Qanda: Open source question answering system
FLEMMV31 - Inflectional morphology parser for French
Wortlisten: spoken German, English, French, and Dutch
VISL Tagger and Parser
Cambridge Learner Dictionary
Polish subcorpus of the International Corpus of Learner English
Robot Karaoke
Restricted English Corpus from Dr. Caroline Lyon for PhD
Russian Corpus Site
English Resource Grammar
Linguist\'s Search Engine
Phono- Sound Change Model Software
Multext-East Project
Kirrkirr 4.0 Dictionary Program
Phrases in English and the British National Corpus
Russicon Resources
CPAN Lingua EN Sentence Splitter
CPAN Lingua HE Sentence Splitter
CPAN Suffix Tree Module
CSPAN Sentence Splitter
WordNet Domains
Log-likelihood calculator
TREC Video Retrieval Evaluation Page
Python Programming Tutorial
IBM\'s Speech Recognition Modules
CLaRK System
Dan Bikel\'s Parser
Virtual Language Centre\'s Web Concordancer
Bilingual Dictionary French Arabic
LinkGrammar-WN project
EMILLE corpus
Paper on Sentence Boundary Disambiguation
Versioning Machine 2.0
Russian Corpus Site
Lacio Web Corpora
Russian Corpus Page
Wordsmyth Children\'s Dictionary
AFGL Parser Generator
Freelangonline - many on-line dictionaries + more
Prosogram
CREA
Verbot preview 4.0
Phrases in English
Chilibot: NLP based miner for gene/protein/keyword relationships
NEGRA Corpus
Corpus del Espanol
American English SpeechDat-Car
Freelangonline - many on-line dictionaries + more
TIGERSearch - tools for linguistic text exploration
Lexical information for German
ISI rewrite decoder
Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
Qanda: Open source question answering system
FLEMMV31 - Inflectional morphology parser for French
Wortlisten: spoken German, English, French, and Dutch
Short intensive course: Texts, Discourse and Corpora: Corpora in Linguistics and Related Fields
DTREG decision tree generator
NAACL-Supported Two-Week Summer School in Human Language Technologies
Omphalos Context-Free Language Learning Competition
Senseval-3 Task: Automatic Labeling of Semantic Roles
Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses
NLSH: Natural Language Shell
NAACL-Supported Two-Week Summer School in Human Language Technologies
CSLI LinGO Lab (Stanford)
UniNE stopword list for Portuguese
Natural Language Processing / Information Retrieval Software Repository
Web Term Document Frequency Form (Berkeley)
3rd North American Summer School in Logic, Language and Information
IR and IE on the web
Corpus building for minority languages
3rd NASSLLI: North American Summer School in Logic, Language and Information
FreeLing 1.1
ICE corpora
Natural Language Processing for Online Applications
Longdo Thai Dictionary

Resources for English

Navigation menu

Search