Resources for English
- X2MORF (at the DFKI NLP archive)
- AV parser (at the DFKI NLP archive)
- CFG parser (at the DFKI NLP archive)
- CHARON (at the DFKI NLP archive)
- DISCO chart parser (at the DFKI NLP archive)
- ENGCG (at the DFKI NLP archive)
- ETL parser (at the DFKI NLP archive)
- GPSG tools (at the DFKI NLP archive)
- GPSG parser (at the DFKI NLP archive)
- GRAMTSY (at the DFKI NLP archive)
- Hdrug (at the DFKI NLP archive)
- JIM III (at the DFKI NLP archive)
- JPSG parser and CU-prolog (at the DFKI NLP archive)
- LFG parser for Turkish (at the DFKI NLP archive)
- Linguistic Kernel Processor (LKP) (at the DFKI NLP archive)
- MCHART (at the DFKI NLP archive)
- PAPPI (at the DFKI NLP archive)
- PAULA (at the DFKI NLP archive)
- PC-Translator (at the DFKI NLP archive)
- PlayMoBild (at the DFKI NLP archive)
- PLEUK (at the DFKI NLP archive)
- SLG (at the DFKI NLP archive)
- Syntactica (at the DFKI NLP archive)
- UBS -- UnifikationsBasierte Sprache (at the DFKI NLP archive)
- Xerox Part-of-Speech Tagger (XPOST) (at the DFKI NLP archive)
- Zebu (at the DFKI NLP archive)
- CUF (at the DFKI NLP archive)
- GULP -- Graph Unification Logic Programming (at the DFKI NLP archive)
- TFS (Typed Feature Structure) system (at the DFKI NLP archive)
- TUG (at the DFKI NLP archive)
- UBS (at the DFKI NLP archive)
- Alvey Natural Language Tools (at the DFKI NLP archive)
- Context Feature Structure System (at the DFKI NLP archive)
- MODALYS (at the DFKI NLP archive)
- NLL (at the DFKI NLP archive)
- SLG (at the DFKI NLP archive)
- System for evaluation of anaphoric relations (at the DFKI NLP archive)
- AL FRESCO Interactive System (at the DFKI NLP archive)
- CAT2 (at the DFKI NLP archive)
- CHARON (at the DFKI NLP archive)
- DECtalk (at the DFKI NLP archive)
- ELU (at the DFKI NLP archive)
- FUF and SURGE (at the DFKI NLP archive)
- GPSG--tools (at the DFKI NLP archive)
- Linguistic Kernel Processor (LKP) (at the DFKI NLP archive)
- NAUDA generation component (at the DFKI NLP archive)
- PlayMoBild (at the DFKI NLP archive)
- konnektionistisches Sprachproduktionsmodell (at the DFKI NLP archive)
- STEMMA (at the DFKI NLP archive)
- TAG--GEN (at the DFKI NLP archive)
- TECHDOC (at the DFKI NLP archive)
- UBS -- UnifikationsBasierte Sprache (at the DFKI NLP archive)
- KRIS -- Knowledge Representation and Inference System (at the DFKI NLP archive)
- QDATR (at the DFKI NLP archive)
- RHET (at the DFKI NLP archive)
- Type description language (at the DFKI NLP archive)
- TEMPOS (at the DFKI NLP archive)
- ALE -- Attribute Logic Engine (at the DFKI NLP archive)
- ALEP (at the DFKI NLP archive)
- AL FRESCO Interactive System (at the DFKI NLP archive)
- Alvey Natural Language Tools (at the DFKI NLP archive)
- BIM LOQUI (at the DFKI NLP archive)
- CAT2 (at the DFKI NLP archive)
- Context Feature Structure System (at the DFKI NLP archive)
- ELU (at the DFKI NLP archive)
- EVAR, ERNEST (at the DFKI NLP archive)
- Experimental machine translation system (at the DFKI NLP archive)
- Geta-run (at the DFKI NLP archive)
- GILENA: A Natural-Language Interfaces Generator (at the DFKI NLP archive)
- InterBASE (at the DFKI NLP archive)
- JAPE (Joke Analysis and Production Engine) (at the DFKI NLP archive)
- KOMET (at the DFKI NLP archive)
- Logos Translation Software and LogosClient (at the DFKI NLP archive)
- Natural Language (TM) (at the DFKI NLP archive)
- NL Builder 5.0 (TM) (at the DFKI NLP archive)
- NUGGET (R) (at the DFKI NLP archive)
- PAKTUS (at the DFKI NLP archive)
- Pangloss (at the DFKI NLP archive)
- PARLANCE / Learner (at the DFKI NLP archive)
- PENMAN (at the DFKI NLP archive)
- PLAIN+ (at the DFKI NLP archive)
- POPEL (at the DFKI NLP archive)
- PROFGLOT (at the DFKI NLP archive)
- Pulavan (at the DFKI NLP archive)
- Pundit (at the DFKI NLP archive)
- QPATR (at the DFKI NLP archive)
- SCISOR / NLToolset (at the DFKI NLP archive)
- SNePS (at the DFKI NLP archive)
- SNOOP (at the DFKI NLP archive)
- SUNDIAL (at the DFKI NLP archive)
- Tamil Part-of-speech tagger (at the DFKI NLP archive)
- WordFan Conjugation (at the DFKI NLP archive)
- XTRA (at the DFKI NLP archive)
- YAKR (at the DFKI NLP archive)
- ALEP (at the DFKI NLP archive)
- COGNATE (at the DFKI NLP archive)
- COMPULEXIS (at the DFKI NLP archive)
- DCG workbench (at the DFKI NLP archive)
- Dictionary Maintenance Utilities (at the DFKI NLP archive)
- Dictionary Maintenance Programs (at the DFKI NLP archive)
- DITO -- DIagnostic TOol for German syntax (at the DFKI NLP archive)
- EGG -- editor for GPS-grammars (at the DFKI NLP archive)
- GLOTTO (at the DFKI NLP archive)
- Grammar Workbench (at the DFKI NLP archive)
- GTU -- Grammatik-Test-Umgebung (at the DFKI NLP archive)
- Linguistic DataBase (at the DFKI NLP archive)
- LINGUIST (at the DFKI NLP archive)
- MONKEY (at the DFKI NLP archive)
- P--TRA (at the DFKI NLP archive)
- Phono (at the DFKI NLP archive)
- SEMBLEX (at the DFKI NLP archive)
- Term Rewrite System for non-confluent TRS's (at the DFKI NLP archive)
- British English Example Pronunciations (BEEP) (at the DFKI NLP archive)
- ENGCG (at the DFKI NLP archive)
- FJGH--grammar (at the DFKI NLP archive)
- Base form reduction and search form production (at the DFKI NLP archive)
- Magic (at the DFKI NLP archive)
- PC--KIMMO definition files for Turkish morphology (at the DFKI NLP archive)
- ESTEAM (ESPRIT 316) (at the DFKI NLP archive)
- How to Use IT (Mac) (at the DFKI NLP archive)
- How to Use IT (MS-DOS) (at the DFKI NLP archive)
- Hyphenation and Spell-checking (at the DFKI NLP archive)
- ILA multilingual toolkit (at the DFKI NLP archive)
- KOREKTOR 2.0 (at the DFKI NLP archive)
- ORFO (at the DFKI NLP archive)
- Parser (at the DFKI NLP archive)
- STAMP (at the DFKI NLP archive)
- STEMMA (at the DFKI NLP archive)
- WORDSURV (at the DFKI NLP archive)
- Bibliographic Search Page, Univ. of Essex
- Code from James Allen's "Natural Language Understanding" (code at CMU)
- Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)
- Survey of Electronic Corpora (by Jane A. Edwards, file at CMU)
- Yahoo! index on Natural Language Processing
- Linguistics resources list at Princeton University
- List of resources (at University of Toronto)
- List of resources (at University of Stuttgart)
- Experimental Corpus Query System (University of Stuttgart, Germany)
- Natural Language Computing: An English Generative Grammar in Prolog
- Latin Home Page
- Comlex Syntax (Syntactic Dictionary of English)
- HPSG Mailing List
- IR list
- The RELATOR language resources server
- Language, Journal of the Linguistic Society of America
- Collections of texts and corpora
- List of stop words
- Information Retrieval: Data Structures and Algorithms
- Cascadilla Press
- Alternative dictionaries
- Cognition, a journal from Elsevier Science
- Journal of Natural Language Engineering
- Russell and Norvig - AI
- On-line books at CMU
- IMS Corpus Toolbox, Univ. of Stuttgart
- Universal Grammar in Prolog
- Swedish to English
- Swedish to Finnish
- English to Estonian
- CELEX - The Dutch Center for Lexical Information
- Bibliography for Phonetics/Speech Technology
- ARIES Natural Language Tools
- Bibliography to the book "Artificial Intelligence: A Modern Approach" by Russell and Norvig
- Survey of English Usage, University College, London
- Dictionary site, Bucknell University
- The Omicron Inforium
- ECHO - Eurodicautom (multilingual technical dictionary)
- Survey of the State of the Art of Human Language Technology
- Multilingual PC software
- MtScript - The Multext multi-lingual text editor
- Ergane
- MtRecode - Character conversion program
- MtStr - Multilingual string library
- Publicly available POS tagger
- Terminology for more than 15 languages
- List of on-line dictionaries (from lai.com)
- WWW Information on Computational Linguistics and Language Technology
- Resource for high-quality tools supporting multi-lingual communication
- Resource for professional-quality language translation tools.
- Linguistic Data Consortium, University of Pennsylvania
- NMSU Natural Language Processing Tools
- Kluwer Academic Publishers
- Knowledge Representation for Natural Language Processing in Implemented Systems
- James Allen - Natural Language Understanding (source code)
- Addison Wesley Longman higher education
- Course in Corpus Linguistics, Tony McEnery & Andrew Wilson
- Syntactic dependency parser for English
- Hansards Corpus - Searchable
- Annotated list of resources on statistical NLP and corpus-based CL
- Corpus of spoken Bulgarian
- 11-761 Language and Statistics, course at CMU, Spring 1997
- Common Lisp Hypermedia Server
- BNC Online Service
- Moby Database
- Texas A&M University Linguistics Course Listings
- Machine Learning Journal Special Issue on Natural Language Learning
- Le corpus BAF (French and English)
- Lexical FreeNet
- ETAI - Electronic Transactions on Artificial Intelligence
- KPML
- Miscellaneous Word Lists from Oxford University
- Name lists from US census
- Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)
- The IMS Corpus Toolbox Webpage
- Grammar Writer's Workbench for Lexical Functional Grammar
- Réacc - reaccenting software
- Probability Theory: The Logic Of Science (by E. T. Jaynes, Washington University, Saint Louis)
- Data Harmony, Document Management Software
- Brainhat Natural Language Processing
- Cranfield collection
- Medlars collection
- 1963 Time Magazine corpus
- The Moby Corpus
- Saarland University, Computational Linguistics
- The NLP Dictionary
- Speech and Language Processing, by Daniel Jurafsky and James Martin
- Korean morphological analyzer and part-of-speech tagger
- Russian Corpora
- ISI's version of the RSTTool
- Arbora Tree Delivery Service
- Journal of Intelligent Information Systems
- Information Retrieval (Journal)
- Journal of the American Society for Information Science
- ACM Transactions on Information Systems
- Information Processing and Management
- Kluwer series in Text, Speech and Language Technology
- Perl interface to WordNet
- Computational Linguistics
- The Childes Corpus - Children's language
- The Negra Corpus - German Syntax annotated
- Links to Linguistic and Related Information (University of Passau)
- Foundations of Computational Linguistics by Roland Hausser
- EuroWordNet
- Managing Gigabytes, by Witten, Moffat, and Bell
- Syntactic Theory: A Formal Introduction by Ivan Sag and Thomas Wasow
- Instructor's Manual for Syntactic Theory: A Formal Introduction
- Treebank tokenization scheme
- CS674: Natural Language Processing (Cornell U., Spring 2000)
- Delphes Technologies International, natural language processing.
- Delphes Technologies International
- German Morphology Browser
- ELSNET: Paper and Electronic Publications
- The ALPAC Report
- WEB-SLS: The European Student Journal of Language and Speech
- Machine Translation
- SIG-IRList Archives
- Brill Tagger (Supervised, Trainable)
- CMU Sphinx Group: Open Source Speech Recognition Engines
- Natural Language Processing for French
- MORLEX - A lexical database for French
- VERTEX - A chart parser for unification grammars (French)
- Artificial Intelligence NV (Ai)
- Brain and Language
- Journal of Memory and Language
- Journal of Phonetics
- Statistical Natural Language Processing: Models and Methods
- The Naming Company
- IntraText - The missing link between text and hypertext (TM)
- VisualText
- Opus, a commercial biology text mining system
- Sequence learning: Paradigms, Algorithms and Applications
- KURA 1.0
- The Mariano Silva y Aceves Series
- Agglutination on the Basis of Corpus Information
- The Language of Word Meaning
- REAL WORLD LINGUISTICS 101
- The Cross-Language Evaluation Forum
- COMPUTER SPEECH AND LANGUAGE
- Bigram Statistics Package
- POLYSEMY: Theoretical and Computational Approaches
- Arabic Newswire Part 1
- COPERNIC 2000
- KWiCFinder
- IMS Corpus Workbench (CWB)
- BRITISH NATIONAL CORPUS - WORLD EDITION
- LANGUAGE LEARNING CENTER - ACADEMIC CORPUS
- WEBCORP
- BNC Indexer
- The Internet Timelines Project
- Project: Pytalk
- Release of RSTTool: RSTTool 2.7
- The Rosetta Project
- Russian Phonetics on the Web
- NLP: User Modeling 2001
- Computer Speech and Language
- TransSearch
- ICOPOST
- Evolutionary Web Development
- The Ninth Text REtrieval Conference (TREC 9) Conference Proceedings
- TreeTagger - a language independent part-of-speech tagger
- John Bateman and Michael Zock's list of Natural Language Generation Systems
- Embedded MT Systems: Leveraging for Real World Applications
- Linguistic Interpretation of a German Corpus
- WEB-CONC
- English-Chinese Chinese-English Dictionary of Computer Terms
- The XTAG Project
- Natural Language Processing Systems
- Special Interest Group on Computational Semantics
- Workshop on Web-Based Language Documentation and Description Papers
- SOFTISSIMO
- Slovene-English Parallel Corpus
- Czech National Corpus
- Lecture Notes in Computer Science Vol. 1835
- Information Extraction Towards Scalable, Adaptable Systems
- Envisioning Machine Translation in the Information Future 4th Conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 10-14, 2000 Proceedings
- Text, Speech and Dialogue Third International Workshop, TSD 2000 Brno, Czech Republic, September 13-16, 2000 Proceedings
- CORPUS DEL ESPANOL
- Author/Institution Self-Archiving
- Software Tools for NLP
- Multilingual Text Tools and Corpora
- The PLUG Word Aligner - PWA
- Ramon Piero Center for Research
- IR resources
- Reuters Corpus
- TELRI Research Archive of Computational Tools and Resources
- ROBUSTNESS IN LANGUAGE AND SPEECH TECHNOLOGY
- 2000 NIST Speaker Recognition Evaluation Corpus
- HCRC Map Task Corpus XML annotations
- Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio
- Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio
- The CLAWS tagging service
- A Syntactically Annotated Corpus of German Newspaper Texts
- CLaRK System
- Resources for Text, Speech and Language Processing
- LEXICOGRAPHY AND THE OED: Pioneers in the Untrodden Forest
- HANDBOOK OF AUSTRALIAN LANGUAGES
- LFG Database: List of Names
- LINK GRAMMAR PARSER
- ONLINE LINGUISTICS JOURNAL
- Architectures and Mechanisms for Language Processing
- A Survey of Open Language Archives
- The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English
- Bilingual Speech: A Typology of Code-Mixing
- An Empirical Grammar of the English Verb System
- LANGUAGE LINKS
- The HALogen Natural Language Generation system
- BNCweb a web-based interface to the British National Corpus
- Bookmarks for Corpus-based Linguists
- Kiel University's Institute of Phonetics and Speech Processing
- Susanne: Annotated American English Corpus
- Telcordia Latent Semantic Indexing Demo Machine
- Search Tools for Web Sites and Intranets
- Automatic English Sentence Segmentation
- Module for splitting text into sentences
- SATZ--Adaptive Sentence Boundary Detector
- [1]
- a simple grammar of English
- Nexing Corpus
- Internet Grammar of English
- UCREL Semantic Analysis System
- The Java Open Source Spell Checker
- Aramedia
- Dictionaries for International Ispell
- Text Encoding Initiative --Tools
- The Java Open Source Spell Checker
- Canoo.net - German Dictionaries and Grammars
- Valencianlanguage.com
- Hebrew Morphological Parser
- Computational Linguistics, James Pustejovsky, Brandeis University
- Perl: extending your pos-tagger using regular expressions, Dan Jurafsky
- Hidden Markov Model Toolkit
- The Bank of Swedish - A Linguistic Reference Database of Göteborg University
- Alpino Treebank
- MT List
- Dialogue Diversity Corpus
- IJECE
- Language Identification Tools
- Turbo Lingo
- Moss - A System for Detecting Software Plagiarism
- Russian Newspaper Corpus
- GENIA Project Home Page
- LIBSVM -- A Library for Support Vector Machines
- JavaBayes - v0.346
- Bayes Net Toolbox for Matlab
- Bayesian Network tools in Java (BNJ)
- Computer Speech, Text and Internet Technology
- List of English stopwords
- COSMAS II
- Newspapers on the Internet
- Danish news corpus
- The Oslo Corpus of Bosnian Texts
- Finnish text bank
- Oxford Text Archive Corpus of Italian Newspapers
- GlossaNet
- perl concordancer
- UNITEX
- EuroWordNet
- Moss: A System for Detecting Software Plagiarism
- Russian Newspaper Corpus
- GENIA corpus version 3.0p
- LIBSVM: A Library for Support Vector Machines
- Torch3 (www.torch.ch)
- JavaBayes - version 0.346
- Bayes Net Toolbox for Matlab
- Bayesian Network tools in Java (BNJ)
- English stop words (from SMART)
- OpenRCT Home
- Lemur Toolkit Download
- iFind KBSim.com - Knowledge-Based Simulations, Inc.
- Collective (Chaotic - Emergent) Language
- Infogistics: NLProcessor Interactive Demo
- VISL - Visual Interactive Syntax Learning
- RST LaTeX (Reitter IT and Media)
- English Intonation in the British Isles - The IViE Corpus
- Electronic Text Center -- University of Virginia
- YourDictionary
- list of systems in multiple languages
- Alignment of bilingual corpora performed with EasyAlign
- Infogistics - NLProcessor Interactive Demo
- Kwicfinder
- Demos, University of Alberta, Canada
- Hansard French-English parallel corpus
- Learner Behaviour on the Internet
- Demos of dependency database, parser, and other tools
- Automatic Term Extraction System
- Mike Scott's Web - Wordsmith Tools
- Software for the Extraction of N-ary Textual Associations (SENTA)
- Studies in Language and Linguistics
- St. Jerome Publishing
- Leximancer
- Concollate
- The BNC Index (for the BNCWorld Edition)
- Rule Engine for the Java Platform
- HAITIAN CREOLE ELECTRONIC TEXTS
- Haitian Creole corpus - Teknoloji pou lang kreyol
- Natural Language Engineering
- Bancos de dados e Ferramentas de análise
- CEPRIL aligner
- CEPRIL - Portuguese Segmenter
- OPUS - an open source parallel corpus
- NLTK - Natural Language Toolkit
- Finite State Automata Utilities v6
- Protege Project
- Public registry of the Council of the EU
- Corpus of Spoken Professional English
- Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS
- alphabetic version of WordNet 2.0
- "Word Frequencies in Written and Spoken English: based on the British National Corpus."
- AMALGAM project
- books on computational semantics
- list of Japanese transitive - intransitive verb pairs
- Debian free software community
- French Foreign Ministry's magazine
- Multi-Paradigm Programming in Oz for NLP
- COMPARA corpus
- Natlanco
- Research-lab.com
- Visual Text - reference documentation
- UN declaration of human rights in multiple languages
- Web IR & IE
- Log-likelihood calculator
- The Dialogue Diversity Corpus
- Mapping WordNet Versions 1.6 and 2.0
- AMERICAN NATIONAL CORPUS FIRST RELEASE
- Multiword Expression Resources
- LingPipe
- kfNgram
- Exploring Words and Phrases from the British National Corpus
- Useful links about parallel corpora, by Olivier Kraif
- The LUCY Corpus - Documentation
- TreeTalk: Memory - Based Grapheme - Phoneme Conversion Demo
- NeXTeNS - Dutch Extension for Text to Speech
- English-Truespel (USA Accent) Text Conversion Tool
- README for the daemonized version of Collins' Parser
- The Lexical Semantics of a Machine Translation Interlingua
- Educational Research Abstracts
- Sanskrit Library
- Software - The chunklink script, by Sabine Buchholz
- Comprehensive Perl Archive Network
- Prosogram
- CREA
- Verbot preview 4.0
- Phrases in English
- Chilibot: NLP based miner for gene/protein/keyword relationships
- NEGRA Corpus
- Corpus del Espanol
- American English SpeechDat-Car
- Freelangonline - many on-line dictionaries + more
- TIGERSearch - tools for linguistic text exploration
- Lexical information for German
- ISI rewrite decoder
- Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
- Qanda: Open source question answering system
- FLEMMV31 - Inflectional morphology parser for French
- Wortlisten: spoken German, English, French, and Dutch
- VISL Tagger and Parser
- Cambridge Learner Dictionary
- Polish subcorpus of the International Corpus of Learner English
- Robot Karaoke
- Restricted English Corpus from Dr. Caroline Lyon for PhD
- Russian Corpus Site
- English Resource Grammar
- Linguist's Search Engine
- Phono - Sound Change Model Software
- Multext-East Project
- Kirrkirr 4.0 Dictionary Program
- Phrases in English and the British National Corpus
- Russicon Resources
- CPAN Lingua EN Sentence Splitter
- CPAN Lingua HE Sentence Splitter
- CPAN Suffix Tree Module
- CPAN Sentence Splitter
- WordNet Domains
- Log-likelihood calculator
- TREC Video Retrieval Evaluation Page
- Python Programming Tutorial
- IBM's Speech Recognition Modules
- CLaRK System
- Dan Bikel's Parser
- Virtual Language Centre\'s Web Concordancer
- Bilingual Dictionary French Arabic
- LinkGrammar-WN project
- EMILLE corpus
- Paper on Sentence Boundary Disambiguation
- Versioning Machine 2.0
- Russian Corpus Site
- Lacio Web Corpora
- Russian Corpus Page
- Wordsmyth Children's Dictionary
- AFGL Parser Generator
- Short intensive course: Texts, Discourse and Corpora: Corpora in Linguistics and Related Fields
- DTREG decision tree generator
- NAACL-Supported Two-Week Summer School in Human Language Technologies
- Omphalos Context-Free Language Learning Competition
- Senseval-3 Task: Automatic Labeling of Semantic Roles
- Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses
- NLSH: Natural Language Shell
- CSLI LinGO Lab (Stanford)
- UniNE stopword list for Portuguese
- Natural Language Processing / Information Retrieval Software Repository
- Web Term Document Frequency Form (Berkeley)
- 3rd North American Summer School in Logic, Language and Information
- IR and IE on the web
- Corpus building for minority languages
- 3rd NASSLLI: North American Summer School in Logic, Language and Information
- FreeLing 1.1
- ICE corpora
- Natural Language Processing for Online Applications
- Longdo Thai Dictionary