Resources for English
From ACL Wiki
Revision as of 17:31, 26 October 2006 by
Ahakim
(
talk
|
contribs
)
(
→ONLINE
)
(
diff
)
← Older revision
|
Latest revision
(
diff
) |
Newer revision →
(
diff
)
Jump to navigation
Jump to search
Contents
1
BIBLIOGRAPHY
1.1
BIBLIOGRAPHY - SEARCHABLE
2
BOOKS
2.1
BOOKS - ONLINE
2.2
BOOKS - PUBLISHERS
3
COMPREHENSIVE
4
CORPORA
4.1
CORPORA - ENGLISH
4.2
CORPORA - GERMAN
4.3
CORPORA - MULTILINGUAL
5
COURSES
6
DICTIONARY
7
FTP
8
JOURNAL
9
LANGUAGE
9.1
LANGUAGE - LATIN
10
MAILING
11
ONLINE
12
PAPERS
13
SOFTWARE
13.1
SOFTWARE - APPLICATIONS
13.2
SOFTWARE - DATA
13.3
SOFTWARE - FONTS
13.4
SOFTWARE - FORMALISM
13.5
SOFTWARE - GENERATION
13.6
SOFTWARE - KNREP
13.7
SOFTWARE - MISC
13.8
SOFTWARE - MORPHOLOGY
13.9
SOFTWARE - MT
13.10
SOFTWARE - MULTI
13.11
SOFTWARE - MULTILINGUAL
13.12
SOFTWARE - PHONOLOGY
13.13
SOFTWARE - SEMANTICS
13.14
SOFTWARE - SPEECH
13.15
SOFTWARE - SYNTAX
13.16
SOFTWARE - TOOLS
14
TOOLS
15
UNCATEGORIZED
BIBLIOGRAPHY
Bibliography and Useful Links for Data-Driven Language Learning: The Uses of Concordancing in Advanced Language Learning and Teaching
Bibliography for Phonetics/Speech Technology
Bibliography to the book "Artificial Intelligence: A Modern Approach by Russell and Norvig
ELSNET: Paper and Electronic Publications
BIBLIOGRAPHY - SEARCHABLE
Bibliographic Search Page, Univ. of Essex
Bibliography of Computational Linguistics (from Germany)
Bibliography of Natural Language Generation (from Germany)
Bibliography of Research in Natural Language Generation
Bibliography of spech processing and recognition(from Germany)
Miscellaneous bibliographies in AI (including NLP)
Phraseology Bibliography
BOOKS
Netlab: Algorithms for Pattern Recognition
"Word Frequencies in Written and Spoken English: based on the British National Corpus."
Bilingual Speech: A Typology of Code-Mixing
books on computational semantics
CLIN IV Proceedings (Comp. Ling. in the Netherlands)
Instructor's Manual for Syntactic Theory: A Formal Introduction
James Allen - Natural Language Understanding (source code)
Kluwer Academic Publishers
Kluwer series in Text, Speech and Language Technology
Managing Gigabytes, by Witten, Moffat, and Bell
Natural Language Computing: An English Generative Grammar in Prolog
Natural Language Processing for Online Applications
Natural Language Processing for Online Applications
NLP: User Modeling 2001
POLYSEMY: Theoretical and Computational Approaches
Sequence learning: Paradigms, Algorithms and Applications
Survey of the State of the Art of Human Language Technology
Syntactic Theory: A Formal Introduction by Ivan Sag and Thomas Wasow
The Language of Word Meaning
The Omicron Inforium
Universal Grammar in Prolog
BOOKS - ONLINE
Evolutionary Web Development
Geometry and Meaning: Companion Website
ROBUSTNESS IN LANGUAGE AND SPEECH TECHNOLOGY
The Ninth Text REtrieval Conference (TREC 9) Conference Proceedings
Architectures and Mechanisms for Language Processing
Common Lisp - the language (by Guy L. Steele)
Envisioning Machine Translation in the Information Future 4th Conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexi
HANDBOOK OF AUSTRALIAN LANGUAGES
Information Extraction Towards Scalable, Adaptable Systems
IntraText - The missing link between text and hypertext (TM)
Lecture Notes in Computer Science Vol. 1835
Lecture Notes in Computer Science Vol. 1835
LEXICOGRAPHY AND THE OED: Pioneers in the Untrodden Forest
On-line books (not in NLP)
Project Gutenberg
Text, Speech and Dialogue Third International Workshop, TSD 2000 Brno, Czech Republic, September 13-16, 2000 Proceedings
The ALPAC Report
The Lexical Semantics of a Machine Translation Interlingua
BOOKS - PUBLISHERS
Addison Wesley Longman higher education
Miscellaneous Publishers available on the World-Wide Web
Cambridge University Press
Cascadilla Press
Elsevier Science
Miscellaneous Publishers available on the World-Wide Web
MIT Press
St. Jerome Publishing
Studies in Language and Linguistics
COMPREHENSIVE
CAQDAS Comparison
-
Computational Morphology and Phonology (at Summer Institute for Linguistics)
Lexicon Acquisition, Development, and Analysis
Linguistic Data Consortium, University of Pennsylvania
Linguistics Resources (from SIL)
Linguistics resources list at Princeton University
Links to Linguistic and Related Information (University of Passau)
List of resources (at University of Stuttgart)
List of resources (at University of Toronto)
Meta index of linguistics resources (from Rick Wojcik)
Natural Language Corpora and Dictionaries (at CMU)
Natural Language Interfaces (NLIs) on the Web
Natural Language Interfaces (NLIs) on the Web
NLP page at MIT
Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources
The human languages page
UCI Machine Learning Repository
WWW Information on Computational Linguistics and Language Technology
Yahoo! index on Human Languages and Linguistics
Yahoo! index on Natural Language Processing
CORPORA
1963 Time Magazine corpus
2000 NIST Speaker Recognition Evaluation Corpus
A Syntactically Annotated Corpus of German Newspaper Texts
A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)
Alpino Treebank
An Empirical Grammar of the English Verb System
Annotated list of resources on statistical NLP and corpus-based CL
AOT
Arabic Newswire Part 1
Base Textuelle de Moyen Francais
BNC Online Service
Bokr Russian Reference Corpus
BRITISH NATIONAL CORPUS - WORLD EDITION
Collections of texts and corpora
Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular
CORPUS DEL ESPANOL
Corpus del Espanol
Corpus del Espanol
Corpus of spoken Bulgarian
Corpus Resources (Chulalongkorn University, Thailand)
Cranfield collection
CREA
CREA
Czech National Corpus
Danish news corpus
Edinburgh Associative Thesaurus (EAT)
EuroWordNet
Experimental Corpus Query System (University of Stuttgart, Germany)
Finnish text bank
GENIA corpus version 3.0p
HAITIAN CREOLE ELECTRONIC TEXTS
Hansards Corpus - Searchable
HCRC Map Task Corpus XML annotations
Helsinki Corpus of Swahili (HCS)
ICOPOST
IMS Corpus Toolbox, Univ. of Stuttgart
IMS Corpus Workbench (CWB)
International Corpus of Learner English
IPI PAN Polish Corpus
Kiel University's Institute on Phonetics and Speech Procesing
Lacio Web Corpora
LANGUAGE LEARNING CENTER - ACADEMIC CORPUS
Le corpus BAF (French and English)
list of Japanese transitive - intransitive verb pairs
List of stop words
Manuel Barbera: General Corpora and Corpus Linguistics Resources
Medlars collection
Miscellaneous Word Lists from Oxford University
Multilingual Text Tools and Corpora
Name lists from US census
Nexing Corpus
On-line books at CMU
OPUS -- An Open Source Parallel Corpus
Oxford Text Archive Corpus of Italian Newspapers
Parallel Texts of Hong Kong Laws
Polish subcorpus of the International Corpus of Learner English
Ramon Piero Center for Research
Reuters Corpus
Romanian NLP
Russian Corpora
Russian Corpora
Russian Corpus Page
Russian Corpus Site
Russian Corpus Site
Russian Newspaper Corpus
Russian Newspaper Corpus
Russicon Resources
Sanskrit Library
Slovene-English Parallel Corpus
Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio
Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio
Survey of Electronic Corpora (by Jane A. Edwards, file at CMU)
Survey of English Usage, University College, London
Switchboard Transcription Project
TELRI Research Archive of Computational Tools and Resources
Terminology for more than 15 languages
The Childes Corpus - Children's language
The CORPORA DataCenter (Norway)
The Moby Corpus
The Oslo Corpus of Bosnian Texts
The Sketch Engine
The Sofie Treebank - A Parallel Treebank of North European Languages
Treebank tokenization scheme
CORPORA - ENGLISH
American English SpeechDat-Car
American English SpeechDat-Car
AMERICAN NATIONAL CORPUS FIRST RELEASE
BNCweb a web-based interface to the British National Corpus
Bookmarks for Corpus-based Linguists
British National Corpus (from Oxford University)
British National Corpus project page (from UCREL)
Corpus of Spoken Professional English
Dialogue Diversity Corpus
Electronic Text Center -- University of Virginia
English Intonation in the British Isles -The IViE Corpus
English stop words (from SMART)
English Verb Classes And Alternations: A Preliminary Investigation (Index)
Exploring Words and Phrases from the British National Corpus
GENIA Project Home Page
ICAME
List of English stopwords
Mapping WordNet Versions 1.6 and 2.0
Movie Review Data
Multiword Expression Resources
Phrases in English
Phrases in English
Restricted English Corpus from Dr. Caroline Lyon for PhD
Sketch Engine
Susanne: Annotated American English Corpus
The BNC Index (for the BNCWorld Edition)
The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English
The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English
The Dialogue Diversity Corpus
The LUCY Corpus - Documentation
TRAINS Dialogue Corpus
CORPORA - GERMAN
Bavarian Archive for Speech Signals Corpora
COSMAS II
NEGRA Corpus
NEGRA Corpus
Saarland University, Computational Linguistics
The Negra Corpus - German Syntax annotated
CORPORA - MULTILINGUAL
ACQUIS COMMUNAUTAIRE Multilingual Corpus
CELEX - The Dutch Center for Lexical Information
Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS
COMPARA corpus
Debian free software community
EMILLE corpus
European Parliament Proceedings Parallel Corpus 1996-2003
EuroWordNet
French Foreign Ministry's magazine
GlossaNet
Haitian Creole corpus -Teknoloji pou lang kreyol
Hansard French-English parallel corpus
ICE corpora
Learner Behaviour on the Internet
MuchMore Springer Bilingual Corpus
MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages
Multilingual Corpora: Available Resources
MultiSemCor
Newspapers on the Internet
OPUS - an open source parallel corpus
PolyU Language Bank
Public registry of the Council of the EU
The Bible as a Resource for Translation Software
The ECI Multilingual corpus
UN declaration of human rights in multiple languages
UNITEX
Useful links about parallel corpora, by Olivier Kraif
WaCky Project
Wortlisten: spoken German, English, French, and Dutch
Wortlisten: spoken German, English, French, and Dutch
COURSES
11-761 Language and Statistics, course at CMU, Spring 1997
3rd NASSLLI: North American Summer School in Logic, Language and Information
3rd North American Summer School in Logic, Language and Information
ALTSS 2004: Australasian Language Technology Summer School
Computational Linguistics, James Pustejovsky, Brandeis University
Computer Speech, Text and Internet Technology
Course in Corpus Linguistics, Tony McEnery & Andrew Wilson
CS674: Natural Language Processing (Cornell U., Spring 2000)
Databases and Corpora for Chinese Linguistic Research
ESSLLI 2005 - 17th European Summer School in Logic, Language and Information
Foundations of Computational Linguistics by Roland Hausser
Human Language Technology (Cornell CS630, Spring 2006)
Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes
Internet Grammar of English
Multi-Paradigm Programming in Oz for NLP
NAACL-Supported Two-Week Summer School in Human Language Technologies
NAACL-Supported Two-Week Summer School in Human Language Technologies
Perl: extending your pos-tagger using regular expressions, Dan Jurafsky
Probability Theory: The Logic Of Science (by E. T. Jaynes, Washington University, Saint Louis)
REAL WORLD LINGUISTICS 101
Russell and Norvig - AI
Short intensive course: Texts, Discourse and Corpora: Corpora in Linguistics and Related Fields
Speech and Language Processing, by Daniel Jurafsky and James Martin
Statistical Natural Language Processing: Models and Methods
Texas A&M University Linguistics Course Listings
DICTIONARY
Alternative dictionaries
Bilingual Dictionary French Arabic
Cambridge Learner Dictionary
Canoo.net - German Dictionaries and Grammars
Canoo.net: Free German Language Resources
CMU pronunciation dictionary
Comlex Syntax (Syntactic Dictionary of English)
CUVPlus -- Oxford Text Archive
Deutsch-Spanisch Wˆrterbuch (German-Spanish Dictionary)
Dictionary site, Bucknell University
ECHO - Eurodicautom (multilingual technical dictionary)
English to Estonian
FreeDict Downloads
Kirrkirr 4.0 Dictionary Program
LEO: Deutsche-Englisches Wˆrterbuch (German-English Dictionary)
Lexical information for German
Lexical information for German
Linguistics Glossary (from SIL)
List of on-line dictionaries (from lai.com)
Longdo Thai Dictionary
On-line dictionaries (from Univ. of Hannover)
Swedish to English
Swedish to Finnish
The NLP Dictionary
Wordsmyth Children's Dictionary
YourDictionary
FTP
Xerox PARC FTP site.
JOURNAL
Embedded MT Systems: Leveraging for Real World Applications
Natural Language Engineering
ACM Transactions on Information Systems
Cognition, a journal from Elsevier Science
Computational Linguistics
Computational Linguistics (ACL journal)
ETAI - Electronic Transactions on Artificial Intelligence
ICAME Journal
IJECE
Information Processing and Management
Information Retrieval (Journal)
Journal of Artificial Intelligence Research (JAIR)
Journal of Intelligent Information Systems
Journal of Logic and Computation
Journal of Natural Language Engineering
Journal of the American Society for Information Science
Knowledge Representation for Natural Language Processing in Implemented Systems
Language, Journal of the Linguistic Society of America
Machine Learning Journal Special Issue on Natural Language Learning
Machine Translation
ONLINE LINGUISTICS JOURNAL
WEB-SLS: The European Student Journal of Language and Speech
LANGUAGE
Basic Arabic Processing Tools
Dunglish
French Stopword List
Hebrew Spellchecker
Le TrÈsor de la Langue Langue d'Oc
Lexique Morphalou
Malay Concordance Project
Names Files of Selected Countries
REAP Project: Reader-Specific Lexical Practice for Improved Reading Comprehension
Syllable-Level Conversational English Transcriptions
The Bank of Swedish - A Linguistic Reference Database of Göteborg University
The Mariano Silva y Aceves Series
UniNE stopword list for Portuguese
United States Geographic Names
Valencianlanguage.com
LANGUAGE - LATIN
Latin Home Page
MAILING
HPSG Mailing List
IR list
MT List
Natural Semantic Metalanguage List
SIG-IRList Archives
The CORPORA list
ONLINE
A Survey of Open Language Archives
ACL SIGGEN Resources Wiki
AFGL Parser Generator
Agglutination on the Basis of Corpus Information
Algorithms for Linguistic Processing
Artificial Intelligence NV (Ai)
Author/Institution Self-Archiving
Chinese Computing
COPERNIC 2000
CorpÛgrafo
Dan Bikel's Parser
Detecting Text Boundaries
Educational Research Abstracts
Emotional Databases
English Resource Grammar
English-Chinese Chinese-English Dictionary of Computer Terms
Freelangonline - many on-line dictionaries + more
Freelangonline - many on-line dictionaries + more
Freelangonline - many on-line dictionaries + more
Information Retrieval
IR resources
Korean Accented English Pronunciation Simulator
KwicFinder Web Concordancer and Online Research Tool
LANGUAGE LINKS
LANGUAGE LINKS
Lexical FreeNet
LFG Database: List of Names
Linguist's Search Engine
Linguistic Interpretation of a German Corpus
LINK GRAMMAR PARSER
LinkGrammar-WN project
MRC Psycholinguistic Database
Multext East Resources, Version 3
Multext-East Project
MultiWordNet
Natural Language Processing / Information Retrieval Software Repository
NLSH: Natural Language Shell
Omphalos Context-Free Language Learning Competition
Online Business Letter Corpus KWIC Concordancer
Parse Evaluation
Phrases in English and the British National Corpus
PyGoogle: A Python Interface to the Google API
Python Programming Tutorial
Query to Internet Corpora
Regular Expression Exercises
Resources for English-Chinese CLIR
Resources for Text, Speech and Language Processing
Rhetorical Structure Theory (RST)
Russian Phonetics on the Web
Senseval-3 Task: Automatic Labeling of Semantic Roles
Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses
Sentence Alignment and Word Alignment: Projects, Papers, Evaluation, Etc.
SIG5 OntoWeb
Simple Search of BNC-World
Special Interest Group on Computational Semantics
SUSANNE Analytic Scheme
Telemakus: Mining and Mapping Research Findings to Promote Knowledge Discovery
The Cross-Language Evaluation Forum
The Internet Timelines Project
The John Bateman and Michael Zock's list of Natural Language Generation Systems
The Rosetta PrOject
The XTAG Project
TransSearch
TREC Video Retrieval Evaluation Page
TreeTagger - a language independent part-of-speech tagger
Using BioMed Central's Open Access Full Text Corpus for Data Mining Research
VIEW: Variation in English Words and Phrases
VISL Tagger and Parser
Web IR & IE
Web Term Document Frequency Form (Berkeley)
WEB-CONC
WEBCORP
WebExp2 Experimental Software
Word Frequencies in Written and Spoken English (Based on the British National Corpus)
WordNet Domains
Workshop on Web-Based Language Documentation and Description Papers
World Language Mapping System
XAIRA (XML Aware Indexing and Retrieval Architecture)
PAPERS
SOFTWARE
SOFTWARE - APPLICATIONS
SOFTWARE - DATA
SOFTWARE - FONTS
SOFTWARE - FORMALISM
SOFTWARE - GENERATION
SOFTWARE - KNREP
SOFTWARE - MISC
SOFTWARE - MORPHOLOGY
SOFTWARE - MT
SOFTWARE - MULTI
SOFTWARE - MULTILINGUAL
SOFTWARE - PHONOLOGY
SOFTWARE - SEMANTICS
SOFTWARE - SPEECH
SOFTWARE - SYNTAX
SOFTWARE - TOOLS
TOOLS
UNCATEGORIZED
[1]
ARIES Natural Language Tools
Language and Linguistic Science information sources
The RELATOR language resources server
Navigation menu
Personal tools
Log in
Request account
Namespaces
Page
Discussion
Variants
Views
Read
View source
View history
More
Search
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information