Difference between revisions of "Corpora (English)"

From ACL Wiki
Jump to: navigation, search
 
(English corpora)
Line 78: Line 78:
 
==English corpora==
 
==English corpora==
 
*[http://www.elda.fr/catalogue/en/speech/S0115.html American English SpeechDat-Car]
 
*[http://www.elda.fr/catalogue/en/speech/S0115.html American English SpeechDat-Car]
 +
*[http://americannationalcorpus.org/ American National Corpus (ANC)]
 
*[http://americannationalcorpus.org/FirstRelease/ AMERICAN NATIONAL CORPUS FIRST RELEASE]
 
*[http://americannationalcorpus.org/FirstRelease/ AMERICAN NATIONAL CORPUS FIRST RELEASE]
 +
*[http://compbio.uchsc.edu/ccp/corpora/index.shtml Biomedical corpora]
 
*[http://homepage.mac.com/bncweb/ BNCweb a web-based interface to the British National Corpus]
 
*[http://homepage.mac.com/bncweb/ BNCweb a web-based interface to the British National Corpus]
 
*[http://devoted.to/corpora Bookmarks for Corpus-based Linguists]
 
*[http://devoted.to/corpora Bookmarks for Corpus-based Linguists]
 
*[http://info.ox.ac.uk/bnc/ British National Corpus (from Oxford University)]
 
*[http://info.ox.ac.uk/bnc/ British National Corpus (from Oxford University)]
 +
*[http://www.natcorp.ox.ac.uk/ British National Corpus (BNC)]
 
*[http://www.comp.lancs.ac.uk/computing/research/ucrel/bnc.html British National Corpus project page (from UCREL)]
 
*[http://www.comp.lancs.ac.uk/computing/research/ucrel/bnc.html British National Corpus project page (from UCREL)]
 +
*[http://clwww.essex.ac.uk/w3c/corpus_ling/content/corpora/list/private/brown/brown.html Brown Corpus]
 +
*[http://www.collins.co.uk/books.aspx?group=154 Collins Wordbanks]
 
*[http://www.athel.com/corpdes.html Corpus of Spoken Professional English]
 
*[http://www.athel.com/corpdes.html Corpus of Spoken Professional English]
 
*[http://www-rcf.usc.edu/~billmann/diversity/DDivers-site.htm Dialogue Diversity Corpus]
 
*[http://www-rcf.usc.edu/~billmann/diversity/DDivers-site.htm Dialogue Diversity Corpus]
Line 90: Line 95:
 
*[http://www-personal.umich.edu/~jlawler/levin.html English Verb Classes And Alternations: A Preliminary Investigation (Index)]
 
*[http://www-personal.umich.edu/~jlawler/levin.html English Verb Classes And Alternations: A Preliminary Investigation (Index)]
 
*[http://usna.edu/LangStudy/BNC/ Exploring Words and Phrases from the British National Corpus]
 
*[http://usna.edu/LangStudy/BNC/ Exploring Words and Phrases from the British National Corpus]
 +
*[http://www.gutenberg.org/wiki/Main_Page Gutenberg]
 
*[http://nora.hd.uib.no/icame.html ICAME]
 
*[http://nora.hd.uib.no/icame.html ICAME]
 
*[http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/naive-bayes/bow-0.8/stopwords.c List of English stopwords]
 
*[http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/naive-bayes/bow-0.8/stopwords.c List of English stopwords]
Line 95: Line 101:
 
*[http://www.cs.cornell.edu/People/pabo/movie-review-data/ Movie Review Data]
 
*[http://www.cs.cornell.edu/People/pabo/movie-review-data/ Movie Review Data]
 
*[http://mwe.stanford.edu/resources/ Multiword Expression Resources]
 
*[http://mwe.stanford.edu/resources/ Multiword Expression Resources]
 +
*[http://www.askoxford.com/oec/mainpage/?view=uk Oxford English Corpus]
 
*[http://pie.usna.edu/ Phrases in English]
 
*[http://pie.usna.edu/ Phrases in English]
 
*[http://homepages.feis.herts.ac.uk/~comrcml/Lyon-thesis.ps Restricted English Corpus from Dr. Caroline Lyon for PhD]
 
*[http://homepages.feis.herts.ac.uk/~comrcml/Lyon-thesis.ps Restricted English Corpus from Dr. Caroline Lyon for PhD]
Line 104: Line 111:
 
*[http://www.grsampson.net/LucyDoc.html The LUCY Corpus - Documentation]
 
*[http://www.grsampson.net/LucyDoc.html The LUCY Corpus - Documentation]
 
*[http://www.cs.rochester.edu/research/cisd/resources/trains.html TRAINS Dialogue Corpus]
 
*[http://www.cs.rochester.edu/research/cisd/resources/trains.html TRAINS Dialogue Corpus]
 +
*[http://www.webcorp.org.uk/guide/ WebCorp]
  
 
==German corpora==
 
==German corpora==

Revision as of 14:06, 2 November 2006

English corpora

German corpora

Multilingual corpora