Difference between revisions of "Corpora, datasets, lexicons"
Jump to navigation
Jump to search
Kevin.Cohen (talk | contribs) (→Corpora: Added a link for the biomedical corpora site at UCHSC) |
Kevin.Cohen (talk | contribs) m (→Corpora: Argh--fixed a stupid formatting error.) |
||
Line 6: | Line 6: | ||
* [http://americannationalcorpus.org/ American National Corpus (ANC)] | * [http://americannationalcorpus.org/ American National Corpus (ANC)] | ||
− | * [http://compbio.uchsc.edu/ccp/index.shtml | + | * [http://compbio.uchsc.edu/ccp/corpora/index.shtml Biomedical corpora] |
* [http://www.natcorp.ox.ac.uk/ British National Corpus (BNC)] | * [http://www.natcorp.ox.ac.uk/ British National Corpus (BNC)] | ||
* [http://clwww.essex.ac.uk/w3c/corpus_ling/content/corpora/list/private/brown/brown.html Brown Corpus] | * [http://clwww.essex.ac.uk/w3c/corpus_ling/content/corpora/list/private/brown/brown.html Brown Corpus] |
Revision as of 17:18, 30 October 2006
Miscellaneous
Corpora
- American National Corpus (ANC)
- Biomedical corpora
- British National Corpus (BNC)
- Brown Corpus
- Collins Wordbanks
- David Lee's Bookmarks for Corpus-based Linguists
- Gutenberg
- Oxford English Corpus
- WebCorp
Datasets
- Edinburgh Associative Thesaurus (EAT)
- Linguistic Data Consortium (LDC)
- MRC Psycholinguistic Database
- Noun Compound Repository
- Reuters-21578 Text Categorization Collection
- University of South Florida Free Association Norms
- WordSimilarity-353 Test Collection