Difference between revisions of "Corpora, datasets, lexicons"
Line 21:
  * [http://clipdemos.umiacs.umd.edu/catvar/ Catvar 2.0 -- The Categorial Variation Database]
+ * [http://xwn.hlt.utdallas.edu/ eXtended WordNet]
  * [http://wordnet.princeton.edu/ WordNet]
Revision as of 13:29, 19 October 2006
Corpora
- American National Corpus (ANC)
- British National Corpus (BNC)
- Brown Corpus
- Collins Wordbanks
- David Lee's Bookmarks for Corpus-based Linguists
- Gutenberg
- Oxford English Corpus
- WebCorp
Datasets
- Linguistic Data Consortium (LDC)
- MRC Psycholinguistic Database
- Reuters-21578 Text Categorization Collection
- University of South Florida Free Association Norms
- WordSimilarity-353 Test Collection