Difference between revisions of "Corpora, datasets, lexicons"
Line 5:
  * [http://clwww.essex.ac.uk/w3c/corpus_ling/content/corpora/list/private/brown/brown.html Brown Corpus]
  * [http://www.collins.co.uk/books.aspx?group=154 Collins Wordbanks]
+ * [http://devoted.to/corpora David Lee's Bookmarks for Corpus-based Linguists]
  * [http://www.askoxford.com/oec/mainpage/?view=uk Oxford English Corpus]
  * [http://www.webcorp.org.uk/guide/ WebCorp]
Revision as of 13:25, 19 October 2006
Corpora
- American National Corpus (ANC)
- British National Corpus (BNC)
- Brown Corpus
- Collins Wordbanks
- David Lee's Bookmarks for Corpus-based Linguists
- Oxford English Corpus
- WebCorp
Datasets
- Linguistic Data Consortium (LDC)
- MRC Psycholinguistic Database
- Reuters-21578 Text Categorization Collection
- WordSimilarity-353 Test Collection