Difference between revisions of "Knowledge collections and datasets (English)"
Jump to navigation
Jump to search
(Added spam filtering datasets.) |
m |
||
Line 2: | Line 2: | ||
* [[Clustering by Committee]] - terms clustered and organized using the [[Distributional Hypothesis]] | * [[Clustering by Committee]] - terms clustered and organized using the [[Distributional Hypothesis]] | ||
− | * [[DIRT Paraphrase Collection]] | + | * [[DIRT Paraphrase Collection]] - Discovery of Inference Rules from Text |
* [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)] | * [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)] | ||
* [http://framenet.icsi.berkeley.edu/ FrameNet] | * [http://framenet.icsi.berkeley.edu/ FrameNet] | ||
Line 10: | Line 10: | ||
* [[Spam filtering datasets]] | * [[Spam filtering datasets]] | ||
* [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms] | * [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms] | ||
− | * [[VerbOcean | + | * [[VerbOcean]] - verbs organized by semantic relation, including temporal precedence and strength |
* [http://wordnet.princeton.edu/ WordNet] | * [http://wordnet.princeton.edu/ WordNet] | ||
* [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection] | * [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection] |
Revision as of 09:42, 19 November 2006
Datasets for Computational Linguistics and Natural Language Processing.
- Clustering by Committee - terms clustered and organized using the Distributional Hypothesis
- DIRT Paraphrase Collection - Discovery of Inference Rules from Text
- Edinburgh Associative Thesaurus (EAT)
- FrameNet
- MRC Psycholinguistic Database
- Noun Compound Repository
- Reuters-21578 Text Categorization Collection
- Spam filtering datasets
- University of South Florida Free Association Norms
- VerbOcean - verbs organized by semantic relation, including temporal precedence and strength
- WordNet
- WordSimilarity-353 Test Collection