Difference between revisions of "Knowledge collections and datasets (English)"
Jump to navigation
Jump to search
m |
|||
Line 1: | Line 1: | ||
Datasets for Computational Linguistics and Natural Language Processing. | Datasets for Computational Linguistics and Natural Language Processing. | ||
− | * [[Clustering by Committee]] - terms clustered and organized using the Distributional Hypothesis | + | * [[Clustering by Committee]] - terms clustered and organized using the [[Distributional Hypothesis]] |
* [[DIRT Paraphrase Collection]] | * [[DIRT Paraphrase Collection]] | ||
* [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)] | * [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)] |
Revision as of 17:49, 16 November 2006
Datasets for Computational Linguistics and Natural Language Processing.
- Clustering by Committee - terms clustered and organized using the Distributional Hypothesis
- DIRT Paraphrase Collection
- Edinburgh Associative Thesaurus (EAT)
- FrameNet
- MRC Psycholinguistic Database
- Noun Compound Repository
- Reuters-21578 Text Categorization Collection
- University of South Florida Free Association Norms
- VerbOcean - verbs organized by semantic relation, including temporal precedence, strength, etc.
- WordNet
- WordSimilarity-353 Test Collection