Difference between revisions of "Knowledge collections and datasets (English)"

Revision as of 10:42, 19 November 2006

Datasets for Computational Linguistics and Natural Language Processing.

@@ Line 2: / Line 2: @@
 * [[Clustering by Committee]] - terms clustered and organized using the [[Distributional Hypothesis]]
-* [[DIRT Paraphrase Collection]]
+* [[DIRT Paraphrase Collection]] - Discovery of Inference Rules from Text
 * [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)]
 * [http://framenet.icsi.berkeley.edu/ FrameNet]
@@ Line 10: / Line 10: @@
 * [[Spam filtering datasets]]
 * [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms]
-* [[VerbOcean|VerbOcean - verbs organized by semantic relation, including temporal precedence, strength, etc.]]
+* [[VerbOcean]] - verbs organized by semantic relation, including temporal precedence and strength
 * [http://wordnet.princeton.edu/ WordNet]
 * [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection]