Difference between revisions of "Knowledge collections and datasets (English)"

From ACL Wiki
(Added spam filtering datasets.)
Line 2:
  
 
* [[Clustering by Committee]] - terms clustered and organized using the [[Distributional Hypothesis]]
- * [[DIRT Paraphrase Collection]]
+ * [[DIRT Paraphrase Collection]] - Discovery of Inference Rules from Text
 
* [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)]
* [http://framenet.icsi.berkeley.edu/ FrameNet]
Line 10:
 
* [[Spam filtering datasets]]
* [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms]
- * [[VerbOcean|VerbOcean - verbs organized by semantic relation, including temporal precedence, strength, etc.]]
+ * [[VerbOcean]] - verbs organized by semantic relation, including temporal precedence and strength
 
* [http://wordnet.princeton.edu/ WordNet]
* [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection]

Revision as of 09:42, 19 November 2006

Datasets for Computational Linguistics and Natural Language Processing.

Additional Dataset Collections