Knowledge collections and datasets (English)

From ACL Wiki

Revision as of 13:46, 3 January 2007 by Jfan (talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to navigation Jump to search

The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Datasets for Computational Linguistics and Natural Language Processing.

Clustering by Committee - terms clustered and organized using the Distributional Hypothesis
DIRT Paraphrase Collection - Discovery of Inference Rules from Text
Edinburgh Associative Thesaurus (EAT)
FrameNet
MRC Psycholinguistic Database
Noun Compound Repository
Reuters-21578 Text Categorization Collection
Spam filtering datasets
TEASE - Acquisition of Entailment Relations from the Web
University of South Florida Free Association Norms
VerbOcean - verbs organized by semantic relation, including temporal precedence and strength
WordNet
WordSimilarity-353 Test Collection

Additional Dataset Collections

Linguistic Data Consortium (LDC)

Retrieved from "https://aclweb.org/aclwiki/index.php?title=Knowledge_collections_and_datasets_(English)&oldid=3294"

Knowledge Collections and Datasets