Corpora, datasets, lexicons

From ACL Wiki
Revision as of 06:45, 2 November 2006 by Pdturney (talk | contribs) (→‎Corpora)
Jump to navigation Jump to search

Miscellaneous

Corpora

English

(alphabetical order)

Multilingual

(alphabetical order)

Other lists of corpora

(alphabetical order)

Datasets

Lexicons

  • WordNet - the original
    • eXtended WordNet - glosses are syntactically parsed, transformed into logic forms, and content words are semantically disambiguated
    • WordNet Domains - augmented with Domain Labels, such as POLITICS, ECONOMY, SPORT
    • SentiWordNet - assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity