Corpora, datasets, lexicons

From ACL Wiki

Revision as of 06:45, 2 November 2006 by Pdturney (talk | contribs) (→‎Corpora)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to navigation Jump to search

Miscellaneous

Resources

Corpora

English

(alphabetical order)

Multilingual

(alphabetical order)

Other lists of corpora

(alphabetical order)

David Lee's Bookmarks for Corpus-based Linguists

Datasets

Lexicons

WordNet - the original
- eXtended WordNet - glosses are syntactically parsed, transformed into logic forms, and content words are semantically disambiguated
- WordNet Domains - augmented with Domain Labels, such as POLITICS, ECONOMY, SPORT
- SentiWordNet - assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity

Retrieved from "https://aclweb.org/aclwiki/index.php?title=Corpora,_datasets,_lexicons&oldid=2238"