Difference between revisions of "Corpora for English"

From ACL Wiki
(+MultiUN corpora)
(HamleDT)
Line 27:
 
*[http://gmb.let.rug.nl Groningen Meaning Bank] semantically annotated corpus
 
*[http://www.gutenberg.org/wiki/Main_Page Gutenberg]
 
 +
*[http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks for many languages in a common annotation style.
 
*[http://prize.hutter1.net/ Hutter Prize for Lossless Compression of Human Knowledge 100M sample of Wikipedia]
 
*[http://nora.hd.uib.no/icame.html ICAME]
 

Revision as of 08:41, 26 May 2014

For languages other than English, see List of resources by language.


Link collections

Corpora tools