Difference between revisions of "Corpora for English"

From ACL Wiki
Jump to navigation Jump to search
Line 48: Line 48:
 
*[http://www.webcorp.org.uk/guide/ WebCorp]
 
*[http://www.webcorp.org.uk/guide/ WebCorp]
  
==Galician==
 
<!-- Please keep this list in alphabetical order -->
 
*[http://sli.uvigo.es/CLUVI/ Linguistic Corpus of the University of Vigo (CLUVI)]
 
*[http://sli.uvigo.es/CTG/ Technical Corpus of Galician (CTG)]
 
*[http://www.ti.usc.es/TILG/ Tesouro informatizado da lingua galega (TILG)]
 
 
==German==
 
<!-- Please keep this list in alphabetical order -->
 
 
*[http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]
 
*[http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]
 
*[http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus]
 
 
==Iranian==
 
<!-- Please keep this list in alphabetical order -->
 
 
*[http://ece.ut.ac.ir/DBRG/Bijankhan/ Bijankhan corpus]
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC96S50 CALLFRIEND Farsi (speech)]
 
*[http://ece.ut.ac.ir/dbrg/hamshahri/ Hamshahri corpus]
 
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat]
 
 
==Russian==
 
<!-- Please keep this list in alphabetical order -->
 
 
*[http://bokrcorpora.narod.ru Bokr Russian Reference Corpus]
 
*[http://www.slav.helsinki.fi/hanco/index_en.html HANCO: The Helsinki annotated corpus of Russian texts]
 
*[http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora]
 
*[http://rykov-cl.narod.ru/r.html Russian Corpora]
 
*[http://lib.ru/ Russian Corpus Site]
 
*[http://www.ruscorpora.ru/ The Russian National Corpus]
 
*[http://www.philol.msu.ru/~lex/corpus/ Russian Newspaper Corpus]
 
*[http://schools.keldysh.ru/uvk1838/Sciper/volume2/langres/russiclr.htm Russicon Resources]
 
  
 
==Slovak==
 
==Slovak==

Revision as of 20:19, 24 April 2008

For languages other than English, see List of resources by language.

English


Slovak

Italian

Link collections

Corpora tools

Uncategorized

Arabic

Bosnian

Bulgarian

Croatian

Czech

Danish

English

Finnish

French

German

Haitian Creole

Italian

Japanese

Polish

Romanian

Sanskrit

Slovenian

Spanish

Swahili