Difference between revisions of "Corpora for English"

From ACL Wiki
Jump to navigation Jump to search
Line 52: Line 52:
 
*[http://about.reuters.com/researchandstandards/corpus/ Reuters Corpus]
 
*[http://about.reuters.com/researchandstandards/corpus/ Reuters Corpus]
 
*[http://www.cs.unt.edu/~rada/downloads.html Romanian NLP]
 
*[http://www.cs.unt.edu/~rada/downloads.html Romanian NLP]
*[http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora]
 
*[http://rykov-cl.narod.ru/r.html Russian Corpora]
 
*[http://www.ruscorpora.ru/ Russian Corpus Page]
 
*[http://lib.ru/ Russian Corpus Site]
 
*[http://www.ng.ru Russian Corpus Site]
 
*[http://www.philol.msu.ru/~lex/corpus/ Russian Newspaper Corpus]
 
*[http://schools.keldysh.ru/uvk1838/Sciper/volume2/langres/russiclr.htm Russicon Resources]
 
 
*[http://sanskritlibrary.org/ Sanskrit Library]
 
*[http://sanskritlibrary.org/ Sanskrit Library]
*[http://korpus.juls.savba.sk/index.en.html Slovak National Corpus]
 
 
*[http://nl.ijs.si/elan/#corpus Slovene-English Parallel Corpus]
 
*[http://nl.ijs.si/elan/#corpus Slovene-English Parallel Corpus]
 
*[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio]
 
*[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio]
Line 114: Line 106:
  
 
==German corpora==
 
==German corpora==
 +
 
*[http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]
 
*[http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]
 
*[http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]
 
*[http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]
Line 119: Line 112:
  
 
==Multilingual corpora==
 
==Multilingual corpora==
 +
 
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
 
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
 
*[http://spraakbanken.gu.se/ Bank of Swedish]
 
*[http://spraakbanken.gu.se/ Bank of Swedish]
Line 152: Line 146:
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
*[http://korpus.juls.savba.sk/ Slovak National Corpus (SNK)]
 
 
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
 
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
 
*[http://www.corpusdelespanol.org/ Spanish Corpus]
 
*[http://www.corpusdelespanol.org/ Spanish Corpus]
Line 160: Line 153:
 
*[http://wacky.sslmit.unibo.it/ WaCky Project]
 
*[http://wacky.sslmit.unibo.it/ WaCky Project]
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
+
 
 +
==Russian==
 +
 
 +
*[http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora]
 +
*[http://rykov-cl.narod.ru/r.html Russian Corpora]
 +
*[http://www.ruscorpora.ru/ Russian Corpus Page]
 +
*[http://lib.ru/ Russian Corpus Site]
 +
*[http://www.ng.ru Russian Corpus Site]
 +
*[http://www.philol.msu.ru/~lex/corpus/ Russian Newspaper Corpus]
 +
*[http://schools.keldysh.ru/uvk1838/Sciper/volume2/langres/russiclr.htm Russicon Resources]
 +
 
 +
==Slovak==
 +
 
 +
*[http://korpus.juls.savba.sk/index.en.html Slovak National Corpus]

Revision as of 14:14, 2 November 2006

English corpora

German corpora

Multilingual corpora

Russian

Slovak