Difference between revisions of "Corpora (English)"

From ACL Wiki
Jump to: navigation, search
(English corpora)
(Multilingual corpora)
Line 120: Line 120:
 
==Multilingual corpora==
 
==Multilingual corpora==
 
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
 
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
 +
*[http://spraakbanken.gu.se/ Bank of Swedish]
 +
*[http://hnk.ffzg.hr/ Croatian National Corpus (HNK)]
 +
*[http://ucnk.ff.cuni.cz/ Czech National Corpus (CNC)]
 
*[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
 
*[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
 
*[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS]
 
*[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS]
Line 130: Line 133:
 
*[http://glossa.fltr.ucl.ac.be/ GlossaNet]
 
*[http://glossa.fltr.ucl.ac.be/ GlossaNet]
 
*[http://hometown.aol.com/mit2haiti/JA-HC-kr.htm Haitian Creole corpus -Teknoloji pou lang kreyol]
 
*[http://hometown.aol.com/mit2haiti/JA-HC-kr.htm Haitian Creole corpus -Teknoloji pou lang kreyol]
 +
*[http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
 
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
 
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
 +
*[http://korpus.pl/ IPI PAN Corpus of Polish]
 
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
 
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
 
*[http://muchmore.dfki.de/resources1.htm MuchMore Springer Bilingual Corpus]
 
*[http://muchmore.dfki.de/resources1.htm MuchMore Springer Bilingual Corpus]
 
*[http://nl.ijs.si/ME/ MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages]
 
*[http://nl.ijs.si/ME/ MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages]
 
*[http://tcc.itc.it/people/forner/multilingualcorpora.html Multilingual Corpora: Available Resources]
 
*[http://tcc.itc.it/people/forner/multilingualcorpora.html Multilingual Corpora: Available Resources]
 +
* [http://www.csse.monash.edu.au/~jwb/tanakacorpus.html Tanaka Corpus: Japanese-English sentence pairs]
 
*[http://multisemcor.itc.it MultiSemCor]
 
*[http://multisemcor.itc.it MultiSemCor]
 
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
 
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
 
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
 
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
 +
*[http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
 
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
 
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
 +
*[http://www.corpusdoportugues.org/ Portuguese Corpus]
 
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
 
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
 +
*[http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 +
*[http://korpus.juls.savba.sk/ Slovak National Corpus (SNK)]
 +
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
 +
*[http://www.corpusdelespanol.org/ Spanish Corpus]
 
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
 
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
 
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX]
 
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX]

Revision as of 13:10, 2 November 2006

English corpora

German corpora

Multilingual corpora