Difference between revisions of "Multilingual Corpora"
Jump to navigation
Jump to search
(New page: For individual languages, see List of resources by language. See also Multilingual resources. <!-- Please keep this list in alphabetical order --> *[http://wt.jrc.it/lt/Acquis/ ...) |
(+MultiUN corpora) |
||
(One intermediate revision by one other user not shown) | |||
Line 45: | Line 45: | ||
*[http://www.corpusdelespanol.org/ Spanish Corpus] | *[http://www.corpusdelespanol.org/ Spanish Corpus] | ||
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages] | *[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages] | ||
+ | *[http://www.euromatrixplus.net/multi-un/ UN parallel corpora] | ||
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX] | *[http://www-igm.univ-mlv.fr/~unitex/ UNITEX] | ||
*[http://www.u-grenoble3.fr/kraif/liens.htm Useful links about parallel corpora, by Olivier Kraif] | *[http://www.u-grenoble3.fr/kraif/liens.htm Useful links about parallel corpora, by Olivier Kraif] | ||
Line 50: | Line 51: | ||
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch] | *[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch] | ||
<!-- Please keep this list in alphabetical order --> | <!-- Please keep this list in alphabetical order --> | ||
+ | |||
+ | |||
+ | [[Category:Resources by language|Multilingual]] |
Revision as of 15:50, 10 December 2013
For individual languages, see List of resources by language.
See also Multilingual resources.
- ACQUIS COMMUNAUTAIRE Multilingual Corpus
- Bank of Swedish
- CLUVI Corpus (Galician-English-Spanish-French parallel corpus)
- Croatian National Corpus (HNK)
- Czech National Corpus (CNC)
- CELEX - The Dutch Center for Lexical Information
- Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS
- COMPARA corpus
- Debian free software community
- EMILLE corpus
- European Parliament Proceedings Parallel Corpus 1996-2003
- EuroWordNet
- French Foreign Ministry's magazine
- GlossaNet
- Haitian Creole corpus -Teknoloji pou lang kreyol
- Hungarian National Corpus
- Hansard French-English parallel corpus
- ICE corpora
- IPI PAN Corpus of Polish
- Learner Behaviour on the Internet
- Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads
- Le Monde Diplomatique-Die Tageszeitung Translation Corpus - French-German, aligned (parallel)
- MuchMore Springer Bilingual Corpus
- MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages
- Multilingual Corpora: Available Resources
- Tanaka Corpus: Japanese-English sentence pairs
- MultiSemCor
- Newspapers on the Internet
- OPUS - an open source parallel corpus
- Oslo Corpus of Bosnian
- PolyU Language Bank
- Portuguese Corpus
- Public registry of the Council of the EU
- Russian National Corpus (RNK)
- The Bible as a Resource for Translation Software
- The ECI Multilingual corpus
- Slovenian Corpus FIDA and FIDA+
- SMULTRON Corpus parallell treebank of English, German and Swedish
- Spanish Corpus
- UN declaration of human rights in multiple languages
- UN parallel corpora
- UNITEX
- Useful links about parallel corpora, by Olivier Kraif
- WaCky Project
- Wortlisten: spoken German, English, French, and Dutch