Multilingual resources
Revision as of 00:41, 10 June 2010 by Gparis (talk | contribs) (→Bilingual and Multilingual Dictionaries)
For individual languages, see List of resources by language.
General Multilingual Resources
- OmegaWiki - a collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
Bilingual and Multilingual Dictionaries
- Bilingual Dictionary French Arabic
- Croatian Old Dictionary Portal - University of Zagreb
- JMdict: Japanese-Multilingual Dictionary file
- LEO: Deutsche-Englisches Woerterbuch - German-English Dictionary
- Longdo Thai Multilingual Dictionary
- Swedish to English
- Deutsch-Spanisch Woerterbuch - German-Spanish Dictionary
- English to Estonian
- ECHO - Eurodicautom - multilingual technical dictionary
Corpora
- ACQUIS COMMUNAUTAIRE Multilingual Corpus
- Bank of Swedish
- CLUVI Corpus (Galician-English-Spanish-French parallel corpus)
- Croatian National Corpus (HNK)
- Czech National Corpus (CNC)
- CELEX - The Dutch Center for Lexical Information
- Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS
- COMPARA corpus
- Debian free software community
- EMILLE corpus
- European Parliament Proceedings Parallel Corpus 1996-2003
- EuroWordNet
- French Foreign Ministry's magazine
- GlossaNet
- Haitian Creole corpus -Teknoloji pou lang kreyol
- Hungarian National Corpus
- Hansard French-English parallel corpus
- ICE corpora
- IPI PAN Corpus of Polish
- Learner Behaviour on the Internet
- Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads
- Le Monde Diplomatique-Tageszeitung Parallel Corpus
- MuchMore Springer Bilingual Corpus
- MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages
- Multilingual Corpora: Available Resources
- Tanaka Corpus: Japanese-English sentence pairs
- MultiSemCor
- Newspapers on the Internet
- OPUS - an open source parallel corpus
- Oslo Corpus of Bosnian
- PolyU Language Bank
- Portuguese Corpus
- Public registry of the Council of the EU
- Russian National Corpus (RNK)
- The Bible as a Resource for Translation Software
- The ECI Multilingual corpus
- Slovenian Corpus FIDA and FIDA+
- SMULTRON Corpus parallell treebank of English, German and Swedish
- Spanish Corpus
- UN declaration of human rights in multiple languages
- UNITEX
- Useful links about parallel corpora, by Olivier Kraif
- WaCky Project
- Wortlisten: spoken German, English, French, and Dutch
Software
- Dictionaries for International Ispell
- Heart of Gold - XML-based middleware for the integration of (deep and shallow) NLP components
- JWKTL - Java Wiktionary Library
- Kirrkirr 4.0 Dictionary Program - software for the exploration of indigenous language dictionaries
- MtRecode - Character conversion program
- MtScript - The Multext multi-lingual text editor
- Natural Language Processing for French
- TIGERSearch - tools for linguistic text exploration
- SProUT - Shallow Processing with Unification and Typed Feature Structures
Other lists of multilingual software
- List of systems in multiple languages - from DFKI Language Technology Lab