Difference between revisions of "Multilingual resources"

From ACL Wiki
Jump to navigation Jump to search
(rm dead link)
(13 intermediate revisions by 5 users not shown)
Line 1: Line 1:
 
For individual languages, see [[List of resources by language]].
 
For individual languages, see [[List of resources by language]].
  
== General Multilingual Resources ==
+
* [[Multilingual Dictionaries]] - including Bilingual Dictionaries
* [http://www.omegawiki.org/ OmegaWiki] - a collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
+
* [[Multilingual Corpora]] - including Bilingual Corpora
 
+
* [[Multilingual Tools and Software]] - including Bilingual Tools and Software
== Bilingual and Multilingual Dictionaries ==
 
 
 
* [http://www.elra.info Bilingual Dictionary French Arabic]  
 
* [http://crodip.ffzg.hr/default_en.aspx Croatian Old Dictionary Portal] - University of Zagreb
 
* [http://www.canoo.net/ Canoo.net] - German Dictionaries and Grammars
 
* [http://www.csse.monash.edu.au/~jwb/edict_doc.html JMdict: Japanese-Multilingual Dictionary file]
 
* [http://dict.leo.org/ LEO: Deutsche-Englisches Woerterbuch] - German-English Dictionary
 
*[http://longdo.ex.nii.ac.jp/ Longdo Thai Multilingual Dictionary]  
 
* [http://www.nada.kth.se/skolverket/sve-eng.html Swedish to English]
 
* [http://dix.osola.com/ Deutsch-Spanisch Woerterbuch] - German-Spanish Dictionary
 
* [http://www.ibs.ee/dict/index.html English to Estonian]
 
* [http://www.uni-frankfurt.de/~felix/eurodictautom.html ECHO - Eurodicautom] - multilingual technical dictionary
 
  
== Corpora ==
+
== Related resources ==
  
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
+
* [http://www.dunglish.nl/ Dunglish (Dutch & English)]
*[http://spraakbanken.gu.se/ Bank of Swedish]
+
* [http://www.omegawiki.org/ OmegaWiki] - a collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
*[http://sli.uvigo.es/CLUVI/ CLUVI Corpus (Galician-English-Spanish-French parallel corpus)]
+
* [https://www.mpi-inf.mpg.de/yago-naga/uwn/ UWN]: Multilingual wordnet covering over 200 languages, freely available as a TSV file
*[http://hnk.ffzg.hr/ Croatian National Corpus (HNK)]
 
*[http://ucnk.ff.cuni.cz/ Czech National Corpus (CNC)]
 
*[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
 
*[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS]
 
*[http://www.linguateca.pt/COMPARA/ COMPARA corpus]
 
*[http://www.debian.org/international/ Debian free software community]
 
*[http://www.ling.lancs.ac.uk/corplang/emille EMILLE corpus]
 
*[http://www.statmt.org/europarl/ European Parliament Proceedings Parallel Corpus 1996-2003]
 
*[http://www.illc.uva.nl/EuroWordNet EuroWordNet]
 
*[http://www.france.diplomatie.fr/label_france/index.html French Foreign Ministry's magazine]
 
*[http://glossa.fltr.ucl.ac.be/ GlossaNet]
 
*[http://hometown.aol.com/mit2haiti/JA-HC-kr.htm Haitian Creole corpus -Teknoloji pou lang kreyol]
 
*[http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
 
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
 
*[http://korpus.pl/ IPI PAN Corpus of Polish]
 
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
 
*[http://corpora.informatik.uni-leipzig.de Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads]
 
*[http://muchmore.dfki.de/resources1.htm MuchMore Springer Bilingual Corpus]
 
*[http://nl.ijs.si/ME/ MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages]
 
*[http://tcc.itc.it/people/forner/multilingualcorpora.html Multilingual Corpora: Available Resources]
 
* [http://www.csse.monash.edu.au/~jwb/tanakacorpus.html Tanaka Corpus: Japanese-English sentence pairs]
 
*[http://multisemcor.itc.it MultiSemCor]
 
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
 
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
 
*[http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
 
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
 
*[http://www.corpusdoportugues.org/ Portuguese Corpus]
 
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
 
*[http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
 
*[http://www.corpusdelespanol.org/ Spanish Corpus]
 
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
 
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX]
 
*[http://www.u-grenoble3.fr/kraif/liens.htm Useful links about parallel corpora, by Olivier Kraif]
 
*[http://wacky.sslmit.unibo.it/ WaCky Project]
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
 
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
  
==Software==
+
[[Category:Resources by language|Multilingual]]
*[http://nlp.stanford.edu/kirrkirr/ Kirrkirr 4.0 Dictionary Program] - software for the exploration of indigenous language dictionaries
 
 
 
== Interactions among languages ==
 
 
 
*[http://www.dunglish.nl/ Dunglish (Dutch & English)]
 

Revision as of 08:59, 26 June 2016

For individual languages, see List of resources by language.

Related resources

  • Dunglish (Dutch & English)
  • OmegaWiki - a collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
  • UWN: Multilingual wordnet covering over 200 languages, freely available as a TSV file