Difference between revisions of "Multilingual resources"

From ACL Wiki
Jump to navigation Jump to search
m (Change *[http://nltk.sourceforge.net Natural Language Toolkit (NLTK)] to *[http://nltk.net Natural Language Toolkit (NLTK)], move to Multilingual resources, Collaborative projects.)
 
(18 intermediate revisions by 7 users not shown)
Line 1: Line 1:
 
For individual languages, see [[List of resources by language]].
 
For individual languages, see [[List of resources by language]].
  
== Bilingual Dictionaries ==
+
* [[Multilingual Dictionaries]] - including Bilingual Dictionaries
 +
* [[Multilingual Corpora]] - including Bilingual Corpora
 +
* [[Multilingual Tools and Software]] - including Bilingual Tools and Software
  
* [http://www.elra.info Bilingual Dictionary French Arabic]
+
== Collaborative projects ==
* [http://crodip.ffzg.hr/default_en.aspx Croatian Old Dictionary Portal] - University of Zagreb
+
<!-- Please keep this list in alphabetical order -->
* [http://www.canoo.net/ Canoo.net] - German Dictionaries and Grammars
+
;[http://www.language-archives.org/ OLAC: Open Language Archives Community]
* [http://www.csse.monash.edu.au/~jwb/edict_doc.html JMdict: Japanese-Multilingual Dictionary file]
+
:An international partnership of institutions and individuals who are creating a worldwide virtual library of language resources
* [http://dict.leo.org/ LEO: Deutsche-Englisches Woerterbuch] - German-English Dictionary
+
;[http://www.omegawiki.org/ OmegaWiki]
* [http://www.nada.kth.se/skolverket/sve-eng.html Swedish to English]
+
:A collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
* [http://dix.osola.com/ Deutsch-Spanisch Woerterbuch] - German-Spanish Dictionary
+
;[http://nltk.net Natural Language Toolkit (NLTK)]
* [http://www.ibs.ee/dict/index.html English to Estonian]
+
:An amazing library to play with natural language
  
== Corpora ==
+
== Lexical databases ==
 +
<!-- Please keep this list in alphabetical order -->
 +
;[https://www.mpi-inf.mpg.de/yago-naga/uwn/ UWN]: Multilingual wordnet covering over 200 languages, freely available as a TSV file
  
 +
== Weblogs ==
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
+
;[http://www.dunglish.nl/ Dunglish (Dutch & English)]
*[http://spraakbanken.gu.se/ Bank of Swedish]
 
*[http://sli.uvigo.es/CLUVI/ CLUVI Corpus (Galician-English-Spanish-French parallel corpus)]
 
*[http://hnk.ffzg.hr/ Croatian National Corpus (HNK)]
 
*[http://ucnk.ff.cuni.cz/ Czech National Corpus (CNC)]
 
*[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
 
*[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS]
 
*[http://www.linguateca.pt/COMPARA/ COMPARA corpus]
 
*[http://www.debian.org/international/ Debian free software community]
 
*[http://www.ling.lancs.ac.uk/corplang/emille EMILLE corpus]
 
*[http://www.statmt.org/europarl/ European Parliament Proceedings Parallel Corpus 1996-2003]
 
*[http://www.illc.uva.nl/EuroWordNet EuroWordNet]
 
*[http://www.france.diplomatie.fr/label_france/index.html French Foreign Ministry's magazine]
 
*[http://glossa.fltr.ucl.ac.be/ GlossaNet]
 
*[http://hometown.aol.com/mit2haiti/JA-HC-kr.htm Haitian Creole corpus -Teknoloji pou lang kreyol]
 
*[http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
 
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
 
*[http://korpus.pl/ IPI PAN Corpus of Polish]
 
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
 
*[http://corpora.informatik.uni-leipzig.de Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads]
 
*[http://muchmore.dfki.de/resources1.htm MuchMore Springer Bilingual Corpus]
 
*[http://nl.ijs.si/ME/ MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages]
 
*[http://tcc.itc.it/people/forner/multilingualcorpora.html Multilingual Corpora: Available Resources]
 
* [http://www.csse.monash.edu.au/~jwb/tanakacorpus.html Tanaka Corpus: Japanese-English sentence pairs]
 
*[http://multisemcor.itc.it MultiSemCor]
 
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
 
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
 
*[http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
 
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
 
*[http://www.corpusdoportugues.org/ Portuguese Corpus]
 
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
 
*[http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
 
*[http://www.corpusdelespanol.org/ Spanish Corpus]
 
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
 
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX]
 
*[http://www.u-grenoble3.fr/kraif/liens.htm Useful links about parallel corpora, by Olivier Kraif]
 
*[http://wacky.sslmit.unibo.it/ WaCky Project]
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
 
<!-- Please keep this list in alphabetical order -->
 
  
== Interactions among languages ==
 
  
*[http://www.dunglish.nl/ Dunglish (Dutch & English)]
+
[[Category:Resources by language|Multilingual]]

Latest revision as of 02:03, 12 September 2019

For individual languages, see List of resources by language.

Collaborative projects

OLAC: Open Language Archives Community
An international partnership of institutions and individuals who are creating a worldwide virtual library of language resources
OmegaWiki
A collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
Natural Language Toolkit (NLTK)
An amazing library to play with natural language

Lexical databases

UWN
Multilingual wordnet covering over 200 languages, freely available as a TSV file

Weblogs

Dunglish (Dutch & English)