Difference between revisions of "Multilingual resources"

From ACL Wiki
Jump to navigation Jump to search
m (Change *[http://nltk.sourceforge.net Natural Language Toolkit (NLTK)] to *[http://nltk.net Natural Language Toolkit (NLTK)], move to Multilingual resources, Collaborative projects.)
 
(7 intermediate revisions by 4 users not shown)
Line 1: Line 1:
 
For individual languages, see [[List of resources by language]].
 
For individual languages, see [[List of resources by language]].
  
== General Multilingual Resources ==
+
* [[Multilingual Dictionaries]] - including Bilingual Dictionaries
* [http://www.omegawiki.org/ OmegaWiki] - a collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
+
* [[Multilingual Corpora]] - including Bilingual Corpora
 +
* [[Multilingual Tools and Software]] - including Bilingual Tools and Software
  
== Bilingual and Multilingual Dictionaries ==
+
== Collaborative projects ==
 
 
* [http://www.elra.info Bilingual Dictionary French Arabic]
 
* [http://crodip.ffzg.hr/default_en.aspx Croatian Old Dictionary Portal] - University of Zagreb
 
* [http://www.csse.monash.edu.au/~jwb/edict_doc.html JMdict: Japanese-Multilingual Dictionary file]
 
* [http://dict.leo.org/ LEO: Deutsche-Englisches Woerterbuch] - German-English Dictionary
 
*[http://longdo.ex.nii.ac.jp/ Longdo Thai Multilingual Dictionary]
 
* [http://www.nada.kth.se/skolverket/sve-eng.html Swedish to English]
 
* [http://dix.osola.com/ Deutsch-Spanisch Woerterbuch] - German-Spanish Dictionary
 
* [http://www.ibs.ee/dict/index.html English to Estonian]
 
* [http://www.uni-frankfurt.de/~felix/eurodictautom.html ECHO - Eurodicautom] - multilingual technical dictionary
 
 
 
== Corpora ==
 
 
 
<!-- Please keep this list in alphabetical order -->
 
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
 
*[http://spraakbanken.gu.se/ Bank of Swedish]
 
*[http://sli.uvigo.es/CLUVI/ CLUVI Corpus (Galician-English-Spanish-French parallel corpus)]
 
*[http://hnk.ffzg.hr/ Croatian National Corpus (HNK)]
 
*[http://ucnk.ff.cuni.cz/ Czech National Corpus (CNC)]
 
*[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
 
*[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS]
 
*[http://www.linguateca.pt/COMPARA/ COMPARA corpus]
 
*[http://www.debian.org/international/ Debian free software community]
 
*[http://www.ling.lancs.ac.uk/corplang/emille EMILLE corpus]
 
*[http://www.statmt.org/europarl/ European Parliament Proceedings Parallel Corpus 1996-2003]
 
*[http://www.illc.uva.nl/EuroWordNet EuroWordNet]
 
*[http://www.france.diplomatie.fr/label_france/index.html French Foreign Ministry's magazine]
 
*[http://glossa.fltr.ucl.ac.be/ GlossaNet]
 
*[http://hometown.aol.com/mit2haiti/JA-HC-kr.htm Haitian Creole corpus -Teknoloji pou lang kreyol]
 
*[http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
 
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
 
*[http://korpus.pl/ IPI PAN Corpus of Polish]
 
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
 
*[http://corpora.informatik.uni-leipzig.de Leipzig Corpora Collection: Large monolingual raw corpora for 17+ languages, free downloads]
 
*[http://www.coli.uni-saarland.de/~gparis/LMD-TAZ_corpus/ Le Monde Diplomatique-Die Tageszeitung Translation Corpus] - French-German, aligned (parallel)
 
*[http://muchmore.dfki.de/resources1.htm MuchMore Springer Bilingual Corpus]
 
*[http://nl.ijs.si/ME/ MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages]
 
*[http://tcc.itc.it/people/forner/multilingualcorpora.html Multilingual Corpora: Available Resources]
 
* [http://www.csse.monash.edu.au/~jwb/tanakacorpus.html Tanaka Corpus: Japanese-English sentence pairs]
 
*[http://multisemcor.itc.it MultiSemCor]
 
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
 
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
 
*[http://www.tekstlab.uio.no/Bosnian/Corpus.html Oslo Corpus of Bosnian]
 
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
 
*[http://www.corpusdoportugues.org/ Portuguese Corpus]
 
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
 
*[http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 
*[http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
 
*[http://www.ling.su.se/DaLi/research/smultron/index.htm SMULTRON Corpus] parallell treebank of English, German and Swedish
 
*[http://www.corpusdelespanol.org/ Spanish Corpus]
 
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
 
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX]
 
*[http://www.u-grenoble3.fr/kraif/liens.htm Useful links about parallel corpora, by Olivier Kraif]
 
*[http://wacky.sslmit.unibo.it/ WaCky Project]
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
 
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
 +
;[http://www.language-archives.org/ OLAC: Open Language Archives Community]
 +
:An international partnership of institutions and individuals who are creating a worldwide virtual library of language resources
 +
;[http://www.omegawiki.org/ OmegaWiki]
 +
:A collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
 +
;[http://nltk.net Natural Language Toolkit (NLTK)]
 +
:An amazing library to play with natural language
  
==Software==
+
== Lexical databases ==
<!-- Please keep this list in alphabetical order -->
 
*[http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html#Spanish-dicts Dictionaries for International Ispell]
 
*[http://heartofgold.dfki.de Heart of Gold] - XML-based middleware for the integration of (deep and shallow) NLP components
 
*[http://www.ukp.tu-darmstadt.de/software/jwktl/ JWKTL] - Java Wiktionary Library
 
*[http://nlp.stanford.edu/kirrkirr/ Kirrkirr 4.0 Dictionary Program] - software for the exploration of indigenous language dictionaries
 
*[http://www.lpl.univ-aix.fr/projects/multext/MtRecode/ MtRecode] - Character conversion program
 
*[http://www.lpl.univ-aix.fr/projects/multext/MtScript/ MtScript] - The Multext multi-lingual text editor
 
*[http://bach.arts.kuleuven.ac.be/~piet/fr_nlp.html Natural Language Processing for French]
 
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/ TIGERSearch] - tools for linguistic text exploration
 
*[http://sprout.dfki.de SProUT] - Shallow Processing with Unification and Typed Feature Structures
 
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
 +
;[https://www.mpi-inf.mpg.de/yago-naga/uwn/ UWN]: Multilingual wordnet covering over 200 languages, freely available as a TSV file
  
===Other lists of multilingual software===
+
== Weblogs ==
<!-- Please keep this list in alphabetical order -->
 
*[http://let.dfki.uni-sb.de/~heinz/liste.php3 List of systems in multiple languages] - from DFKI Language Technology Lab
 
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
 +
;[http://www.dunglish.nl/ Dunglish (Dutch & English)]
  
== Interactions among languages ==
 
  
*[http://www.dunglish.nl/ Dunglish (Dutch & English)]
+
[[Category:Resources by language|Multilingual]]

Latest revision as of 03:03, 12 September 2019

For individual languages, see List of resources by language.

Collaborative projects

OLAC: Open Language Archives Community
An international partnership of institutions and individuals who are creating a worldwide virtual library of language resources
OmegaWiki
A collaborative project to produce a free (as in free software) multilingual resource in every language, with lexicological, terminological and thesaurus information
Natural Language Toolkit (NLTK)
An amazing library to play with natural language

Lexical databases

UWN
Multilingual wordnet covering over 200 languages, freely available as a TSV file

Weblogs

Dunglish (Dutch & English)