Resources for Slovenian

From ACL Wiki
Revision as of 10:27, 12 October 2013 by Jonsafari (talk | contribs) (→‎Corpora: +Europarl corpus; reorg)
Jump to navigation Jump to search

Corpora

Free license

  • Europarl corpus, sentence aligned with English
  • IJS - ELAN Slovene-English Parallel Corpus
  • JRC Acquis parallel texts. Languages involved: Bulgarian, Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovene and Swedish.

Non-free license

  • Multext EAST lexica, annotated "1984" corpus, parallel and comparable text and speech corpora. Languages involved: Bulgarian, Croatian, Czech, English, Estonian, Hungarian, Lithuanian, Macedonian, Persian, Polish, Resian, Romanian, Russian, Serbian, Slovak, Slovene, and Ukrainian