Resources for Romanian

From ACL Wiki
Revision as of 08:50, 26 May 2014 by Zeman (talk | contribs) (HamleDT)

Machine translation systems

Free software

Proprietary

Lexical resources

Corpora

Free

  • Europarl corpus, sentence-aligned with English
  • HamleDT, harmonized dependency treebanks of many languages in a common annotation style
  • Romanian NLP
  • Southeast European Times (sentence-aligned corpus; Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish; approximately 4.5 million words per language)

Proprietary

  • Corpora (monolingual, POS-tagged, and bilingual English/French<->Romanian)

Bibliography

External links