Resources for Croatian

From ACL Wiki
Revision as of 10:49, 5 December 2007 by Dcavar (talk | contribs) (→‎Corpora)
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

General

Corpora

  • Croatian Language Corpus (continuously growing (currently approx. 100 mil. tokens) corpus of Croatian covering various genres and time periods, using Philologic for online search)

Free

  • Southeast European Times (paragraph aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — 9,678 paragraphs, 92,450— 122,912 words per language)