Difference between revisions of "Resources for Croatian"
From ACL Wiki
|Line 4:||Line 4:|
Revision as of 10:49, 5 December 2007
- Croatian Language Corpus (continuously growing (currently approx. 100 mil. tokens) corpus of Croatian covering various genres and time periods, using Philologic for online search)
- Southeast European Times (paragraph aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — 9,678 paragraphs, 92,450— 122,912 words per language)