Difference between revisions of "Resources for Croatian"

From ACL Wiki
Jump to navigation Jump to search
Line 4: Line 4:
  
 
==Corpora==
 
==Corpora==
 +
 +
* [http://riznica.ihjj.hr/en/ Croatian Language Corpus] (continuously growing (currently approx. 100 mil. tokens) corpus of Croatian covering various genres and time periods, using Philologic for online search)
  
 
===Free===
 
===Free===

Revision as of 09:49, 5 December 2007

General

Corpora

  • Croatian Language Corpus (continuously growing (currently approx. 100 mil. tokens) corpus of Croatian covering various genres and time periods, using Philologic for online search)

Free

  • Southeast European Times (paragraph aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — 9,678 paragraphs, 92,450— 122,912 words per language)