Difference between revisions of "Resources for Georgian"

From ACL Wiki

Revision as of 05:35, 10 February 2011

Corpora

Free software

  • Unitex (http://igm.univ-mlv.fr/~unitex/) includes an LGPL-LR text of Ancient Georgian (25,900 words; 7,180 forms)

Unknown licence, but web searchable

  • The text archive of the Georgian service (http://www.tavisupleba.org/) of Radio Free Europe/Radio Liberty, around eight million words.
  • The largest archive of fictional texts (both prose and poetry): the UNESCO Project digital collection of Georgian classical literature (http://www.nplg.gov.ge/gsdl/), three million words.

Syntactic and morphological analysis

Proprietary

  • The Georgian Grammar project consists of a morphological analyser, a Georgian LFG grammar, a demo treebank (which will eventually evolve into a treebank as a linguistic resource), and an unannotated corpus of non-fictional (mainly newspaper) and fictional texts

Fonts

See the Georgian Grammar project

Translational dictionaries

Proprietary

  • Translate.ge Georgian–English–Georgian dictionary, >42,000 entries per language
  • lykt.info downloadable Georgian–Norwegian–Georgian dictionary, ~20,000 entries