Difference between revisions of "Resources for Georgian"
Jump to navigation
Jump to search
m (→Proprietary) |
|||
Line 1: | Line 1: | ||
==Corpora== | ==Corpora== | ||
+ | ===Free software=== | ||
+ | * [http://igm.univ-mlv.fr/~unitex/ Unitex] includes an [[LGPL-LR]] text of Ancient Georgian (25.900 words; 7.180 forms) | ||
===Unknown licence, but web searchable=== | ===Unknown licence, but web searchable=== | ||
Line 5: | Line 7: | ||
* The text archive of the [http://www.tavisupleba.org/ Georgian service] of Radio Free Europe/Radio Liberty, around eight million words. | * The text archive of the [http://www.tavisupleba.org/ Georgian service] of Radio Free Europe/Radio Liberty, around eight million words. | ||
* The largest archive of fictional texts (both prose and poetry): the [http://www.nplg.gov.ge/gsdl/ UNESCO Project digital collection of Georgian classical literature] (both prose and poetry), three million words. | * The largest archive of fictional texts (both prose and poetry): the [http://www.nplg.gov.ge/gsdl/ UNESCO Project digital collection of Georgian classical literature] (both prose and poetry), three million words. | ||
− | |||
==Syntactic and morphological analysis== | ==Syntactic and morphological analysis== |
Revision as of 05:35, 10 February 2011
Corpora
Free software
Unknown licence, but web searchable
- The electronic newspaper archive Opentext comprises approximately 100 million words and is by far the largest collection of Georgian texts available online.
- The text archive of the Georgian service of Radio Free Europe/Radio Liberty, around eight million words.
- The largest archive of fictional texts (both prose and poetry): the UNESCO Project digital collection of Georgian classical literature (both prose and poetry), three million words.
Syntactic and morphological analysis
Proprietary
- The Georgian Grammar project consists of a Morphological analyser, a Georgian LFG Grammar, a demo treebank (which will eventually evolve into a treebank as a lingulistic resource, and an (un-annotated) corpus of non-fictional (mainly newspaper) and fictional texts
Fonts
See the Georgian Grammar project
Translational dictionaries
Proprietary
- Translate.ge Georgian–English–Georgian dictionary, >42,000 entries per language
- lykt.info downloadable Georgian-Norwegian-Georgian dictionary, ~20,000 entries