Resources for Turkish

From ACL Wiki
Revision as of 15:33, 10 January 2013 by TsCorpus (talk | contribs)
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Morphological analysis

Free software

  • TRMorph "is a relatively complete morphological analyzer for Turkish. It is implemented using SFST, and uses a lexicon based on (but heavily modified) the wordlist of Zemberek spell checker. The morphological analyzer is distributed under the GPL."

Proprietary

Lexical resources

Corpora

Free

  • Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)
  • TS Corpus (PoSTagged Turkish Corpus. The corpus also has morphological and lemma tags. Consist of 491 Million tokens)

Proprietary

Bibliography

  • K. Oflazer, "Two-level Description of Turkish Morphology," Literary and Linguistic Computing, vol. 9, pp. 137-148, 1995. Backwards PDF

See also

External links