Difference between revisions of "Resources for Greek"

From ACL Wiki
Jump to: navigation, search
(Named entity recognition)
m (Reverted edits by Creek (talk) to last revision by Francis Tyers)
(11 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 +
==Corpora==
 +
 +
===Free===
 +
 +
* [http://www.statmt.org/setimes/ Southeast European Times] (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)
 +
 
==Machine translation systems==
 
==Machine translation systems==
  
Line 5: Line 11:
 
===Proprietary===
 
===Proprietary===
  
==Corpora==
+
==Named entity recognition==
  
 
===Free===
 
===Free===
  
* [http://xixona.dlsi.ua.es/~fran/setimes/ Southeast European Times] (paragraph aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — 9,678 paragraphs, 92,450— 122,912 words per language)
+
* [http://www.aueb.gr/users/ion/software/GREEK_NERC_v2.tar.gz Greek named entity recognizer (version 2)] Currently identifies temporal expressions, person names, and organization names. See [http://www.aueb.gr/users/ion/publications.html here] for publications describing the recognizer. (GPL)
  
==Named entity recognition==
+
==Natural language generation==
 +
 
 +
===Free===
 +
 
 +
* [http://www.aueb.gr/users/ion/software/NaturalOWL1.1.tar.gz NaturalOWL (version 1.1)] Generates Greek and English descriptions of entities and classes from OWL ontologies that have been annotated with linguistic and user modeling resources in RDF. It can also be used as a [http://protege.stanford.edu/ Protégé] plug-in. See [http://www.aueb.gr/users/ion/publications.html here] for publications describing NaturalOWL. (GPL)
 +
 
 +
==Part-of-speech (POS) tagging==
 +
 
 +
===Free===
  
* [http://www.aueb.gr/users/ion/software/GREEK_NERC_v2.tar.gz Greek named entity recognizer (version 2)] Currently identifies temporal expressions, person names, and organization names. See [http://www.aueb.gr/users/ion/publications.html here] for publications describing the recognizer.
+
* [http://nlp.cs.aueb.gr/software_and_datasets/AUEB_Greek_POS_tagger.tar.gz Greek POS tagger] Attempts to automatically determine the part of speech (e.g., noun, adjective, verb, etc.) of each word occurrence in Greek texts. It can also tag (with lower accuracy) each word occurrence with additional information, such as the gender, number, and case of each noun, the voice, tense, and number of each verb etc. (GPL)
  
 
==Bibliography==
 
==Bibliography==
  
 +
* D. Galanis and I. Androutsopoulos, "Generating Multilingual Descriptions from Linguistically Annotated OWL Ontologies: the NaturalOWL System". In ''Proceedings of the 11th European Workshop on Natural Language Generation'' (ENLG 2007), Schloss Dagstuhl, Germany, pp. 143-146, 2007.
 
* G. Lucarelli and I. Androutsopoulos, "A Greek Named-Entity Recognizer that Uses Support Vector Machines and Active Learning". In ''Proceedings of the 4th Hellenic Conference on Artificial Intelligence'' (SETN 2006), Heraklion, Greece, 2006.
 
* G. Lucarelli and I. Androutsopoulos, "A Greek Named-Entity Recognizer that Uses Support Vector Machines and Active Learning". In ''Proceedings of the 4th Hellenic Conference on Artificial Intelligence'' (SETN 2006), Heraklion, Greece, 2006.
* G. Lucarelli, X. Vasilakos and I. Androutsopoulos, "Named Entity Recognition in Greek Texts with an Ensemble of SVMs and Active Learning". International Journal on Artificial Intelligence Tools, World Scientific (accepted for publication).  
+
* G. Lucarelli, X. Vasilakos and I. Androutsopoulos, "Named Entity Recognition in Greek Texts with an Ensemble of SVMs and Active Learning". International Journal on Artificial Intelligence Tools, 16(6):1015-1045, World Scientific, 2007.  
 
* G. Petasis, V. Karkaletsis, D. Farmakiotou, G. Samaritakis, I. Androutsopoulos, C. D. Spyropoulos, "A Greek Morphological Lexicon and its Exploitation by a Greek Controlled Language Checker". In ''Proceedings of the 8th Panhellenic Conference on Informatics'', 8 - 10 November 2001, Nicosia, Cyprus.
 
* G. Petasis, V. Karkaletsis, D. Farmakiotou, G. Samaritakis, I. Androutsopoulos, C. D. Spyropoulos, "A Greek Morphological Lexicon and its Exploitation by a Greek Controlled Language Checker". In ''Proceedings of the 8th Panhellenic Conference on Informatics'', 8 - 10 November 2001, Nicosia, Cyprus.
 
* G. Petasis, G. Paliouras, V. Karkaletsis, C.D. Spyropoulos and I. Androutsopoulos, "Resolving Part-of-Speech Ambiguity in the Greek Language Using Learning Techniques". In Fakotakis, N. et al. (Eds.), ''Machine Learning in Human Language Technology'' (Proceedings of the ACAI Workshop), pp. 29-34, Chania, Greece, 1999.
 
* G. Petasis, G. Paliouras, V. Karkaletsis, C.D. Spyropoulos and I. Androutsopoulos, "Resolving Part-of-Speech Ambiguity in the Greek Language Using Learning Techniques". In Fakotakis, N. et al. (Eds.), ''Machine Learning in Human Language Technology'' (Proceedings of the ACAI Workshop), pp. 29-34, Chania, Greece, 1999.

Revision as of 04:17, 25 June 2012

Corpora

Free

  • Southeast European Times (sentence aligned corpus, Albanian, Bulgarian, English, Greek, Macedonian, Romanian, Serbo-Croatian, Turkish — approximately 4.5 million words per language)

Machine translation systems

Free software

Proprietary

Named entity recognition

Free

Natural language generation

Free

  • NaturalOWL (version 1.1) Generates Greek and English descriptions of entities and classes from OWL ontologies that have been annotated with linguistic and user modeling resources in RDF. It can also be used as a Protégé plug-in. See here for publications describing NaturalOWL. (GPL)

Part-of-speech (POS) tagging

Free

  • Greek POS tagger Attempts to automatically determine the part of speech (e.g., noun, adjective, verb, etc.) of each word occurrence in Greek texts. It can also tag (with lower accuracy) each word occurrence with additional information, such as the gender, number, and case of each noun, the voice, tense, and number of each verb etc. (GPL)

Bibliography

  • D. Galanis and I. Androutsopoulos, "Generating Multilingual Descriptions from Linguistically Annotated OWL Ontologies: the NaturalOWL System". In Proceedings of the 11th European Workshop on Natural Language Generation (ENLG 2007), Schloss Dagstuhl, Germany, pp. 143-146, 2007.
  • G. Lucarelli and I. Androutsopoulos, "A Greek Named-Entity Recognizer that Uses Support Vector Machines and Active Learning". In Proceedings of the 4th Hellenic Conference on Artificial Intelligence (SETN 2006), Heraklion, Greece, 2006.
  • G. Lucarelli, X. Vasilakos and I. Androutsopoulos, "Named Entity Recognition in Greek Texts with an Ensemble of SVMs and Active Learning". International Journal on Artificial Intelligence Tools, 16(6):1015-1045, World Scientific, 2007.
  • G. Petasis, V. Karkaletsis, D. Farmakiotou, G. Samaritakis, I. Androutsopoulos, C. D. Spyropoulos, "A Greek Morphological Lexicon and its Exploitation by a Greek Controlled Language Checker". In Proceedings of the 8th Panhellenic Conference on Informatics, 8 - 10 November 2001, Nicosia, Cyprus.
  • G. Petasis, G. Paliouras, V. Karkaletsis, C.D. Spyropoulos and I. Androutsopoulos, "Resolving Part-of-Speech Ambiguity in the Greek Language Using Learning Techniques". In Fakotakis, N. et al. (Eds.), Machine Learning in Human Language Technology (Proceedings of the ACAI Workshop), pp. 29-34, Chania, Greece, 1999.

External links