Difference between revisions of "Resources for Basque"
Jump to navigation
Jump to search
Line 12: | Line 12: | ||
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1183368317/publikoak/pdf-format main reference] Areta N., Gurrutxaga A., Leturia I., Alegria I., Artola X., Díaz de Ilarraza A., Ezeiza N., Sologaistoa A. 2007. ZT Corpus: Annotation and tools for Basque corpora Copus Linguistics. Birminghan. | ** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1183368317/publikoak/pdf-format main reference] Areta N., Gurrutxaga A., Leturia I., Alegria I., Artola X., Díaz de Ilarraza A., Ezeiza N., Sologaistoa A. 2007. ZT Corpus: Annotation and tools for Basque corpora Copus Linguistics. Birminghan. | ||
* [http://www.corpeus.org Corpeus: web as a corpus Basque] | * [http://www.corpeus.org Corpeus: web as a corpus Basque] | ||
− | ** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1190627973/publikoak/pdf main reference] Leturia I., Gurrutxaga A., Alegria I., Ezeiza A. 2007. CorpEus, a ‘web as corpus’ tool designed for the agglutinative nature of Basque. WAC3 2007 (Web as a Corpus) workshop. | + | ** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1190627973/publikoak/pdf main reference] Leturia I., Gurrutxaga A., Alegria I., Ezeiza A. 2007. CorpEus, a ‘web as corpus’ tool designed for the agglutinative nature of Basque.. WAC3 2007 (Web as a Corpus) workshop. |
==Proofing Tools== | ==Proofing Tools== | ||
Line 24: | Line 24: | ||
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911641/publikoak/96LITER_M.ps main reference] Alegria I., Artola X., Sarasola K. 1996. Automatic morphological analysis of Basque. Literary & Linguistic Computing Vol. 11, No. 4, 193-203. Oxford University Press. Oxford. 1996. | ** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911641/publikoak/96LITER_M.ps main reference] Alegria I., Artola X., Sarasola K. 1996. Automatic morphological analysis of Basque. Literary & Linguistic Computing Vol. 11, No. 4, 193-203. Oxford University Press. Oxford. 1996. | ||
− | == | + | ==Taggers/parsers== |
− | * EusTagger | + | |
+ | * EusTagger: Lematizar/Tagger for Basque | ||
** [http://ixa2.si.ehu.es/demo/analisimorf.jsp demo] | ** [http://ixa2.si.ehu.es/demo/analisimorf.jsp demo] | ||
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911659/publikoak/98ACLdesanb.ps main reference] Ezeiza N., Aduriz I., Alegria I., Arriola J.M., Urizar R. 1998. Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages COLING-ACL'98. Pgs. 380 - 384. Vol 1. Montreal (Canada). August 10-14, 1998. | ** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911659/publikoak/98ACLdesanb.ps main reference] Ezeiza N., Aduriz I., Alegria I., Arriola J.M., Urizar R. 1998. Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages COLING-ACL'98. Pgs. 380 - 384. Vol 1. Montreal (Canada). August 10-14, 1998. | ||
+ | |||
+ | * Eihera: NERC for Basque | ||
+ | ** [http://ixa2.si.ehu.es/demo/entitateak.jsp demo] | ||
+ | ** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1076696580/publikoak/IJCNLP04 main reference] Alegria I., Arregi O., Ezeiza N., Fernandez I., Urizar R. 2004. Design and Development of a Named Entity Recognizer for an Agglutinative Language. First International Joint Conference on NLP (IJCNLP-04). Workshop on Named Entity Recognition. | ||
==Machine translation systems== | ==Machine translation systems== |
Revision as of 05:25, 15 November 2007
Lexical resources
- EDBL - Lexical Data Base for Basque in ELDA
- demo
- main reference Aldezabal I., Ansa O., Arrieta B., Artola X., Ezeiza A., Hernández G., Lersundi M. 2001. EDBL: a General Lexical Basis for the Automatic Processing of Basque. IRCS Workshop on linguistic databases. Philadelphia (USA).
- EuroWordNet - Basque
- main reference Agirre E., Ansa O., Arregi X., Arriola J., Díaz de Ilarraza A., Pociello E., Uria L. 2002. Methodological issues in the building of the Basque WordNet: quantitative and qualitative analysis. Proceedings of First International WordNet Conference. pp. 32-40. Mysore (India).
Corpora
- XX century's Basque corpus Basque corpus XX century
- ZT corpus Basque Corpus of Science and Technology
- main reference Areta N., Gurrutxaga A., Leturia I., Alegria I., Artola X., Díaz de Ilarraza A., Ezeiza N., Sologaistoa A. 2007. ZT Corpus: Annotation and tools for Basque corpora Copus Linguistics. Birminghan.
- Corpeus: web as a corpus Basque
- main reference Leturia I., Gurrutxaga A., Alegria I., Ezeiza A. 2007. CorpEus, a ‘web as corpus’ tool designed for the agglutinative nature of Basque.. WAC3 2007 (Web as a Corpus) workshop.
Proofing Tools
- Xuxen: Spelling checker/corrector for Basque MS-Office, OpenOffice, ...
Free software
Morphology
- Morphological analizer/generator for Basque
- demo
- main reference Alegria I., Artola X., Sarasola K. 1996. Automatic morphological analysis of Basque. Literary & Linguistic Computing Vol. 11, No. 4, 193-203. Oxford University Press. Oxford. 1996.
Taggers/parsers
- EusTagger: Lematizar/Tagger for Basque
- demo
- main reference Ezeiza N., Aduriz I., Alegria I., Arriola J.M., Urizar R. 1998. Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages COLING-ACL'98. Pgs. 380 - 384. Vol 1. Montreal (Canada). August 10-14, 1998.
- Eihera: NERC for Basque
- demo
- main reference Alegria I., Arregi O., Ezeiza N., Fernandez I., Urizar R. 2004. Design and Development of a Named Entity Recognizer for an Agglutinative Language. First International Joint Conference on NLP (IJCNLP-04). Workshop on Named Entity Recognition.
Machine translation systems
Free software
- Matxin Spanish->Basque
- Opentrad demo es->eu
- main reference: Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2007. Transfer-based MT from Spanish into Basque: reusability, standardization and open source. LNCS 4394. 374-384. Cicling 2007.