Difference between revisions of "Resources for Basque"

From ACL Wiki
Jump to navigation Jump to search
(HamleDT)
 
(30 intermediate revisions by 6 users not shown)
Line 1: Line 1:
==Machine translation systems==
+
==Lexical resources==
 +
 
 +
* [http://www.elda.org/catalogue/en/text/L0057.html EDBL] - Lexical Data Base for Basque in ELDA
 +
** [http://ixa2.si.ehu.es/edbl/ demo]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1011897592/publikoak/2001-IRCS.pdf main reference] Aldezabal I., Ansa O., Arrieta B., Artola X., Ezeiza A., Hernández G., Lersundi M.  2001. EDBL: a General Lexical Basis for the Automatic Processing of Basque. IRCS Workshop on linguistic databases. Philadelphia (USA).
 +
* [http://ixa2.si.ehu.es/cgi-bin/mcr/public/wei.consult.perl EuroWordNet - Basque]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1018537154/publikoak/euswn.ps main reference] Agirre E., Ansa O., Arregi X., Arriola J., Díaz de Ilarraza A., Pociello E., Uria L.  2002. Methodological issues in the building of the Basque WordNet: quantitative and qualitative analysis. Proceedings of First International WordNet Conference. pp. 32-40. Mysore (India).
 +
 
 +
==Corpora==
 +
* [http://www.euskaracorpusa.net/XXmendea/Konts_arrunta_fr.html XX century's Basque corpus] Basque corpus XX century
 +
* [http://www.ztcorpusa.net ZT corpus] Basque Corpus of Science and Technology
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1183368317/publikoak/pdf-format main reference] Areta N., Gurrutxaga A., Leturia I., Alegria I., Artola X., Díaz de Ilarraza A., Ezeiza N., Sologaistoa A.  2007. ZT Corpus: Annotation and tools for Basque corpora Copus Linguistics. Birminghan.
 +
* [http://www.corpeus.org Corpeus: web as a corpus Basque]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1190627973/publikoak/pdf main reference] Leturia I., Gurrutxaga A., Alegria I., Ezeiza A. 2007. CorpEus, a ‘web as corpus’ tool designed for the agglutinative nature of Basque.. WAC3 2007 (Web as a Corpus) workshop.
 +
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 +
 
 +
==Proofing Tools==
 +
* [http://www.xuxen.com Xuxen: Spelling checker/corrector for Basque] MS-Office, OpenOffice, ...
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911636/publikoak/97LITER_Z.ps  main reference] Aduriz I., Alegria I., Artola X., Ezeiza N., Sarasola K.  1997. A spelling corrector for Basque based on morphology. Literary & Linguistic Computing, Vol. 12, No. 1. Oxford University Press. Oxford. 1997.
 +
===Free software===
 +
* [https://addons.mozilla.org/eu/firefox/browse/type:3 Xuxen: free version for Mozilla]
 +
* [http://www.euskara.euskadi.net/r59-738/eu/contenidos/informacion/euskarazko_softwarea/eu_9567/zerrenda.html Xuxen for OpenOffice (hunspell)]
 +
 
 +
==Morphology==
  
 
===Free software===
 
===Free software===
  
* [http://matxin.sourceforge.net Matxin] Spanish->Basque
+
* Morphological analyser/generator from [http://www.apertium.org Apertium project] — [https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-eu-es/apertium-eu-es.eu.dix here]  
** demo [http://www.opentrad.org Opentrad] es->eu
+
** [http://xixona.dlsi.ua.es/~fran/basque/index.php Demo here]
** main reference: Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K2007. Transfer-based MT from Spanish into Basque: reusability, standardization and open source. LNCS 4394. 374-384. Cicling 2007.
+
 
 +
===Proprietary software===
 +
* Morphological analyser/generator for Basque
 +
** [http://ixa2.si.ehu.es/demo/analisianali.jsp demo]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911641/publikoak/96LITER_M.ps main reference] Alegria I., Artola X., Sarasola K.  1996. Automatic morphological analysis of Basque. Literary & Linguistic Computing Vol. 11, No. 4, 193-203. Oxford University Press. Oxford. 1996.
 +
 
 +
==Taggers/parsers==
 +
 
 +
* EusTagger: Lemmatizer/Tagger for Basque
 +
** [http://ixa2.si.ehu.es/demo/analisimorf.jsp demo]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1000911659/publikoak/98ACLdesanb.ps main reference] Ezeiza N., Aduriz I., Alegria I., Arriola J.M., Urizar R. 1998. Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages COLING-ACL'98. Pgs. 380 - 384. Vol 1. Montreal (Canada). August 10-14, 1998.
 +
 
 +
* Eihera: Named Entitity Recognizer (NERC) for Basque
 +
** [http://ixa2.si.ehu.es/demo/entitateak.jsp demo]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1076696580/publikoak/IJCNLP04 main reference] Alegria I., Arregi O., Ezeiza N., Fernandez I., Urizar R2004. Design and Development of a Named Entity Recognizer for an Agglutinative Language. First International Joint Conference on NLP (IJCNLP-04). Workshop on Named Entity Recognition.
 +
 
 +
* Zatiak: Chunker for Basque
 +
** [http://ixa2.si.ehu.es/demo/zatiak.jsp demo]
 +
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1097683503/publikoak/camara-ready-basqueDP main reference] Aranzabe M., Arriola J.M., Díaz de Ilarraza  2004. Towards a Dependency Parser of Basque. Proceedings of the Coling 2004 Workshop on Recent Advances in Dependency Grammar. Geneva, Switzerland.
  
===Proprietary===
 
  
==Lexical resources==
+
==Machine translation systems==
  
* [http://www.elda.org/catalogue/en/text/L0057.html EDBL] - Lexical Data Base for Basque
+
===Free software===
** [http://ixa2.si.ehu.es/demo/edbl.jsp demo]
 
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1011897592/publikoak/2001-IRCS.pdf main reference] Aldezabal I., Ansa O., Arrieta B., Artola X., Ezeiza A., Hernández G., Lersundi M.  2001. EDBL: a General Lexical Basis for the Automatic Processing of Basque. IRCS Workshop on linguistic databases. Philadelphia (USA).
 
* [http://www.lsi.upc.edu/~nlp/projectes/ewn.html EuroWordNet - Basque]
 
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1018537154/publikoak/euswn.ps main reference] Agirre E., Ansa O., Arregi X., Arriola J., Díaz de Ilarraza A., Pociello E., Uria L.  2002. Methodological issues in the building of the Basque WordNet: quantitative and qualitative analysis. Proceedings of First International WordNet Conference. pp. 32-40. Mysore (India).
 
  
==Corpora==
+
* [http://matxin.sourceforge.net Matxin] Spanish → Basque
* [http://www.ztcorpusa.net ZT corpus] Basque Corpus of Science and Technology
+
** [http://www.opentrad.org Opentrad demo] es->eu
** [http://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1183368317/publikoak/pdf-format main reference] Areta N., Gurrutxaga A., Leturia I., Alegria I., Artola X., Díaz de Ilarraza A., Ezeiza N., Sologaistoa A.  2007. ZT Corpus: Annotation and tools for Basque corpora Copus Linguistics. Birminghan.
+
** main reference: Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K.  2007. Transfer-based MT from Spanish into Basque: reusability, standardization and open source. LNCS 4394. 374-384. Cicling 2007.
 +
* [http://www.apertium.org Apertium] Basque → Spanish
 +
** Available in [http://xixona.dlsi.ua.es/wiki/index.php/SVN Apertium SVN].
  
 
==Bibliography==
 
==Bibliography==
  
*
+
*[http://ixa.si.ehu.es/Ixa/Argitalpenak/ Ixa group]
  
 
==External links==
 
==External links==
Line 29: Line 67:
 
* [http://ixa.si.ehu.es Ixa group, University of the Basque Country]
 
* [http://ixa.si.ehu.es Ixa group, University of the Basque Country]
  
[[Category:Resources by language|Catalan]]
+
[[Category:Resources by language|Basque]]

Latest revision as of 08:35, 26 May 2014

Lexical resources

  • EDBL - Lexical Data Base for Basque in ELDA
    • demo
    • main reference Aldezabal I., Ansa O., Arrieta B., Artola X., Ezeiza A., Hernández G., Lersundi M. 2001. EDBL: a General Lexical Basis for the Automatic Processing of Basque. IRCS Workshop on linguistic databases. Philadelphia (USA).
  • EuroWordNet - Basque
    • main reference Agirre E., Ansa O., Arregi X., Arriola J., Díaz de Ilarraza A., Pociello E., Uria L. 2002. Methodological issues in the building of the Basque WordNet: quantitative and qualitative analysis. Proceedings of First International WordNet Conference. pp. 32-40. Mysore (India).

Corpora

  • XX century's Basque corpus Basque corpus XX century
  • ZT corpus Basque Corpus of Science and Technology
    • main reference Areta N., Gurrutxaga A., Leturia I., Alegria I., Artola X., Díaz de Ilarraza A., Ezeiza N., Sologaistoa A. 2007. ZT Corpus: Annotation and tools for Basque corpora Copus Linguistics. Birminghan.
  • Corpeus: web as a corpus Basque
    • main reference Leturia I., Gurrutxaga A., Alegria I., Ezeiza A. 2007. CorpEus, a ‘web as corpus’ tool designed for the agglutinative nature of Basque.. WAC3 2007 (Web as a Corpus) workshop.
  • HamleDT, harmonized dependency treebanks of many languages, common annotation style.

Proofing Tools

  • Xuxen: Spelling checker/corrector for Basque MS-Office, OpenOffice, ...
    • main reference Aduriz I., Alegria I., Artola X., Ezeiza N., Sarasola K. 1997. A spelling corrector for Basque based on morphology. Literary & Linguistic Computing, Vol. 12, No. 1. Oxford University Press. Oxford. 1997.

Free software

Morphology

Free software

Proprietary software

  • Morphological analyser/generator for Basque
    • demo
    • main reference Alegria I., Artola X., Sarasola K. 1996. Automatic morphological analysis of Basque. Literary & Linguistic Computing Vol. 11, No. 4, 193-203. Oxford University Press. Oxford. 1996.

Taggers/parsers

  • EusTagger: Lemmatizer/Tagger for Basque
    • demo
    • main reference Ezeiza N., Aduriz I., Alegria I., Arriola J.M., Urizar R. 1998. Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages COLING-ACL'98. Pgs. 380 - 384. Vol 1. Montreal (Canada). August 10-14, 1998.
  • Eihera: Named Entitity Recognizer (NERC) for Basque
    • demo
    • main reference Alegria I., Arregi O., Ezeiza N., Fernandez I., Urizar R. 2004. Design and Development of a Named Entity Recognizer for an Agglutinative Language. First International Joint Conference on NLP (IJCNLP-04). Workshop on Named Entity Recognition.
  • Zatiak: Chunker for Basque
    • demo
    • main reference Aranzabe M., Arriola J.M., Díaz de Ilarraza 2004. Towards a Dependency Parser of Basque. Proceedings of the Coling 2004 Workshop on Recent Advances in Dependency Grammar. Geneva, Switzerland.


Machine translation systems

Free software

  • Matxin Spanish → Basque
    • Opentrad demo es->eu
    • main reference: Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2007. Transfer-based MT from Spanish into Basque: reusability, standardization and open source. LNCS 4394. 374-384. Cicling 2007.
  • Apertium Basque → Spanish

Bibliography

External links