Difference between revisions of "Resources for Italian"

From ACL Wiki
Jump to navigation Jump to search
(fixed broken link)
(updated and extended references)
 
(8 intermediate revisions by 7 users not shown)
Line 9: Line 9:
  
 
===Morphology===
 
===Morphology===
* [http://sslmitdev-online.sslmit.unibo.it/linguistics/morph-it.php Morph-It! version 0.47] - a free morphological resource for the Italian language
+
====Free software====
* [http://archivium.biz/ dic_it: il Verbiario] - a free morphological analizer and verb coniugator for Italian verbs
+
* [http://sslmitdev-online.sslmit.unibo.it/linguistics/morph-it.php Morph-It! version 0.47] - a free morphological resource for the Italian language, includes [[SFST]] sources. [[LGPL]] license.
 +
 
 +
====Unknown license====
 +
* [https://archivium.biz/strumenti/Analisi-Verbi.html Archivium AV] - a verb analyzer for Italian verbs
 +
* [https://archivium.biz/strumenti/Coniugazione-Verbi.html Archivium CV] - a verb coniugator for Italian verbs  
 +
* [https://archivium.biz/strumenti/Analisi-Grammaticale.html Archivium AG] - a morphological analizer for Italian text
 +
Contact m dot mensa at tecnologie-umanistiche dot it for API
  
 
=== Named Entity Recognisers ===
 
=== Named Entity Recognisers ===
Line 29: Line 35:
 
<!-- Please keep this list in alphabetical order -->
 
<!-- Please keep this list in alphabetical order -->
  
 +
* [http://ucts.uniba.sk/aranea_about/ Araneum Italicum], Gigaword Italian web corpus
 
* [http://www.istc.cnr.it/material/database/colfis/ ColFIS Corpus e Lessico di Frequenza dell'Italiano Scritto]
 
* [http://www.istc.cnr.it/material/database/colfis/ ColFIS Corpus e Lessico di Frequenza dell'Italiano Scritto]
 
* [http://corpus.cilta.unibo.it:8080/coris_ita.html Corpus di Italiano Scritto contemporaneo (CORIS/CODIS)]
 
* [http://corpus.cilta.unibo.it:8080/coris_ita.html Corpus di Italiano Scritto contemporaneo (CORIS/CODIS)]
 +
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 +
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
* [http://corpora.informatik.uni-leipzig.de/ Italian plain text and Co-occurrences at LCC]
 
* [http://corpora.informatik.uni-leipzig.de/ Italian plain text and Co-occurrences at LCC]
 
* [http://languageserver.uni-graz.at/badip/badip/20_corpusLip.php LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP]
 
* [http://languageserver.uni-graz.at/badip/badip/20_corpusLip.php LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP]

Latest revision as of 14:24, 15 March 2019

Tools for Italian

Tokenisers

POS taggers

Morphology

Free software

Unknown license

Contact m dot mensa at tecnologie-umanistiche dot it for API

Named Entity Recognisers

Temporal Expressions

Parsers

  • Chaos - Robust syntactic parser for Italian and for English

Generators

  • XIG - Interchange to Italian Generator

Resources for Italian

Corpora

Tagsets

  • LemmaPro - Italian POS tagset for LemmaPro

Treebanks

  • ISST - Italian Syntactic-Semantic Treebank
  • TUT - Turin University Treebank
  • VIT - Venice Italian Treebank

WordNets

  • EuroWordNet
  • MultiWordNet - a multilingual lexical database in which the Italian WordNet is strictly aligned with Princeton WordNet 1.6

Lexicons

Links

  • Evalita - Evaluation of NLP tools for Italian