Difference between revisions of "Resources for Italian"
Jump to navigation
Jump to search
(Added: Araneum) |
(updated and extended references) |
||
Line 13: | Line 13: | ||
====Unknown license==== | ====Unknown license==== | ||
− | * [ | + | * [https://archivium.biz/strumenti/Analisi-Verbi.html Archivium AV] - a verb analyzer for Italian verbs |
+ | * [https://archivium.biz/strumenti/Coniugazione-Verbi.html Archivium CV] - a verb coniugator for Italian verbs | ||
+ | * [https://archivium.biz/strumenti/Analisi-Grammaticale.html Archivium AG] - a morphological analizer for Italian text | ||
+ | Contact m dot mensa at tecnologie-umanistiche dot it for API | ||
=== Named Entity Recognisers === | === Named Entity Recognisers === |
Latest revision as of 14:24, 15 March 2019
Tools for Italian
Tokenisers
POS taggers
Morphology
Free software
- Morph-It! version 0.47 - a free morphological resource for the Italian language, includes SFST sources. LGPL license.
Unknown license
- Archivium AV - a verb analyzer for Italian verbs
- Archivium CV - a verb coniugator for Italian verbs
- Archivium AG - a morphological analizer for Italian text
Contact m dot mensa at tecnologie-umanistiche dot it for API
Named Entity Recognisers
Temporal Expressions
Parsers
- Chaos - Robust syntactic parser for Italian and for English
Generators
- XIG - Interchange to Italian Generator
Resources for Italian
Corpora
- Araneum Italicum, Gigaword Italian web corpus
- ColFIS Corpus e Lessico di Frequenza dell'Italiano Scritto
- Corpus di Italiano Scritto contemporaneo (CORIS/CODIS)
- Europarl corpus, sentence aligned with English
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- Italian plain text and Co-occurrences at LCC
- LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP
- MultiSemCor - English/Italian parallel corpus
- Oxford Text Archive Corpus of Italian Newspapers
- Tesoro della lingua italiana delle origini (TLIO)
Tagsets
- LemmaPro - Italian POS tagset for LemmaPro
Treebanks
- ISST - Italian Syntactic-Semantic Treebank
- TUT - Turin University Treebank
- VIT - Venice Italian Treebank
WordNets
- EuroWordNet
- MultiWordNet - a multilingual lexical database in which the Italian WordNet is strictly aligned with Princeton WordNet 1.6
Lexicons
- PAROLE-SIMPLE-CLIPS - a four-layered, general purpose computational lexicon
Links
- Evalita - Evaluation of NLP tools for Italian