Difference between revisions of "Resources for Italian"
Jump to navigation
Jump to search
Line 30: | Line 30: | ||
* [http://languageserver.uni-graz.at/badip/badip/20_corpusLip.php LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP] | * [http://languageserver.uni-graz.at/badip/badip/20_corpusLip.php LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP] | ||
* [http://multisemcor.itc.it/ MultiSemCor] - English/Italian parallel corpus | * [http://multisemcor.itc.it/ MultiSemCor] - English/Italian parallel corpus | ||
+ | * [http://www.uni-duisburg.de/Fak2/FremdPhil/Romanistik/Personal/Burr/humcomp/ Oxford Text Archive Corpus of Italian Newspapers] | ||
* [http://tlio.ovi.cnr.it/TLIO/ Tesoro della lingua italiana delle origini (TLIO)] | * [http://tlio.ovi.cnr.it/TLIO/ Tesoro della lingua italiana delle origini (TLIO)] | ||
Revision as of 13:15, 26 April 2008
Tools for Italian
Tokenisers
POS taggers
Named Entity Recognisers
Temporal Expressions
Parsers
- Chaos - Robust syntactic parser for Italian and for English
Generators
- XIG - Interchange to Italian Generator
Resources for Italian
Corpora
- ColFIS Corpus e Lessico di Frequenza dell'Italiano Scritto
- Corpus di Italiano Scritto contemporaneo (CORIS/CODIS)
- Italian plain text and Co-occurrences at LCC
- LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP
- MultiSemCor - English/Italian parallel corpus
- Oxford Text Archive Corpus of Italian Newspapers
- Tesoro della lingua italiana delle origini (TLIO)
Tagsets
- LemmaPro - Italian POS tagset for LemmaPro
Treebanks
- ISST - Italian Syntactic-Semantic Treebank
- TUT - Turin University Treebank
- VIT - Venice Italian Treebank
WordNets
- EuroWordNet
- MultiWordNet - a multilingual lexical database in which the Italian WordNet is strictly aligned with Princeton WordNet 1.6
Lexicons
- PAROLE-SIMPLE-CLIPS - a four-layered, general purpose computational lexicon
Links
- Evalita - Evaluation of NLP tools for Italian