Difference between revisions of "Resources for Italian"

From ACL Wiki
Jump to: navigation, search
(Corpora)
(Corpora)
Line 30: Line 30:
 
* [http://languageserver.uni-graz.at/badip/badip/20_corpusLip.php LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP]
 
* [http://languageserver.uni-graz.at/badip/badip/20_corpusLip.php LIP - Lessico di frequenza dell'Italiano Parlato - Access via BADIP]
 
* [http://multisemcor.itc.it/ MultiSemCor] - English/Italian parallel corpus
 
* [http://multisemcor.itc.it/ MultiSemCor] - English/Italian parallel corpus
 +
* [http://www.uni-duisburg.de/Fak2/FremdPhil/Romanistik/Personal/Burr/humcomp/ Oxford Text Archive Corpus of Italian Newspapers]
 
* [http://tlio.ovi.cnr.it/TLIO/ Tesoro della lingua italiana delle origini (TLIO)]
 
* [http://tlio.ovi.cnr.it/TLIO/ Tesoro della lingua italiana delle origini (TLIO)]
  

Revision as of 14:15, 26 April 2008

Tools for Italian

Tokenisers

POS taggers

Named Entity Recognisers

Temporal Expressions

Parsers

  • Chaos - Robust syntactic parser for Italian and for English

Generators

  • XIG - Interchange to Italian Generator

Resources for Italian

Corpora

Tagsets

  • LemmaPro - Italian POS tagset for LemmaPro

Treebanks

  • ISST - Italian Syntactic-Semantic Treebank
  • TUT - Turin University Treebank
  • VIT - Venice Italian Treebank

WordNets

  • EuroWordNet
  • MultiWordNet - a multilingual lexical database in which the Italian WordNet is strictly aligned with Princeton WordNet 1.6

Lexicons

Links

  • Evalita - Evaluation of NLP tools for Italian