Emanuele Pianta

Also published as: E. Pianta


2012

pdf bib
Key-concept extraction from French articles with KX
Sara Tonelli | Elena Cabrio | Emanuele Pianta
JEP-TALN-RECITAL 2012, Workshop DEFT 2012: DÉfi Fouille de Textes (DEFT 2012 Workshop: Text Mining Challenge)

pdf bib
Making Readability Indices Readable
Sara Tonelli | Ke Tran Manh | Emanuele Pianta
Proceedings of the First Workshop on Predicting and Improving Text Readability for target reader populations

2011

pdf bib
Annotating Events, Temporal Expressions and Relations in Italian: the It-Timeml Experience for the Ita-TimeBank
Tommaso Caselli | Valentina Bartalesi Lenzi | Rachele Sprugnoli | Emanuele Pianta | Irina Prodanof
Proceedings of the 5th Linguistic Annotation Workshop

2010

pdf bib
KX: A Flexible System for Keyphrase eXtraction
Emanuele Pianta | Sara Tonelli
Proceedings of the 5th International Workshop on Semantic Evaluation

pdf bib
A Feature Type Classification for Therapeutic Purposes: A Preliminary Evaluation with Non-Expert Speakers
Gianluca E. Lebani | Emanuele Pianta
Proceedings of the Fourth Linguistic Annotation Workshop

pdf bib
Exploiting Lexical Resources for Therapeutic Purposes: the Case of WordNet and STaRS.sys
Gianluca E. Lebani | Emanuele Pianta
Proceedings of the 2nd Workshop on Cognitive Aspects of the Lexicon

pdf bib
Extending English ACE 2005 Corpus Annotation with Ground-truth Links to Wikipedia
Luisa Bentivogli | Pamela Forner | Claudio Giuliano | Alessandro Marchetti | Emanuele Pianta | Kateryna Tymoshenko
Proceedings of the 2nd Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources

pdf bib
Identifying and Ranking Topic Clusters in the Blogosphere
M. Atif Qureshi | Arjumand Younus | Muhammad Saeed | Nasir Touheed | Emanuele Pianta | Kateryna Tymoshenko
Proceedings of the 2nd Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources

pdf bib
A Resource for Investigating the Impact of Anaphora and Coreference on Inference.
Azad Abad | Luisa Bentivogli | Ido Dagan | Danilo Giampiccolo | Shachar Mirkin | Emanuele Pianta | Asher Stern
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

Discourse phenomena play a major role in text processing tasks. However, so far relatively little study has been devoted to the relevance of discourse phenomena for inference. Therefore, an experimental study was carried out to assess the relevance of anaphora and coreference for Textual Entailment (TE), a prominent inference framework. First, the annotation of anaphoric and coreferential links in the RTE-5 Search data set was performed according to a specifically designed annotation scheme. As a result, a new data set was created where all anaphora and coreference instances in the entailing sentences which are relevant to the entailment judgment are solved and annotated.. A by-product of the annotation is a new “augmented” data set, where all the referring expressions which need to be resolved in the entailing sentences are replaced by explicit expressions. Starting from the final output of the annotation, the actual impact of discourse phenomena on inference engines was investigated, identifying the kind of operations that the systems need to apply to address discourse phenomena and trying to find direct mappings between these operation and annotation types.

pdf bib
VenPro: A Morphological Analyzer for Venetan
Sara Tonelli | Emanuele Pianta | Rodolfo Delmonte | Michele Brunelli
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

This document reports the process of extending MorphoPro for Venetan, a lesser-used language spoken in the Nort-Eastern part of Italy. MorphoPro is the morphological component of TextPro, a suite of tools oriented towards a number of NLP tasks. In order to extend this component to Venetan, we developed a declarative representation of the morphological knowledge necessary to analyze and synthesize Venetan words. This task was challenging for several reasons, which are common to a number of lesser-used languages: although Venetan is widely used as an oral language in everyday life, its written usage is very limited; efforts for defining a standard orthography and grammar are very recent and not well established; despite recent attempts to propose a unified orthography, no Venetan standard is widely used. Besides, there are different geographical varieties and it is strongly influenced by Italian.

2009

pdf bib
Three Issues in Cross-Language Frame Information Transfer
Sara Tonelli | Emanuele Pianta
Proceedings of the International Conference RANLP-2009

pdf bib
Computing Implicit Entities and Events with Getaruns (short paper)
Rodolfo Delmonte | Emanuele Pianta
Proceedings of the Eight International Conference on Computational Semantics

pdf bib
A novel approach to mapping FrameNet lexical units to WordNet synsets (short paper)
Sara Tonelli | Emanuele Pianta
Proceedings of the Eight International Conference on Computational Semantics

2008

pdf bib
Answering Why-Questions in Closed Domains from a Discourse Model
Rodolfo Delmonte | Emanuele Pianta
Semantics in Text Processing. STEP 2008 Conference Proceedings

pdf bib
L-ISA: Learning Domain Specific Isa-Relations from the Web
Alessandra Potrich | Emanuele Pianta
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Automated extraction of ontological knowledge from text corpora is a relevant task in Natural Language Processing. In this paper, we focus on the problem of finding hypernyms for relevant concepts in a specific domain (e.g. Optical Recording) in the context of a concrete and challenging application scenario (patent processing). To this end information available on the Web is exploited. The extraction method includes four mains steps. Firstly, the Google search engine is exploited to retrieve possible instances of isa-patterns reported in the literature. Then, the returned snippets are filtered on the basis of lexico-syntactic criteria (e.g. the candidate hypernym must be expressed as a noun phrase without complex modifiers). In a further filtering step, only candidate hypernyms compatible with the target domain are kept. Finally a candidate ranking mechanism is applied to select one hypernym as output of the algorithm. The extraction method was evaluated on 100 concepts of the Optical Recording domain. Moreover, the reliability of isa-patterns reported in the literature as predictors of isa-relations was assessed by manually evaluating the template instances remaining after lexico-syntactic filtering, for 3 concepts of the same domain. While more extensive testing is needed the method appears promising especially for its portability across different domains.

pdf bib
The TextPro Tool Suite
Emanuele Pianta | Christian Girardi | Roberto Zanoli
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

We present TextPro, a suite of modular Natural Language Processing (NLP) tools for analysis of Italian and English texts. The suite has been designed so as to integrate and reuse state of the art NLP components developed by researchers at FBK. The current version of the tool suite provides functions ranging from tokenization to chunking and Named Entity Recognition (NER). The system’s architecture is organized as a pipeline of processors wherein each stage accepts data from an initial input or from an output of a previous stage, executes a specific task, and sends the resulting data to the next stage, or to the output of the pipeline. TextPro performed the best on the task of Italian NER and Italian PoS Tagging at EVALITA 2007. When tested on a number of other standard English benchmarks, TextPro confirms that it performs as state of the art system. Distributions for Linux, Solaris and Windows are available, for both research and commercial purposes. A web-service version of the system is under development.

pdf bib
Frame Information Transfer from English to Italian
Sara Tonelli | Emanuele Pianta
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

We describe an automatic projection algorithm for transferring frame-semantic information from English to Italian texts as a first sep towards the creation of Italian FrameNet. Given an English text with frame information and its Italian translation, we project the annotation in four steps: first the Italian text is parsed, then English-Italian alignment is automatically carried out at word level, then we extract the semantic head for every annotated constituent on the English corpus side and finally we project annotation from English to Italian using aligned semantic heads as bridge. With our work, we point out typical features of the Italian language as regards frame-semantic annotation, in particular we describe peculiarities of Italian that at the moment make the projection task more difficult than in the above-mentioned examples. Besides, we created a gold standard with 987 manually annotated sentences to evaluate the algorithm.

2007

pdf bib
IRST-BP: Preposition Disambiguation based on Chain Clarifying Relationships Contexts
Octavian Popescu | Sara Tonelli | Emanuele Pianta
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)

2006

pdf bib
Ontology Population from Textual Mentions: Task Definition and Benchmark
Bernardo Magnini | Emanuele Pianta | Octavian Popescu | Manuela Speranza
Proceedings of the 2nd Workshop on Ontology Learning and Population: Bridging the Gap between Text and Knowledge

pdf bib
Representing and Accessing Multilevel Linguistic Annotation using the MEANING Format
Emanuele Pianta | Luisa Bentivogli | Christian Girardi | Bernardo Magnini
Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-Dimensional Markup in Natural Language Processing

pdf bib
I-CAB: the Italian Content Annotation Bank
B. Magnini | E. Pianta | C. Girardi | M. Negri | L. Romano | M. Speranza | V. Bartalesi Lenzi | R. Sprugnoli
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

In this paper we present work in progress for the creation of the Italian Content Annotation Bank (I-CAB), a corpus of Italian news annotated with semantic information at different levels. The first level is represented by temporal expressions, the second level is represented by different types of entities (i.e. person, organizations, locations and geo-political entities), and the third level is represented by relations between entities (e.g. the affiliation relation connecting a person to an organization). So far I-CAB has been manually annotated with temporal expressions, person entities and organization entities. As we intend I-CAB to become a benchmark for various automatic Information Extraction tasks, we followed a policy of reusing already available markup languages. In particular, we adopted the annotation schemes developed for the ACE Entity Detection and Time Expressions Recognition and Normalization tasks. As the ACE guidelines have originally been developed for English, part of the effort consisted in adapting them to the specific morpho-syntactic features of Italian. Finally, we have extended them to include a wider range of entities, such as conjunctions.

2004

pdf bib
Evaluating Cross-Language Annotation Transfer in the MultiSemCor Corpus
Luisa Bentivogli | Pamela Forner | Emanuele Pianta
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

pdf bib
Knowledge Intensive Word Alignment with KNOWA
Emanuele Pianta | Luisa Bentivogli
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

pdf bib
Revising the Wordnet Domains Hierarchy: semantics, coverage and balancing
Luisa Bentivogli | Pamela Forner | Bernardo Magnini | Emanuele Pianta
Proceedings of the Workshop on Multilingual Linguistic Resources

pdf bib
The Italian NESPOLE! Corpus: a Multilingual Database with Interlingua Annotation in Tourism and Medical Domains
Nadia Mana | Roldano Cattoni | Emanuele Pianta | Franca Rossi | Fabio Pianesi | Susanne Burger
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2003

pdf bib
Beyond Lexical Units: Enriching WordNets with Phrasets
Luisa Bentivogli | Emanuele Pianta
10th Conference of the European Chapter of the Association for Computational Linguistics

2002

pdf bib
Opportunistic Semantic Tagging
Luisa Bentivogli | Emanuele Pianta
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

2000

pdf bib
Coping with Lexical Gaps when Building Aligned Multilingual Wordnets
Luisa Bentivogli | Emanuele Pianta | Fabio Pianesi
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)

pdf bib
Generating from a discourse model
Rodolfo Delmonte | Dario Bianchi | Emanuele Pianta
Proceedings of the International Conference on Machine Translation and Multilingual Applications in the new Millennium: MT 2000

1993

pdf bib
Undestanding Stories in Different Languages with GETA-RUN
Dario Bianchi | Rodolfo Delmonte | Emanuele Pianta
Sixth Conference of the European Chapter of the Association for Computational Linguistics