Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction

Michaela Regneri, Rui Wang, Manfred Pinkal


Abstract
Paraphrases and paraphrasing algorithms have been found to be of great importance in various natural language processing tasks. While most paraphrase extraction approaches extract equivalent sentences, sentences are an inconvenient unit for further processing because they are too specific and are often not exact paraphrases. Paraphrase fragment extraction is a technique that post-processes sentential paraphrases and prunes them to more convenient phrase-level units. We present a new approach that uses semantic roles to extract paraphrase fragments from sentence pairs that share semantic content to varying degrees, including full paraphrases. In contrast to previous systems, the use of semantic parses allows for extracting paraphrases with high wording variance and different syntactic categories. The approach is tested on four different input corpora and compared to two previous systems for extracting paraphrase fragments. Our system finds three times as many good paraphrase fragments per sentence pair as the baselines, and at the same time outputs 30% fewer unrelated fragment pairs.
Anthology ID:
L14-1134
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
4300–4307
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1195_Paper.pdf
Cite (ACL):
Michaela Regneri, Rui Wang, and Manfred Pinkal. 2014. Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 4300–4307, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction (Regneri et al., LREC 2014)