Semi-supervised Relation Extraction with Large-scale Word Clustering

Ang Sun,  Ralph Grishman,  Satoshi Sekine
New York University


Abstract

We present a simple semi-supervised relation extraction system with large-scale word clustering. We focus on systematically exploring the effectiveness of different cluster-based features. We also propose several statistical methods for selecting clusters at an appropriate level of granularity. When training on different sizes of data, our semi-supervised approach consistently outperformed a state-of-the-art supervised baseline system.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-1053.pdf