Nonparametric Bayesian Machine Transliteration with Synchronous Adaptor Grammars

Yun Huang1,  Min Zhang2,  Chew Lim Tan1
1Department of Computer Science, National University of Singapore, 2Human Language Department, Institute for Infocomm Research, A-STAR


Abstract

Machine transliteration is defined as automatic phonetic translation of names across languages. In this paper, we propose synchronous adaptor grammar, a novel nonparametric Bayesian learning approach, for machine transliteration. This model provides a general framework without heuristic or restriction to automatically learn syllable equivalents between languages. The proposed model outperforms the state-of-the-art EM-based model in the English to Chinese transliteration task.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-2094.pdf