Bayesian Word Alignment for Statistical Machine Translation

Coskun Mermer1 and Murat Saraclar2
1TUBITAK-BILGEM, 2Bogazici University


Abstract

In this work, we compare the translation performance of word alignments obtained via Bayesian inference to those obtained via expectation-maximization (EM). We propose a Gibbs sampler for fully Bayesian inference in IBM Model 1, integrating over all possible parameter values in finding the alignment distribution. We show that Bayesian inference outperforms EM in all of the tested language pairs, domains and data set sizes, by up to 2.99 BLEU points. We also show that the proposed method effectively addresses the well-known rare word problem in EM-estimated models; and at the same time induces a much smaller dictionary of bilingual word-pairs.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-2032.pdf