Hypothesis Mixture Decoding for Statistical Machine Translation

Nan Duan1,  Mu Li2,  Ming Zhou2
1Tianjin University, 2Microsoft Research Asia


Abstract

This paper presents hypothesis mixture decoding (HM decoding), a new decoding scheme that performs translation reconstruction using hypotheses generated by multiple translation systems. HM decoding involves two decoding stages: first, each component system decodes independently, with the explored search space kept for use in the next step; second, a new search space is constructed by composing existing hypotheses produced by all component systems using a set of rules provided by the HM decoder itself, and a new set of model independent features are used to seek the final best translation from this new search space. Few assumptions are made by our approach about the underlying component systems, enabling us to leverage SMT models based on arbitrary paradigms. We compare our approach with several related techniques, and demonstrate significant BLEU improvements in large-scale Chinese-to-English translation tasks.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-1126.pdf