Extreme Learning Machine for Huge Hypotheses Re-ranking in Statistical Machine Translation |
| |
Authors: | Yan Liu Chi Man Vong Pak Kin Wong |
| |
Affiliation: | 1.Department of Computer and Information Science,University of Macau,Macau,China;2.Department of Electromechanical Engineering,University of Macau,Macau,China |
| |
Abstract: | In statistical machine translation (SMT), a possibly infinite number of translation hypotheses can be decoded from a source sentence, among which re-ranking is applied to sort out the best translation result. Undoubtedly, re-ranking is an essential component of SMT for effective and efficient translation. A novel re-ranking method called Scaled Sorted Classification Re-ranking (SSCR) based on extreme learning machine (ELM) classification and minimum error rate training (MERT) is proposed. SSCR contains four steps: (1) the input features are normalized to the range of 0 to 1; (2) an ELM classification model is constructed for hypothesis ranking; (3) each translation hypothesis is ranked using the ELM classification model; and (4) the highest ranked subset of hypotheses are selected, in which the hypothesis with best predicted score based on MERT (system score) is returned as the final translation result. Compared with the baseline score (lower bound), SSCR with ELM classification can raise the translation quality up to 6.7% in IWSLT 2014 Chinese to English corpus. Compared with the state-of-the-art rank boosting, SSCR has a relatively 7.8% of improvement on BLEU in a larger WMT 2015 English-to-French corpus. Moreover, the training time of the proposed method is about 160 times faster than traditional regression-based re-ranking. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|