首页 | 本学科首页   官方微博 | 高级检索  
     


Extreme Learning Machine for Huge Hypotheses Re-ranking in Statistical Machine Translation
Authors:Yan Liu  Chi Man Vong  Pak Kin Wong
Affiliation:1.Department of Computer and Information Science,University of Macau,Macau,China;2.Department of Electromechanical Engineering,University of Macau,Macau,China
Abstract:In statistical machine translation (SMT), a possibly infinite number of translation hypotheses can be decoded from a source sentence, among which re-ranking is applied to sort out the best translation result. Undoubtedly, re-ranking is an essential component of SMT for effective and efficient translation. A novel re-ranking method called Scaled Sorted Classification Re-ranking (SSCR) based on extreme learning machine (ELM) classification and minimum error rate training (MERT) is proposed. SSCR contains four steps: (1) the input features are normalized to the range of 0 to 1; (2) an ELM classification model is constructed for hypothesis ranking; (3) each translation hypothesis is ranked using the ELM classification model; and (4) the highest ranked subset of hypotheses are selected, in which the hypothesis with best predicted score based on MERT (system score) is returned as the final translation result. Compared with the baseline score (lower bound), SSCR with ELM classification can raise the translation quality up to 6.7% in IWSLT 2014 Chinese to English corpus. Compared with the state-of-the-art rank boosting, SSCR has a relatively 7.8% of improvement on BLEU in a larger WMT 2015 English-to-French corpus. Moreover, the training time of the proposed method is about 160 times faster than traditional regression-based re-ranking.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号