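Enhancement of Mandarin electrolarynx speech based on voice conversion technology
Cite this article: 董睿, 李立峰, 牛海军, 史晚晴, 李阳. Enhancement of Mandarin electrolarynx speech based on voice conversion technology [J]. 北京生物医学工程, 2015(4).
Authors: 董睿, 李立峰, 牛海军, 史晚晴, 李阳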
Affiliation (all five authors): School of Biomedical Engineering, Beijing University of Aeronautics and Astronautics (北京航空航天大学生物医学工程学院), Beijing 100191
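Abstract: Objective The electrolarynx is the voice restoration tool most widely used by laryngectomy patients, but electrolarynx speech sounds mechanical, has a monotone pitch, and carries strong radiation noise. This paper applies voice conversion technology to improve the sound of electrolarynx speech and raise its naturalness and intelligibility. Methods 200 standard Mandarin everyday sentences, each produced with both natural phonation and an electrolarynx, were selected as the training corpus. A voice conversion method based on the Gaussian mixture model (GMM) was used to convert the electrolarynx speech; the converted parameters were the fundamental frequency (F0) contour and the vocal tract spectral parameters (Mel-cepstral coefficients of order 0 to 24). The quality of the converted speech was then evaluated subjectively and objectively. Results High-frequency radiation noise in the converted speech was effectively suppressed, and variation in F0 appeared. Subjective analysis showed that the naturalness and acceptability of the converted speech improved somewhat, while intelligibility changed little. Conclusions Voice conversion technology can reduce the high-frequency radiation noise of electrolarynx speech, change its tone and prosody information, and improve naturalness and acceptability, which helps considerably to improve the perceived quality of electrolarynx speech.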

Keywords: electrolarynx; Mandarin; voice conversion; speech enhancement

Enhancement of Mandarin electrolarynx speech based on voice conversion technology
Abstract: Objective The electrolarynx (EL) is the most common assistive device for providing a voice to laryngectomees. However, EL speech still has several severe problems, such as extreme unnaturalness and non-negligible radiation noise. In this paper, we study the enhancement of EL speech based on voice conversion (VC) technology in order to improve its naturalness and intelligibility. Methods 200 Mandarin daily utterance pairs, recorded as normal speech and EL speech, served as training data. A Gaussian mixture model (GMM) based method was used to convert the EL speech, and the converted speech was evaluated both subjectively and objectively. The converted features were F0 and spectral parameters (0th through 24th Mel-cepstral coefficients). Results The objective results demonstrated that the VC-based method greatly reduced the radiation noise and brought the F0 contour of Mandarin EL speech closer to that of the target speech. The subjective results indicated that the naturalness and acceptability of Mandarin EL speech improved, while intelligibility showed no significant difference after conversion. Conclusions VC technology can effectively reduce high-frequency radiation noise, supplement tone and rhythm information, and improve the naturalness and acceptability of EL speech, all of which help greatly to improve speech quality.
Keywords: electrolarynx; Mandarin; voice conversion; speech enhancement
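
The abstract names a GMM-based conversion of F0 and Mel-cepstral features but does not say which variant was used. As an illustration only, the sketch below shows the commonly used joint-density formulation, in which a converted value is the posterior-weighted sum of per-component linear regressions. It is reduced to a single scalar feature for brevity; a real system would work on feature vectors with covariance matrices and would fit the parameters on the paired training utterances. The function names, the dictionary keys, and every number in `TOY_GMM` are hypothetical and are not taken from the paper.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional normal distribution."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def convert(x, components):
    """Map one source (EL) feature value to a target (natural) value.

    Each component is a dict with hypothetical keys:
      weight  mixture weight
      mu_x    mean of the source feature
      mu_y    mean of the target feature
      var_x   variance of the source feature
      cov_yx  cross-covariance between target and source
    """
    # Posterior probability of each component given the source value,
    # computed from the source marginal of the joint GMM.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)
    posteriors = [value / total for value in likelihoods]

    # Minimum mean-square-error estimate: each component contributes a
    # linear regression of the target on the source, weighted by its posterior.
    return sum(
        p * (c["mu_y"] + c["cov_yx"] / c["var_x"] * (x - c["mu_x"]))
        for p, c in zip(posteriors, components)
    )


# Toy parameters, invented for illustration (not estimated from any data).
TOY_GMM = [
    {"weight": 0.5, "mu_x": -1.0, "mu_y": 0.0, "var_x": 1.0, "cov_yx": 0.5},
    {"weight": 0.5, "mu_x": 1.0, "mu_y": 2.0, "var_x": 1.0, "cov_yx": 0.5},
]

if __name__ == "__main__":
    for source_value in (-1.0, 0.0, 1.0):
        print(source_value, convert(source_value, TOY_GMM))
```

In this form the function would be applied frame by frame to each converted parameter. A source value far from every component mean can drive all likelihoods to zero, so a practical implementation would compute the posteriors in the log domain.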
This article is indexed in CNKI, 万方数据, and other databases.