
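Value of Mel frequency cepstrum coefficient in voice analysis before and after vocal polyp surgery
Cite this article: LIU Mo, GE Xinying, ZHAO Xiaochang, HAO Qingqing, LI Zufei. Value of Mel frequency cepstrum coefficient in voice analysis before and after vocal polyp surgery[J]. Chinese Journal of Otorhinolaryngology-skull Base Surgery, 2024, 30(2): 102-105.
Authors: LIU Mo  GE Xinying  ZHAO Xiaochang  HAO Qingqing  LI Zufei
Affiliation: Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing 100020, China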
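Abstract: Objective This study aims to explore the clinical value of the Mel frequency cepstrum coefficient (MFCC) in voice analysis before and after vocal polyp surgery by extracting MFCC indices from patients' voices. Methods We retrospectively analyzed 41 patients (31 male, 10 female; mean age 42.9±11.4 years) who underwent vocal polyp surgery between January 2018 and August 2019 and had voice assessments both before surgery and 1 month after surgery. A further 21 normal subjects with neither hoarseness nor vocal fold lesions served as baseline controls. MFCC features were extracted with librosa, a speech-processing package for the Python programming language. The MFCC mean, MFCC variance and MFCC standard deviation were extracted for each patient, and paired-samples t-tests were used to compare each of these features before and after surgery. Results The postoperative MFCC mean (1.25±1.01), MFCC variance (561.34±154.98) and MFCC standard deviation (21.74±4.03) were significantly lower than the preoperative MFCC mean (6.81±2.05), MFCC variance (1019.66±295.87) and MFCC standard deviation (34.37±6.63), and the differences were statistically significant (t=18.596, P=0.000; t=10.338, P=0.000; t=11.852, P=0.000). One month after surgery, the MFCC mean, MFCC variance and MFCC standard deviation of the vocal polyp group did not differ significantly from those of the normal subjects, indicating that voice recovered well in the great majority of patients. Conclusion This study is the first to explore the value of MFCC in voice analysis before and after vocal polyp surgery. MFCC features can serve as indicators for evaluating voice recovery after vocal polyp surgery.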

Keywords: Vocal polyp  Hoarseness  Mel frequency cepstrum coefficient  Voice analysis  Surgery
Received: 2023/2/26

Value of Mel frequency cepstrum coefficient in voice analysis before and after vocal polyp surgery
LIU Mo, GE Xinying, ZHAO Xiaochang, HAO Qingqing, LI Zufei. Value of Mel frequency cepstrum coefficient in voice analysis before and after vocal polyp surgery[J]. Chinese Journal of Otorhinolaryngology-skull Base Surgery, 2024, 30(2): 102-105.
Authors:LIU Mo  GE Xinying  ZHAO Xiaochang  HAO Qingqing  LI Zufei
Institution:Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing 100020, China
Abstract: Objective Mel frequency cepstrum coefficient (MFCC) is widely used in speech recognition, but its use in voice analysis has not been reported in China or abroad. This study aims to assess its value in voice analysis before and after vocal polyp surgery by extracting MFCC indices. Methods A total of 41 patients who underwent vocal polyp surgery in our hospital from January 2018 to August 2019 and received voice evaluation before and 1 month after surgery were retrospectively analyzed. In addition, 21 normal subjects with neither hoarseness nor vocal cord lesions were selected as the baseline control. The librosa speech processing package for the Python programming language was used for MFCC feature extraction. The mean, variance and standard deviation of MFCC were extracted for each patient. The paired-sample t-test was used to compare these MFCC features before and after surgery. Results The enrolled patients included 31 males and 10 females with an average age of (42.9±11.4) years. The mean, variance and standard deviation of MFCC decreased significantly after surgery (preoperative vs postoperative: 6.81±2.05 vs 1.25±1.01, t=18.596, P=0.000; 1019.66±295.87 vs 561.34±154.98, t=10.338, P=0.000; 34.37±6.63 vs 21.74±4.03, t=11.852, P=0.000). The differences in the mean, variance and standard deviation of MFCC between the patients one month after surgery and the normal subjects were not statistically significant, indicating that the voice of most patients recovered well after surgery. Conclusions This study is the first to explore the value of MFCC in voice analysis before and after vocal polyp surgery. MFCC features can be used as indexes to evaluate voice recovery after vocal polyp surgery.
Keywords:Vocal polyp  Hoarseness  Mel frequency cepstrum coefficient  Voice analysis  Surgery
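
The analysis described in the Methods (librosa MFCC extraction, three summary statistics per recording, then a paired t-test) can be sketched roughly as follows. This is an illustrative sketch, not the authors' code. The abstract does not state the number of coefficients, the framing parameters, or whether the statistics were pooled over all coefficients and frames, so `n_mfcc=13`, the pooling in `summarize_mfcc`, the use of population variance, and all function names are assumptions. Only `librosa.load` and `librosa.feature.mfcc` are real library calls.

```python
import math
import statistics


def summarize_mfcc(mfcc_rows):
    """Pool every coefficient of one recording; return (mean, variance, std).

    Assumption: values are pooled over all coefficients and frames and the
    population variance is used. The abstract does not specify either choice.
    """
    values = [v for row in mfcc_rows for v in row]
    mean = statistics.fmean(values)
    var = statistics.pvariance(values, mu=mean)
    return mean, var, math.sqrt(var)


def extract_mfcc_summary(wav_path, n_mfcc=13):
    """Load one recording with librosa and summarize its MFCC matrix.

    n_mfcc=13 is an assumed setting, not taken from the abstract.
    """
    import librosa  # third-party package, imported only when needed

    y, sr = librosa.load(wav_path, sr=None)  # keep the native sample rate
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)  # (n_mfcc, frames)
    return summarize_mfcc(mfcc.tolist())


def paired_t(pre, post):
    """Paired-samples t statistic and degrees of freedom for pre vs post."""
    diffs = [a - b for a, b in zip(pre, post)]
    n = len(diffs)
    mean_d = statistics.fmean(diffs)
    sd_d = statistics.stdev(diffs)  # sample SD of the paired differences
    return mean_d / (sd_d / math.sqrt(n)), n - 1
```

In use, `extract_mfcc_summary` would be called once per preoperative and once per postoperative recording of each patient, and `paired_t` once per feature on the resulting paired lists. A positive t then indicates that the feature fell after surgery.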