语音识别技术在咳嗽声音自动识别中的应用 Application of speech recognition technology in the automatic recognition of cough sounds期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

语音识别技术在咳嗽声音自动识别中的应用

引用本文：	马璇,郑则广,陈荣昌,田联房,莫鸿强,魏栋,李寅环,赖克方,钟南山. 语音识别技术在咳嗽声音自动识别中的应用[J]. 中华生物医学工程杂志, 2010, 16(6). DOI: 10.3760/cma.j.issn.1674-1927.2010.06.013

作者姓名：	马璇郑则广陈荣昌田联房莫鸿强魏栋李寅环赖克方钟南山

作者单位：	1. 广州医学院第一附属医院广州呼吸疾病研究所呼吸疾病国家重点实验室,510120 2. 华南理工大学自动化学院

摘要：	目的利用双门限法、Mel频率倒普系数(MFCC)法及矢量量化(VQ)法的语音识别技术对咳嗽声音进行自动识别.方法在安静环境下,对5例健康成年人和15例咳嗽患者的非咳嗽和咳嗽声音进行录音,分别随机分为训练样本和测试样本.训练样本用于生成咳嗽识别软件的码本,并用该码本对测试样本进行自动识别分析.同时与人工识别的结果进行对比,计算敏感性、特异性,记录两种方法的识别时间.结果用于码本生成的咳嗽声音和非咳嗽声音均为200次,测试样本的咳嗽和非咳嗽声音分别为375次和125次.人工识别和通过码本自动识别测试样本的时间分别为33 min 18 s和1min 35 s;码本自动识别咳嗽声音的敏感性和特异性分别为98.93%和100%.结论基于VQ的双门限法及MFCC法可用于咳嗽声音的自动识别.
关键词：	咳嗽语言识别软件模式识别,自动敏感性与特异性人工识别
Application of speech recognition technology in the automatic recognition of cough sounds

MA Xuan,ZHENG Ze-guang,CHEN Rong-chang,TIAN Lian-fang,MO Hong-qiang,WEI Dong,LI Yin-huan,LAI Ke-fang,ZHONG Nan-shan. Application of speech recognition technology in the automatic recognition of cough sounds[J]. Chinese Journal of Biomedical Engineering, 2010, 16(6). DOI: 10.3760/cma.j.issn.1674-1927.2010.06.013

Authors:	MA Xuan ZHENG Ze-guang CHEN Rong-chang TIAN Lian-fang MO Hong-qiang WEI Dong LI Yin-huan LAI Ke-fang ZHONG Nan-shan

Abstract:	Objective To investigate the feasibility of speech recognition technologies including double threshold method, Mel frequency cepstral coefficient (MFCC) and vector quantization (VQ) in automatic recognition of cough sounds. Methods Five healthy adults and 15 patients with cough were recruited for recording of both non- cough and cough sounds in a quiet environment. The records were randomized into training and testing samples. The training samples were used to generate the code book for cough recognition software, which was then used to recognize and analyze the testing samples automatically.The sensitivity and specificity of recognition were calculated by comparison with outcomes from human ear recognition. The recognition time in two approaches was recorded. Results Two hundred cough and 200 noncough sound samples were used to generate the code book for cough recognition software, while 375 cough sound samples and 125 non-cough sound samples were used as the testing samples. The recognition time of the testing samples needed was 33 minutes and 18 seconds by human ear recognition vs 1 minute and 35seconds by code book-based automatic recognition. In addition, the sensitivity and specificity in code bookbased automatic recognition of the cough sound were 98.93% and 100% respectively. Conclusion The double threshold method based on VQ and MFCC appears feasible in automatic recognition of cough sounds.

Keywords:	Cough Speech recognition software Pattern recognition,automatic Sensitivity and specificity Human ear recognition
本文献已被万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏