首页 | 本学科首页   官方微博 | 高级检索  
     


The application of Gammatone frequency cepstral coefficients for forensic voice comparison under noisy conditions
Authors:Huapeng Wang
Affiliation:1. Department of Forensic Science &2. Technology, Criminal Investigation Police University of China , Shenyang, China;3. Chongqing Institutes of Higher Education Key Forensic Science Laboratory , Chongqing, China
Abstract:ABSTRACT

Compared with humans, who have more powerful auditory ability in discriminating and identifying speakers in noisy environments, traditional forensic automatic speaker recognizers do not perform well when dealing with noisy recordings. This paper proposes a GMM-UBM Forensic Automatic Speaker Recognition (FASR) System to reduce the effect of noise on performance. The system uses Gammatone Frequency Cepstral Coefficients (GFCC) based on an auditory periphery model and also incorporates a Principal Component Analysis (PCA) algorithm. The system was tested and validated using Mandarin voice databases compromised with different levels of white noise and office noise. The performance of the system was compared with a baseline system using Mel Frequency Cepstral Coefficients (MFCC) and also PCA under the same conditions. The results show that the performance of the combined GFCC system achieved a substantial improvement when compared with the baseline MFCC system under conditions of a high level of office noise.
Keywords:Forensic voice comparison  likelihood ratio  GFCC  MFCC  PCA
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号