Novel windowing technique of MFCC for speaker identification with Modified Polynomial Classifiers

Speech is one of the most popular parameter used to identify a speaker by her spoken phrase. Feature extraction from speech is a necessary first step in a speaker identification process. Traditionally computation of the Mel Frequency Cepstral Coefficient (MFCC) features use hamming window, as a preprocessing step to reduce spectral leakages. However, hamming window results in reasonable side lobes along with the desired main lobe. This paper deals with a modified algorithm to compute MFCC feature by integrating phase as well as slope information in computing power spectrum. A modified training algorithm is used to train the polynomial classifier which is used for speaker identification. Experimental results using Matlab show that the novel windowing technique with Modified Polynomial Classifier shows consistently better performance over hamming window. There is an improvement in the accuracy of identification especially for large database with low memory usage. It also reduces computational complexity.

[1]  Sunil Kumar Kopparapu,et al.  Effect of noise-in-speech on MFCC parameters , 2009 .

[2]  Yunlong Wang,et al.  An Effective Approach to Finding Differentiator Window Functions Based on Sinc Sum Function , 2012, Circuits Syst. Signal Process..

[3]  M. Mottaghi-Kashtiban,et al.  New efficient window function, replacement for the hamming window , 2011 .

[4]  Dusan M. Kodek,et al.  Using asymmetric windows in automatic speech recognition , 2007, Speech Commun..

[5]  A. M. Ahmad,et al.  Comparative Study of Speaker Recognition Methods : DTW , GMM and SVM , 2008 .

[6]  William M. Campbell,et al.  Speaker recognition with polynomial classifiers , 2002, IEEE Trans. Speech Audio Process..

[7]  Douglas E. Sturim,et al.  Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[8]  Goutam Saha,et al.  A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition , 2012, IEEE Signal Processing Letters.

[9]  Saudi Arabia,et al.  FAST ALGORITHM FOR NOISY SPEAKER RECOGNITION USING ANN , 2014 .

[10]  Douglas E. Sturim,et al.  SVM Based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  Haizhou Li,et al.  Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Rajesh M. Hegde,et al.  Significance of the Modified Group Delay Feature in Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  V. Tiwari MFCC and its applications in speaker recognition , 2010 .

[14]  Longbiao Wang,et al.  Speaker Identification and Verification by Combining MFCC and Phase Information , 2009, IEEE Transactions on Audio, Speech, and Language Processing.