Speaker verification in a noisy environment by enhancing the speech signal using various approaches of spectral subtraction

Enhancement of speech signal degraded by several types of noise is a topic of interest for last many years. The main aim of speech enhancement algorithm is to improve the quality and/or intelligibility of the noisy speech signals by using various techniques and algorithms. Among the all available methods, the spectral subtraction algorithm is the one of the first algorithm proposed for removing additive background noise. This paper describes various noise reduction algorithms to analyse the performance of speaker verification in noisy environment. Speaker verification involves processing the speech signal and authenticating the speaker. The speaker verification process is affected by various factors such as noise, channel mismatch, health condition of the speaker, aging, emotion, fatigue etc. Speech signals from the uncontrolled environment may contain noise along with required speech components, speech signals are degraded by this noise and it renders the performance of speaker verification systems unacceptable. We enhance the speech signal by using the basic spectral subtraction and the modified approaches of spectral subtraction like spectral Subtraction with over subtraction, non-linear spectral subtraction and multiband spectral subtraction. The speech signal has been enhanced by each of the above methods then speaker models are trained with support vector machine using mel frequency cepstral coefficient of each speaker in the system. The performance of the speaker verification system is analysed with enhanced speech signal.

[1]  R. Halavati,et al.  A Novel Noise Immune, Fuzzy Approach to Speaker Independent, Isolated Word Speech Recognition , 2006, 2006 World Automation Congress.

[2]  Hanspeter Schmid,et al.  How to use the FFT and Matlab's pwelch function for signal and noise simulations and measurements , 2012 .

[3]  Shashikant L. Sahare,et al.  Different Approaches of Spectral Subtraction method for Enhancing the Speech Signal in Noisy Environments , 2011 .

[4]  Weaam Alkhaldi,et al.  Automatic speech/speaker recognition in noisy environments using wavelet transform , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[5]  Roberto Togneri,et al.  A psychoacoustic spectral subtraction method for noise suppression in automatic speech recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  G. Bachur,et al.  1 Separation of Voiced and Unvoiced using Zero crossing rate and Energy of the Speech Signal , 2008 .

[7]  François Capman,et al.  Acoustic echo cancellation and noise reduction in the frequency-domain: A global optimisation , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[8]  James R. Glass,et al.  Robust Speaker Recognition in Noisy Conditions , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Shashikant L. Sahare,et al.  Noise estimation Algorithms for Speech Enhancement in highly non-stationary Environments , 2011 .

[10]  S. El-Rabaie,et al.  Speech enhancement with an adaptive Wiener filter , 2013, International Journal of Speech Technology.

[11]  Andrzej Drygajlo,et al.  Speaker verification in noisy environments with combined spectral subtraction and missing feature theory , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).