Noise estimation using negentropy based voice-activity detector

This paper presents a noise-robust voice activity detector (VAD) based on a chaos measure of the quasi-stationary segments of speech in the frequency domain. The basic idea is that adding noise to clean speech disorganizes the speech segments less than the silent or paused segments, so that in a noisy speech signal the spectral content of the speech segments is less chaotic than that of the silent segments. Negentropy can serve as a measure of this difference in organization, and it is used to separate noisy speech frames from noise-only frames. The algorithm is almost independent of the SNR level of the noisy speech signal.
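The idea can be illustrated with a minimal sketch; it is not the paper's implementation. It assumes the widely used log-cosh approximation of negentropy, J(y) ≈ (E[log cosh y] − E[log cosh ν])², with ν a standard Gaussian variable, applied to the standardized real and imaginary FFT coefficients of each frame. The function names, the frame length, the hop size, and the threshold rule (a multiple of the mean score over leading frames assumed to contain only noise) are all hypothetical choices made for illustration.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss


def gaussian_logcosh_mean(order=64):
    """E[log cosh(v)] for v ~ N(0, 1), computed by Gauss-Hermite quadrature."""
    nodes, weights = hermegauss(order)  # weight function is exp(-x^2 / 2)
    return float(np.sum(weights * np.log(np.cosh(nodes))) / np.sqrt(2.0 * np.pi))


def frame_negentropy(signal, frame_len=256, hop=128):
    """Approximate negentropy of the spectral coefficients of each frame.

    Assumption: the log-cosh approximation stands in for whatever
    negentropy estimator the paper actually uses.
    """
    signal = np.asarray(signal, dtype=float)
    g_ref = gaussian_logcosh_mean()
    window = np.hanning(frame_len)
    scores = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        spectrum = np.fft.rfft(signal[start:start + frame_len] * window)
        # Pool real and imaginary parts, skipping the DC and Nyquist bins.
        coeffs = np.concatenate([spectrum.real[1:-1], spectrum.imag[1:-1]])
        std = coeffs.std()
        if std == 0.0:  # digitally silent frame: no structure to measure
            scores.append(0.0)
            continue
        y = (coeffs - coeffs.mean()) / std  # zero mean, unit variance
        scores.append((np.mean(np.log(np.cosh(y))) - g_ref) ** 2)
    return np.array(scores)


def speech_flags(scores, noise_frames=10, factor=5.0):
    """Flag frames whose negentropy exceeds a noise-derived threshold.

    Assumption: the first `noise_frames` frames contain noise only, and
    `factor` is an arbitrary illustrative margin, not a tuned value.
    """
    threshold = factor * np.mean(scores[:noise_frames])
    return scores > threshold
```

Frames dominated by Gaussian-like noise yield spectral coefficients that are themselves close to Gaussian, so their score stays near zero, while frames containing structured (for example, harmonic) content produce a few dominant coefficients and a larger score. Frames flagged as noise-only could then be used to update a noise estimate, which is the application the title refers to.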
