Musical-noise-free blind speech extraction integrating microphone array and iterative spectral subtraction

Abstract: We propose a musical-noise-free blind speech extraction method that uses a microphone array and is designed for nonstationary noise. Our previous study found that optimized iterative spectral subtraction (SS) enhances speech while generating almost no musical noise, but that method is valid only for stationary noise. The proposed method combines two components: iterative blind dynamic noise estimation, for example by independent component analysis (ICA) or multichannel Wiener filtering, and musical-noise-free speech extraction by modified iterative SS. In the latter, iterative SS is applied to each channel separately, so that the multichannel property is preserved and can be reused by the dynamic noise estimators. In relation to the proposed method, we also discuss the justification for applying ICA to signals that have been nonlinearly distorted by SS. Objective and subjective evaluations simulating a real-world hands-free speech communication system show that the proposed method outperforms the conventional methods.
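The iterative SS idea mentioned in the abstract can be illustrated with a minimal single-channel sketch. This is not the authors' implementation. It assumes a power-domain subtraction rule with an oversubtraction parameter `beta` and a flooring parameter `eta`, and it re-estimates the noise in each iteration from frames assumed to contain noise only, which corresponds to the stationary-noise setting. The proposed method would instead obtain the noise estimate from a blind multichannel estimator such as ICA. The function names, parameter values, and the `noise_frames` argument are hypothetical.

```python
import numpy as np


def spectral_subtraction(power, noise_power, beta=1.0, eta=0.1):
    """One pass of power-domain spectral subtraction (illustrative form).

    power       : (frames, bins) power spectrogram of the observed signal
    noise_power : (1, bins) estimated noise power per frequency bin
    beta        : oversubtraction parameter (assumed value)
    eta         : flooring parameter in [0, 1] (assumed value)
    """
    subtracted = power - beta * noise_power
    floor = (eta ** 2) * power  # floor applied in the power domain
    return np.maximum(subtracted, floor)


def iterative_ss(power, noise_frames, n_iter=5, beta=1.0, eta=0.1):
    """Apply weak spectral subtraction repeatedly.

    noise_frames selects frames assumed to contain noise only; the noise
    estimate is recomputed from the processed signal in every iteration.
    """
    current = np.array(power, dtype=float)
    for _ in range(n_iter):
        # Re-estimate the residual noise power from the noise-only frames.
        noise_power = current[noise_frames].mean(axis=0, keepdims=True)
        current = spectral_subtraction(current, noise_power, beta, eta)
    return current


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic example: 100 frames, 16 bins; speech-like energy after frame 50.
    observed = rng.exponential(1.0, size=(100, 16))
    observed[50:] += 10.0
    enhanced = iterative_ss(observed, slice(0, 50), n_iter=3)
    print("mean noise power before:", observed[:50].mean())
    print("mean noise power after: ", enhanced[:50].mean())
```

In a multichannel variant, the same loop would run on each channel separately, and the per-channel outputs would be passed back to the noise estimator between iterations.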
