Theoretical analysis of musical noise in nonlinear noise reduction based on higher-order statistics

In this paper, we review a musical-noise-generation analysis of nonlinear noise reduction techniques with using higher-order statistics (HOS). Recently, an objective metric based on HOS to analyze nonlinear artifacts, i.e., musical noise, caused by nonlinear noise reduction techniques has been proposed. Such metric enables us to perform objective comparison of any nonlinear methods from the perspective of the amount of musical noise generated. Furthermore, such metric enables us to control the musical noise generated by nonlinear noise reduction techniques. In the paper, first, the mathematical principle of the analysis for the amount of musical noise based on HOS is described, and analyses and comparison examples of typical nonlinear noise reduction techniques are demonstrated. Next, it is clarified that to find a fixed point in HOS leads to no-musical noise property in noise reduction. Finally, several expansions on the theory are discussed.

[1]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[2]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[3]  Kiyohiro Shikano,et al.  Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Kiyohiro Shikano,et al.  Speech enhancement in presence of diffuse background noise: Why using blind signal extraction? , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Kiyohiro Shikano,et al.  Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Joseph Sylvester Chang,et al.  A parametric formulation of the generalized spectral subtraction method , 1998, IEEE Trans. Speech Audio Process..

[7]  Kiyohiro Shikano,et al.  Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series, with Engineering Applications , 1949 .

[9]  Kiyohiro Shikano,et al.  Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[11]  Kiyohiro Shikano,et al.  Musical-noise-free speech enhancement: Theory and evaluation , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Kohei Yamashita,et al.  Spectral subtraction iterated with weighting factors , 2002, Speech Coding, 2002, IEEE Workshop Proceedings..

[13]  Kiyohiro Shikano,et al.  Theoretical analysis of parametric blind spatial subtraction array and its application to speech recognition performance prediction , 2011, 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays.

[14]  Gary W. Elko,et al.  Microphone array systems for hands-free telecommunication , 1996, Speech Commun..

[15]  Kondo Kazunobu,et al.  Mathematical Metric of Musical Noise for Various Nonlinear Speech Enhancement Algorithms , 2012 .

[16]  Kiyohiro Shikano,et al.  Musical-Noise Analysis in Methods of Integrating Microphone Array and Spectral Subtraction Based on Higher-Order Statistics , 2010, EURASIP J. Adv. Signal Process..

[17]  Kiyohiro Shikano,et al.  Theoretical Analysis of Musical Noise Generation in Noise Reduction Methods with Decision-Directed a Priori SNR Estimator , 2012, IWAENC.

[18]  Kiyohiro Shikano,et al.  Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Kiyohiro Shikano,et al.  Musical-noise-free blind speech extraction using ICA-based noise estimation and iterative spectral subtraction , 2012, 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA).

[20]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[21]  Joon-Hyuk Chang,et al.  Statistical modeling of speech signals based on generalized gamma distribution , 2005, IEEE Signal Process. Lett..

[22]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[23]  Kiyohiro Shikano,et al.  BLIND SPATIAL SUBTRACTION ARRAY WITH INDEPENDENT COMPONENT ANALYSIS FOR HANDS-FREE SPEECH RECOGNITION , 2006 .

[24]  E. Stacy A Generalization of the Gamma Distribution , 1962 .