Single-channel noise reduction via semi-orthogonal transformations and reduced-rank filtering

A general framework is developed that combines semi-orthogonal transformation and reduced-rank filtering for noise reduction.Under this new framework, several optimal reduced-rank filters are derived, including the maximum SNR, the Wiener, the tradeoff, and the MVDR filters.Discussions are also provided on how to derive different semi-orthogonal transformations under four estimation criteria, including minimum correlation, minimum MSE, minimum distortion, and minimum residual noise.Simulations are performed and the results show the properties of the deduced optimal reduced-rank filters. This paper investigates the problem of single-channel noise reduction in the time domain. The objective is to find a lower dimensional filter that can yield a noise reduction performance as close as possible to or even better than that obtained by the full-rank solution. This is achieved in three steps. First, we transform the observation signal vector sequence, through a semi-orthogonal matrix, into a sequence of transformed signal vectors with a reduced dimension. Second, a reduced-rank filter is applied to get an estimate of the clean speech in the transformed domain. Third, the estimate of the clean speech in the time domain is obtained by an inverse semi-orthogonal transformation. The focus of this paper is on the derivation of semi-orthogonal transformations under certain estimation criteria in the first step and the design of the reduced-rank optimal filters that can be used in the second step. We show how noise reduction using the principle of rank reduction can be cast as an optimal filtering problem, and how different semi-orthogonal transformations affect the noise reduction performance. Simulations are performed under various conditions to validate the deduced filters for noise reduction.

[1]  Jacob Benesty,et al.  A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  J. Scott Goldstein,et al.  Reduced-rank adaptive filtering , 1997, IEEE Trans. Signal Process..

[3]  Louis L. Scharf,et al.  A Multistage Representation of the Wiener Filter Based on Orthogonal Projections , 1998, IEEE Trans. Inf. Theory.

[4]  Van Trees,et al.  Detection, Estimation, and Modulation Theory. Part 1 - Detection, Estimation, and Linear Modulation Theory. , 1968 .

[5]  Jacob Benesty,et al.  Noise Reduction in Speech Processing , 2009 .

[6]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[7]  Louis L. Scharf,et al.  Rank reduction for modeling stationary signals , 1987, IEEE Trans. Acoust. Speech Signal Process..

[8]  R. Kumaresan,et al.  Singular value decomposition and improved frequency estimation using linear prediction , 1982 .

[9]  Jacob Benesty,et al.  Single-channel noise reduction using unified joint diagonalization and optimal filtering , 2014, EURASIP J. Adv. Signal Process..

[10]  Jiasong Mu,et al.  Throat polyp detection based on compressed big data of voice with support vector machine algorithm , 2014, EURASIP Journal on Advances in Signal Processing.

[11]  Håkan Johansson,et al.  An Approach for Synthesis of Modulated-Channel FIR Filter Banks Utilizing the Frequency-Response Masking Technique , 2006, EURASIP J. Adv. Signal Process..

[12]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[13]  Sabine Van Huffel,et al.  Enhanced resolution based on minimum variance estimation and exponential data modeling , 1993, Signal Process..

[14]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[15]  Jacob Benesty,et al.  Optimal Time-Domain Noise Reduction Filters - A Theoretical Study , 2011, Springer Briefs in Electrical and Computer Engineering.

[16]  Louis L. Scharf,et al.  The SVD and reduced rank signal processing , 1991, Signal Process..

[17]  Søren Holdt Jensen,et al.  FIR filter representations of reduced-rank noise reduction , 1998, IEEE Trans. Signal Process..

[18]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[19]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[20]  Nam C. Phamdo,et al.  Signal/noise KLT based approach for enhancing speech degraded by colored noise , 2000, IEEE Trans. Speech Audio Process..

[21]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[22]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[23]  Bart De Moor,et al.  The singular value decomposition and long and short spaces of noisy matrices , 1993, IEEE Trans. Signal Process..

[24]  Saeed Gazor,et al.  An adaptive KLT approach for speech enhancement , 2001, IEEE Trans. Speech Audio Process..

[25]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[26]  Jun Huang,et al.  An energy-constrained signal subspace method for speech enhancement and recognition in white and colored noises , 1998, Speech Commun..

[27]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series , 1964 .

[28]  R. Kumaresan,et al.  Data adaptive signal estimation by singular value decomposition of a data matrix , 1982, Proceedings of the IEEE.

[29]  Jacob Benesty,et al.  A reduced-rank approach to single-channel noise reduction , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[30]  Jun Huang,et al.  A DCT-based fast signal subspace technique for robust speech recognition , 2000, IEEE Trans. Speech Audio Process..

[31]  George Carayannis,et al.  Speech enhancement from noise: A regenerative approach , 1991, Speech Commun..

[32]  Søren Holdt Jensen,et al.  Subspace-Based Noise Reduction for Speech Signals via Diagonal and Triangular Matrix Decompositions: Survey and Analysis , 2007, EURASIP J. Adv. Signal Process..

[33]  Søren Holdt Jensen,et al.  Reduction of broad-band noise in speech by truncated QSVD , 1995, IEEE Trans. Speech Audio Process..