Postfiltering Using Multichannel Spectral Estimation in Multispeaker Environments

This paper investigates the problem of enhancing a single desired speech source from a mixture of signals in multispeaker environments. A beamformer structure is proposed that combines a fixed beamformer with a postfilter. In the first stage, a fixed multiobjective optimal beamformer is designed to spatially extract the desired source while suppressing all other sources. In the second stage, a multichannel power spectral estimator is proposed and incorporated into the postfilter to provide further suppression. The combined scheme exploits both the spatial and the spectral characteristics of the signals. Two new multichannel spectral estimation methods are proposed for the postfilter, based on inner products and on joint diagonalization, respectively. Evaluations using recordings from a real-room environment show that the proposed beamformer achieves good interference suppression while keeping distortion of the desired source low.
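The two-stage structure described above (a fixed beamformer followed by a spectral postfilter) can be illustrated with a minimal sketch. This is not the paper's method: the abstract does not specify the multiobjective beamformer design or the inner-product and joint-diagonalization estimators, so the sketch substitutes a classical Zelinski-type postfilter, which estimates the desired-source power from cross-channel spectra under the assumption that the noise is uncorrelated between microphones. The function name `postfiltered_beamformer`, the array shapes, and the assumption that the channels are already time-aligned to the desired source are all hypothetical choices made for illustration.

```python
import numpy as np


def postfiltered_beamformer(X, weights, eps=1e-12):
    """Fixed beamformer followed by a Zelinski-type postfilter (illustrative only).

    X       : complex array (M, F, T), STFT of M channels that are assumed to be
              time-aligned to the desired source.
    weights : complex array (M, F), fixed beamformer weights per frequency bin.
    Returns (Y_post, G): postfiltered output (F, T) and real gain per bin (F,).
    """
    M = X.shape[0]

    # Stage 1: fixed beamformer, y(f, t) = w(f)^H x(f, t).
    Y = np.einsum("mf,mft->ft", weights.conj(), X)

    # Stage 2a: auto power spectrum, averaged over channels and frames.
    auto = np.mean(np.abs(X) ** 2, axis=(0, 2))

    # Stage 2b: cross power spectrum, averaged over all ordered channel pairs.
    # Uses the identity sum_{i != j} X_i X_j^* = |sum_i X_i|^2 - sum_i |X_i|^2.
    pair_sum = np.abs(X.sum(axis=0)) ** 2 - np.sum(np.abs(X) ** 2, axis=0)
    cross = pair_sum.mean(axis=1) / (M * (M - 1))

    # Wiener-like gain: estimated source power over total power, kept in [0, 1].
    G = np.clip(cross / (auto + eps), 0.0, 1.0)
    return G[:, None] * Y, G


# Synthetic check: one coherent source plus spatially uncorrelated noise.
rng = np.random.default_rng(0)
M, F, T = 4, 8, 2000
source = (rng.standard_normal((F, T)) + 1j * rng.standard_normal((F, T))) / np.sqrt(2)
noise = (rng.standard_normal((M, F, T)) + 1j * rng.standard_normal((M, F, T))) / np.sqrt(2)
uniform = np.full((M, F), 1.0 / M, dtype=complex)  # plain averaging beamformer

Y_clean, G_clean = postfiltered_beamformer(np.broadcast_to(source, (M, F, T)), uniform)
Y_noisy, G_noisy = postfiltered_beamformer(source[None] + noise, uniform)
Y_noise, G_noise = postfiltered_beamformer(noise, uniform)
```

With no noise the gain is close to one and the source passes through unchanged; with equal source and noise power per channel the gain settles near 0.5, the Wiener value for 0 dB input SNR; with noise only it falls toward zero. A postfilter of this kind relies on the noise being uncorrelated across microphones, an assumption that competing speakers do not satisfy.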
