Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals

In many practical environments we wish to extract several desired speech signals, which are contaminated by nonstationary and stationary interfering signals. The desired signals may also be subject to distortion imposed by the acoustic room impulse responses (RIRs). In this paper, a linearly constrained minimum variance (LCMV) beamformer is designed for extracting the desired signals from multimicrophone measurements. The beamformer satisfies two sets of linear constraints. One set is dedicated to maintaining the desired signals, while the other set is chosen to mitigate both the stationary and nonstationary interferences. Unlike classical beamformers, which approximate the RIRs as delay-only filters, we take into account the entire RIR [or its respective acoustic transfer function (ATF)]. The LCMV beamformer is then reformulated in a generalized sidelobe canceler (GSC) structure, consisting of a fixed beamformer (FBF), blocking matrix (BM), and adaptive noise canceler (ANC). It is shown that for spatially white noise field, the beamformer reduces to a FBF, satisfying the constraint sets, without power minimization. It is shown that the application of the adaptive ANC contributes to interference reduction, but only when the constraint sets are not completely satisfied. We show that relative transfer functions (RTFs), which relate the desired speech sources and the microphones, and a basis for the interference subspace suffice for constructing the beamformer. The RTFs are estimated by applying the generalized eigenvalue decomposition (GEVD) procedure to the power spectral density (PSD) matrices of the received signals and the stationary noise. A basis for the interference subspace is estimated by collecting eigenvectors, calculated in segments where nonstationary interfering sources are active and the desired sources are inactive. The rank of the basis is then reduced by the application of the orthogonal triangular decomposition (QRD). This procedure relaxes the common requirement for nonoverlapping activity periods of the interference sources. A comprehensive experimental study in both simulated and real environments demonstrates the performance of the proposed beamformer.

[1]  Jacob Benesty,et al.  On Microphone-Array Beamforming From a MIMO Acoustic Signal Processing Perspective , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[3]  Satoshi Nakamura,et al.  Speech enhancement based on the subspace method , 2000, IEEE Trans. Speech Audio Process..

[4]  Antonio Cantoni,et al.  Derivative constraints for broad-band element space antenna array processors , 1983 .

[5]  J. Shynk Frequency-domain and multirate adaptive filtering , 1992, IEEE Signal Processing Magazine.

[6]  Israel Cohen,et al.  System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Marc Moonen,et al.  Combined frequency-domain dereverberation and noise reduction technique for multi-microphone speech enhancement , 2001 .

[8]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[9]  Saeed Gazor,et al.  Robust adaptive beamforming via target tracking , 1996, IEEE Trans. Signal Process..

[10]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[11]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[12]  Reinhold Häb-Umbach,et al.  Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[14]  Jean-Francois Cardoso,et al.  Eigen-structure of the fourth-order cumulant tensor with application to the blind source separation problem , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[15]  Dennis R. Morgan,et al.  Exploring permutation inconsistency in blind separation of speech signals in a reverberant environment , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[16]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[17]  Marc Moonen,et al.  Spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction , 2003, Signal Process..

[18]  Israel Cohen,et al.  Dual-Source Transfer-Function Generalized Sidelobe Canceller , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Marc Moonen,et al.  GSVD-based optimal filtering for single and multimicrophone speech enhancement , 2002, IEEE Trans. Signal Process..

[20]  Sharon Gannot,et al.  Adaptive Beamforming and Postfiltering , 2008 .

[21]  Israel Cohen,et al.  Speech enhancement based on the general transfer function GSC and postfiltering , 2003, IEEE Transactions on Speech and Audio Processing.

[22]  Sofiène Affes,et al.  A signal subspace tracking algorithm for microphone array processing of speech , 1997, IEEE Trans. Speech Audio Process..

[23]  Israel Cohen,et al.  Performance analysis of dual source transfer-function generalized sidelobe canceller , 2007, Speech Commun..

[24]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[25]  Jose C. Principe,et al.  A unifying criterion for blind source separation and decorrelation: simultaneous diagonalization of correlation matrices , 1997, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop.

[26]  B. Widrow,et al.  Adaptive noise cancelling: Principles and applications , 1975 .

[27]  O. L. Frost,et al.  An algorithm for linearly constrained adaptive array processing , 1972 .

[28]  Sofiène Affes,et al.  Wideband multi-source beamforming with adaptive array location calibration and direction finding , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[29]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[30]  T. Ens,et al.  Blind signal separation : statistical principles , 1998 .

[31]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[32]  B. Breed,et al.  A short proof of the equivalence of LCMV and GSC beamforming , 2002, IEEE Signal Processing Letters.

[33]  Bin Yang,et al.  Projection approximation subspace tracking , 1995, IEEE Trans. Signal Process..

[34]  Lucas C. Parra,et al.  Convolutive blind separation of non-stationary sources , 2000, IEEE Trans. Speech Audio Process..

[35]  Schuster,et al.  Separation of a mixture of independent signals using time delayed correlations. , 1994, Physical review letters.

[36]  Sofiène Affes,et al.  An algorithm for multi-source beamforming and multi-target tracking: Further results , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[37]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[38]  Jacob Benesty,et al.  Springer handbook of speech processing , 2007, Springer Handbooks.

[39]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[40]  Sofiène Affes,et al.  An algorithm for multisource beamforming and multitarget tracking , 1996, IEEE Trans. Signal Process..

[41]  J. Flanagan,et al.  Microphone arrays for speech processing , 1995, Proceedings of ISSE'95 - International Symposium on Signals, Systems and Electronics.

[42]  I. Cohen Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator , 2002, IEEE Signal Processing Letters.

[43]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[44]  S. Gannot,et al.  A comparison between alternative beamforming strategies for interference cancelation in noisy and reverberant environment , 2008, 2008 IEEE 25th Convention of Electrical and Electronics Engineers in Israel.