Delay and Predict Equalization for Blind Speech Dereverberation

In this paper, we consider the blind multichannel dereverberation problem for a single source. The multichannel reverberation impulse response is assumed to be stationary enough to allow estimation of the correlations it induces from the received signals. It is well-known that a single-input multi-output (SIMO) filter can be equalized blindly by applying multichannel linear prediction (LP) to its output when the input is white. When the input is colored, the multichannel linear prediction will both equalize the reverberation filter and whiten the source. We exploit the channel spatial diversity, and the speech signal non-stationarity to estimate the source correlation structure, which can hence be used to determine a source whitening filter. Multichannel linear prediction is then applied to the sensor signals filtered by the source whitening filter, to obtain source dereverberation. Particular attention is paid to the alignment of the received signals on the various microphones. This leads to an increase in the prediction performance, and allows the use of shorter predictor. The proposed approach represents hence a paradigm shift from the delay-and-sum beamformer to the delay-and-predict equalizer

[1]  P. Peterson Simulating the response of multiple microphones to a single acoustic source in a reverberant room. , 1986, The Journal of the Acoustical Society of America.

[2]  Peter Kabal,et al.  Reverberant speech enhancement using cepstral processing , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[3]  J. Flanagan,et al.  Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .

[4]  Les E. Atlas,et al.  Acoustic diversity for improved speech recognition in reverberant environments , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Patrick A. Naylor,et al.  Multi-microphone speech dereverberation using spatio-temporal averaging , 2004, 2004 12th European Signal Processing Conference.

[6]  Rodney A. Kennedy,et al.  Equalization in an acoustic reverberant environment: robustness results , 2000, IEEE Trans. Speech Audio Process..

[7]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[8]  Henrique S. Malvar,et al.  Speech dereverberation via maximum-kurtosis subband adaptive filtering , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[9]  Constantinos B. Papadias,et al.  Fractionally spaced equalization of linear polyphase channels and related blind techniques based on multichannel linear prediction , 1999, IEEE Trans. Signal Process..

[10]  Mahdi Triki Blind dereverberation of quasi-periodic sources based on multichannel linear prediction , 2005 .

[11]  D. Ward,et al.  ON THE USE OF LINEAR PREDICTION FOR DEREVERBERATION OF SPEECH , 2003 .

[12]  Masato Miyoshi,et al.  Inverse filtering of room acoustics , 1988, IEEE Trans. Acoust. Speech Signal Process..

[13]  A. E. Pearson,et al.  On time delay estimation involving received signals , 1984 .

[14]  T. Hikichi,et al.  Blind dereverberation based on estimates of signal transmission channels without precise information on channel order [speech processing applications] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[15]  Walter Kellermann,et al.  TRINICON: a versatile framework for multichannel blind signal processing , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  Subbarayan Pasupathy,et al.  Time delay estimation via generalized correlation with adaptive spatial prefiltering , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Ea-Ee Jan,et al.  Spatially selective sound capture for speech and audio processing , 1993, Speech Commun..

[18]  Jacob Benesty,et al.  An adaptive blind SIMO identification approach to joint multichannel time delay estimation , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19]  D. Slock,et al.  BLIND DEREVERBERATION OF A SINGLE SOURCE BASED ON MULTICHANNEL LINEAR PREDICTION , 2005 .

[20]  Jacob Benesty,et al.  Adaptive eigenvalue decomposition algorithm for real time acoustic source localization system , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[21]  Dirk T. M. Slock From sinusoids in noise to blind deconvolution in communications , 1997 .

[22]  Peter J. W. Rayner,et al.  Blind single channel deconvolution using nonstationary signal processing , 2003, IEEE Trans. Speech Audio Process..

[23]  A. Oppenheim,et al.  Nonlinear filtering of multiplied and convolved signals , 1968 .