A new cepstral prefiltering technique for estimating time delay under reverberant conditions

Abstract A microphone array can be used for hands-free acquisition of speech under reverberant conditions. This requires knowledge about the desired talker location, which can be obtained by estimating the time delays between the signals received by one or more pairs of spatially separated microphones. However, in a typical audio-conference room, strong reverberation is usually present and can have disastrous effects on the performance of conventional time delay estimation (TDE) methods. In this article, we present and evaluate a new cepstral prefiltering technique which can be applied on the received signals before the actual TDE in order to obtain a more accurate estimate of the delay in a typical reverberant environment. The technique is based on the estimation and the subtraction of the minimum-phase component (MPC) of the channel cepstrum from the total cepstrum of each microphone signal. So, in the same way that it is necessary in certain TDE methods to estimate the power spectral densities of the signals of interest from the received data, the new method requires the estimation of the channel MPC in the cepstral domain. The performances of a TDE system with and without cepstral prefiltering are compared via Monte-Carlo simulations for fixed random and speech sources as well as for a moving random source. The results clearly demonstrate the beneficial effects of the new cepstral prefiltering technique on TDE performance when the source is fixed or slowly moving.

[1]  K. C. Ho,et al.  A novel constrained algorithm for delay estimation in the presence of multipath transmissions , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Yiu-Tong Chan,et al.  Constrained adaptation for time delay estimation with multipath propagation , 1991 .

[3]  Delores M. Etter,et al.  Multiple short-length adaptive filters for time-varying echo cancellations , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  P. Peterson Simulating the response of multiple microphones to a single acoustic source in a reverberant room. , 1986, The Journal of the Acoustical Society of America.

[5]  James L. Flanagan,et al.  Autodirective Microphone Systems , 1991 .

[6]  Julius O. Smith,et al.  Adaptive multipath delay estimation , 1984, IEEE Trans. Acoust. Speech Signal Process..

[7]  John Mourjopoulos On the variation and invertibility of room impulse response functions , 1985 .

[8]  Jont B. Allen,et al.  Invertibility of a room impulse response , 1979 .

[9]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[10]  J. Ianniello High resolution multipath time delay estimation for broadband random signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  P. C. Ching,et al.  Non-stationary time delay estimation with a multipath , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[12]  G. Carter,et al.  A Practical Approach to the Estimation of Amplitude and Time-Delay Parameters of a Composite Signal , 1987 .

[13]  Stephen A. Dyer,et al.  Digital signal processing , 2018, 8th International Multitopic Conference, 2004. Proceedings of INMIC 2004..

[14]  Mikio Tohyama,et al.  Source waveform recovery in a reverberant space by cepstrum dereverberation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[16]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[17]  Benoît Champagne,et al.  Performance of time-delay estimation in the presence of room reverberation , 1996, IEEE Trans. Speech Audio Process..

[18]  Benoit Champagne Simulation of the response of multiple microphones to a moving point source , 1994 .

[19]  Harvey F. Silverman,et al.  A two-stage algorithm for determining talker location from linear microphone array data , 1992 .