Source localization in reverberant environments: modeling and statistical analysis

Room reverberation is typically the main obstacle for designing robust microphone-based source localization systems. The purpose of the paper is to analyze the achievable performance of acoustical source localization methods when room reverberation is present. To facilitate the analysis, we apply well known results from room acoustics to develop a simple but useful statistical model for the room transfer function. The properties of the statistical model are found to correlate well with results from real data measurements. The room transfer function model is further applied to analyze the statistical properties of some existing methods for source localization. In this respect we consider especially the asymptotic error variance and the probability of an anomalous estimate. A noteworthy outcome of the analysis is that the so-called PHAT time-delay estimator is shown to be optimal among a class of cross-correlation based time-delay estimators. To verify our results on the error variance and the outlier probability we apply the image method for simulation of the room transfer function.

[1]  P. Peterson Simulating the response of multiple microphones to a single acoustic source in a reverberant room. , 1986, The Journal of the Acoustical Society of America.

[2]  Maurizio Omologo,et al.  Acoustic source location in noisy and reverberant environment using CSP analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Steven A. Tretter,et al.  Optimum processing for delay-vector estimation in passive signal arrays , 1973, IEEE Trans. Inf. Theory.

[4]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[5]  M. Schroeder Frequency‐Correlation Functions of Frequency Responses in Rooms , 1962 .

[6]  Petre Stoica,et al.  Performance study of conditional and unconditional direction-of-arrival estimation , 1990, IEEE Trans. Acoust. Speech Signal Process..

[7]  Benoît Champagne,et al.  Performance of time-delay estimation in the presence of room reverberation , 1996, IEEE Trans. Speech Audio Process..

[8]  Michael S. Brandstein,et al.  A hybrid real-time face tracking system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[9]  Michael S. Brandstein,et al.  A practical methodology for speech source localization with microphone arrays , 1997, Comput. Speech Lang..

[10]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[11]  Rodney A. Kennedy,et al.  Equalization in an acoustic reverberant environment: robustness results , 2000, IEEE Trans. Speech Audio Process..

[12]  Michael S. Brandstein,et al.  A closed-form location estimator for use with room environment microphone arrays , 1997, IEEE Trans. Speech Audio Process..

[13]  Maurizio Omologo,et al.  Use of the crosspower-spectrum phase in acoustic event location , 1997, IEEE Trans. Speech Audio Process..

[14]  Mohan M. Trivedi,et al.  Analysis of time-delay estimation in reverberant environments , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[16]  H. C. Schau,et al.  Passive source localization employing intersecting spherical surfaces from time-of-arrival differences , 1987, IEEE Trans. Acoust. Speech Signal Process..

[17]  M. Brandstein,et al.  Multi-source face tracking with audio and visual data , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[18]  Ea-Ee Jan,et al.  Sound source localization in reverberant environments using an outlier elimination algorithm , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[19]  John P. Ianniello High-resolution multipath time delay estimation for broad-band random signals , 1988, IEEE Trans. Acoust. Speech Signal Process..

[20]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[21]  J. Ianniello High resolution multipath time delay estimation for broadband random signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  J. Ianniello,et al.  Time delay estimation via cross-correlation in the presence of large estimation errors , 1982 .

[23]  M. S. Brandstein A pitch-based approach to time-delay estimation of reverberant speech , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[24]  Steven Kay,et al.  Fundamentals Of Statistical Signal Processing , 2001 .