Position and Trajectory Learning for Microphone Arrays

In this paper, we tackle the problem of source localization by example. We present a methodology that allows a user to train a microphone array system using signals from a set of positions and trajectories and subsequently recall the localization information when presented with new input signals. To do so we present a new statistical model which is capable of accurately describing features from the cross spectra of the microphone signals so as to model the room responses from all positions of interest. We further extend this model to allow modeling of sequences of positions, thereby also enabling the learning and recognition of trajectories. Because of its learning nature this method provides practical advantages in setting up a microphone array, by not requiring favorable room acoustics, careful element positioning or uniformity of sensors. It also introduces an approach to localization which can be extended to other problems requiring models of transfer functions. We present tests on synthetic and real-world data and present the resulting recognition rates for a variety of situations

[1]  Julius O. Smith,et al.  Closed-form least-squares source location estimation from range-difference measurements , 1987, IEEE Trans. Acoust. Speech Signal Process..

[2]  Michael S. Brandstein,et al.  A practical time-delay estimator for localizing speech sources with a microphone array , 1995, Comput. Speech Lang..

[3]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[4]  F. A. Sakarya,et al.  Speaker localization for far-field and near-field wideband sources using neural networks , 1999, NSIP.

[5]  Maurizio Omologo,et al.  Acoustic event localization using a crosspower-spectrum phase based technique , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Benesty,et al.  Adaptive eigenvalue decomposition algorithm for passive acoustic source localization , 2000, The Journal of the Acoustical Society of America.

[7]  Stanley T. Birchfield,et al.  Fast Bayesian acoustic localization , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  J. Smith,et al.  The spherical interpolation method for closed-form passive source localization using range difference measurements , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Brian M. Sadler,et al.  WIDEBAND ARRAY PROCESSING ALGORITHMS FOR ACOUSTIC TRACKING OF GROUND VEHICLES , 1998 .

[10]  G. Carter,et al.  Special issue on time delay estimation , 1980 .

[11]  Durand R. Begault,et al.  3-D Sound for Virtual Reality and Multimedia Cambridge , 1994 .

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  Jeffrey Glenn Borish,et al.  Electronic simulation of auditorium acoustics , 1984 .

[14]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[15]  R. Schmidt A New Approach to Geometry of Range Difference Location , 1972, IEEE Transactions on Aerospace and Electronic Systems.