Performance of 3D speaker localization using a small array of microphones

Automatic speaker localization is an important feature in applications such as videoconferencing and distance learning, where usually only a small array of microphones is available. It allows the camera to focus automatically on the person who is speaking. In this work, we compare several adaptive filtering algorithms configured as time delay estimators to locate the talker in 3D space. We provide simulations showing their performance and tracking capability.
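As an illustration of what "an adaptive filter configured as a time delay estimator" can look like, the following minimal sketch uses a plain LMS filter on a pair of simulated microphone signals. The abstract does not name the algorithms compared, so LMS is only an assumed representative here; the function names, filter length, step size, and white-noise source are all hypothetical choices made for the example. The filter adapts so that its output, driven by the first microphone, matches the second microphone; the index of the largest filter weight is then taken as the delay in samples.

```python
import random


def simulate_microphone_pair(true_delay=5, n_samples=4000, noise_std=0.1, seed=0):
    """Create two noisy signals; the second is the source delayed by true_delay samples.

    The source is white Gaussian noise, an assumption made only to keep the sketch simple.
    """
    rng = random.Random(seed)
    source = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    mic_a = [source[n] + noise_std * rng.gauss(0.0, 1.0) for n in range(n_samples)]
    mic_b = [
        (source[n - true_delay] if n >= true_delay else 0.0)
        + noise_std * rng.gauss(0.0, 1.0)
        for n in range(n_samples)
    ]
    return mic_a, mic_b


def lms_delay_estimate(reference, delayed, num_taps=16, step_size=0.01):
    """Estimate the integer delay of `delayed` relative to `reference` with LMS.

    Returns the index of the largest-magnitude weight and the final weight vector.
    The delay must be non-negative and smaller than num_taps.
    """
    weights = [0.0] * num_taps
    for n in range(num_taps - 1, len(reference)):
        # Tap-delay line, most recent sample first: taps[k] = reference[n - k]
        taps = [reference[n - k] for k in range(num_taps)]
        output = sum(w * u for w, u in zip(weights, taps))
        error = delayed[n] - output
        # Standard LMS update: w <- w + mu * e * u
        weights = [w + step_size * error * u for w, u in zip(weights, taps)]
    delay = max(range(num_taps), key=lambda k: abs(weights[k]))
    return delay, weights


if __name__ == "__main__":
    mic_a, mic_b = simulate_microphone_pair(true_delay=5)
    delay, _ = lms_delay_estimate(mic_a, mic_b)
    print("estimated delay (samples):", delay)
```

In a localization setting, delays estimated this way for several microphone pairs would be combined geometrically to obtain a 3D position; that step is not shown. Sub-sample delays would also need interpolation of the weight vector, which this sketch omits.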
