A baseline algorithm for estimating talker orientation using acoustical data from a large-aperture microphone array

We have shown that knowing the orientation of a talker in a large-aperture microphone array system can significantly improve location-estimation and beamforming algorithms. Measurements of a talker in an anechoic chamber have shown significant anisotropy in radiation patterns that may be used to influence the selection, processing, and weighting of microphone signals in such algorithms. We introduce a simple method for determining the orientation of a talker within a large focal area using only acoustic energy data obtained from the array. The mathematical basis for this procedure is presented and computed performance, based solely on acoustical measurements in a real environment, are listed and discussed.

[1]  J. Flanagan Analog Measurements of Sound Radiation from the Mouth , 1960 .

[2]  J. Flanagan,et al.  Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .

[3]  Harvey F Silverman,et al.  Factors affecting the performance of large-aperture microphone arrays. , 2002, The Journal of the Acoustical Society of America.

[4]  James L. Flanagan,et al.  The huge microphone array , 1998, IEEE Concurr..

[5]  Norbert Krüger,et al.  Determination of face position and pose with a learned representation based on labelled graphs , 1997, Image Vis. Comput..

[6]  James L. Flanagan,et al.  A digital processing system for source location and sound capture by large microphone arrays , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Koichiro Deguchi,et al.  Head pose determination from one image using a generic model , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[8]  Thomas S. Huang,et al.  Head Pose Computation for Very Low Bit-rate Video Coding , 1995, CAIP.

[9]  Michael S. Brandstein,et al.  A hybrid real-time face tracking system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).