Head orientation estimation of a speaker by utilizing kurtosis of a DOA histogram with restoration of distance effect

In this paper, we propose a method for estimating a speaker's head orientation from multichannel acoustic signals. The sharpness of a direction-of-arrival (DOA) histogram obtained with a sparseness-based DOA estimation method varies with the head orientation of the speaker, and the proposed method exploits this phenomenon to estimate that orientation. The method uses more than two microphone arrays. In addition to estimating the speaker location, it computes the kurtosis of the DOA histogram of each array and treats this kurtosis as a measure of the histogram's sharpness. However, kurtosis also depends on the distance between the speaker and the microphone array (the distance effect). We characterize the distance effect experimentally by regression analysis. The head orientation is then estimated from the restored kurtosis, from which the distance effect has been removed. Experimental results in a reverberant environment show that the proposed method estimates the head orientation of a speaker more accurately than a conventional head-orientation estimation method.
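
The two quantities named above, histogram kurtosis and its distance-restored version, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the regression model or the restoration rule, so the linear fit, the subtraction-based restoration, the reference distance, and all function names below (`doa_kurtosis`, `fit_distance_effect`, `restore_kurtosis`) are assumptions made for illustration only.

```python
import numpy as np

def doa_kurtosis(doa_deg):
    """Pearson kurtosis (m4 / m2**2) of per-bin DOA estimates given in degrees.

    Moments are taken about the circular mean so that a peak near
    +/-180 degrees is not split in two by angle wrapping.
    """
    doa = np.deg2rad(np.asarray(doa_deg, dtype=float))
    center = np.arctan2(np.sin(doa).mean(), np.cos(doa).mean())
    # Deviations from the center, wrapped into [-pi, pi).
    dev = (doa - center + np.pi) % (2 * np.pi) - np.pi
    m2 = np.mean(dev ** 2)
    m4 = np.mean(dev ** 4)
    return m4 / m2 ** 2

def fit_distance_effect(distances, kurtoses):
    """Fit kurtosis = slope * distance + intercept (ASSUMED linear model)."""
    slope, intercept = np.polyfit(distances, kurtoses, 1)
    return slope, intercept

def restore_kurtosis(kurt, distance, slope, ref_distance=1.0):
    """Map a measured kurtosis to its predicted value at ref_distance.

    ASSUMPTION: the distance effect is additive, so it is removed by
    subtracting the fitted trend between distance and ref_distance.
    """
    return kurt - slope * (distance - ref_distance)
```

Under these assumptions, each array's restored kurtosis would be compared across arrays, with a larger value suggesting that the speaker faces more directly toward that array. How the method described here actually maps restored kurtosis values to an orientation angle is not specified in this abstract.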
