An environment aware ML estimation of acoustic radiation pattern with distributed microphone pairs

This paper presents a parametric approach to classify the radiation pattern of an acoustic source given the signals captured by multiple microphones. The radiation pattern influences the way the acoustic waves propagate within an enclosure, with direct implications on the behavior of most audio processing algorithms. In particular, the Generalized Cross-Correlation PHAse Transform is affected by the emission pattern as well as by the orientation of the source. A Maximum Likelihood estimator is introduced by using descriptors of the acoustic characteristics of the environment, e.g. wall absorption coefficients and room dimensions, from which models of the observed Generalized Cross-Correlation PHAse Transform are derived for a specific emission pattern. A generic unimodal source directivity is modeled using a parameterized cardioid function. A sub-band implementation is proposed to account for the frequency dependence of the source emission pattern. Experiments on simulated and real data show that the acoustic radiation pattern can be estimated in an effective way under noisy and reverberant conditions.

[1]  John McDonough,et al.  Distant Speech Recognition , 2009 .

[2]  Terence Betlehem,et al.  Acoustic beamforming exploiting directionality of human speech sources , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[4]  Masashi Unoki,et al.  A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions , 2003, 2004 12th European Signal Processing Conference.

[5]  J. Flanagan Analog Measurements of Sound Radiation from the Mouth , 1960 .

[6]  Alessio Brutti,et al.  Speaker localization based on oriented global coherence field , 2006, INTERSPEECH.

[7]  Seiichi Nakagawa,et al.  Automatic estimation of position and orientation of an acoustic source by a microphone array network. , 2009, The Journal of the Acoustical Society of America.

[8]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[9]  Masahito Togami,et al.  Head orientation estimation of a speaker by utilizing kurtosis of a DOA histogram with restoration of distance effect , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Deborah Estrin,et al.  Collaborative Sensor Networking Towards Real-Time Acoustical Beamforming in Free-Space and Limited Reverberance , 2004, IEEE Trans. Mob. Comput..

[11]  Cha Zhang,et al.  Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[13]  Mohan M. Trivedi,et al.  Source localization in reverberant environments: modeling and statistical analysis , 2003, IEEE Trans. Speech Audio Process..

[14]  Douglas L. Jones,et al.  Blind estimation of reverberation time. , 2003, The Journal of the Acoustical Society of America.

[15]  Emanuel A. P. Habets,et al.  An online quasi-Newton algorithm for blind SIMO identification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[16]  Alessio Brutti,et al.  Inference of acoustic source directivity using environment awareness , 2011, 2011 19th European Signal Processing Conference.

[17]  Richard Heusdens,et al.  A Statistical Room Impulse Response Model with Frequency Dependent Reverberation Time for Single-Microphone Late Reverberation Suppression , 2011, INTERSPEECH.

[18]  Harvey F. Silverman,et al.  Characterization of talker radiation pattern using a microphone array , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  Kazuhiro Nakadai,et al.  Sound source tracking with directivity pattern estimation using a 64 ch microphone array , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  Jyri Huopaniemi,et al.  Measurement and modeling techniques for directional sound radiation from the mouth , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[21]  Alessio Brutti,et al.  Oriented global coherence field for the estimation of the head orientation in smart rooms equipped with distributed microphone arrays , 2005, INTERSPEECH.

[22]  Mohan M. Trivedi,et al.  Role of head pose estimation in speech acquisition from distant microphones , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[23]  Emanuel A. P. Habets,et al.  Blind estimation of reverberation time based on the distribution of signal decay rates , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Yusuke Hioka,et al.  Estimation of sound source orientation using eigenspace of spatial correlation matrix , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.