A precedence effect based far-field DoA estimation algorithm

A robust far-field direction-of-arrival (DoA) estimation algorithm is proposed in this paper. The algorithm is inspired by the precedence effect, which underlies humans' robust ability to localize sound sources. It implements the concept of the precedence effect by applying a suitable threshold and an onset detection mechanism in the cross-correlation domain. Experimental results show that cascading the proposed algorithm with a conventional cross-correlation-based DoA algorithm significantly improves the accuracy of DoA estimation in far-field test conditions.
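The abstract does not give implementation details, so the following is only a minimal sketch of how a threshold and an onset gate could be cascaded with a conventional cross-correlation estimator for two microphones. Everything in it is an assumption made for illustration: the use of PHAT weighting, the hypothetical function names `gcc_phat`, `onset_gated_delay`, and `delay_to_angle`, and the parameters `frame_len`, `max_lag`, and `peak_thresh`. A frame is kept only when its correlation peak rises above the threshold after a frame that was below it, so that later frames dominated by reflections are ignored.

```python
import numpy as np

def gcc_phat(x, y, max_lag):
    """PHAT-weighted cross-correlation; a positive lag means y lags x."""
    n = 2 * len(x)                        # zero-pad to avoid circular wrap-around
    X = np.fft.rfft(x, n)
    Y = np.fft.rfft(y, n)
    cross = Y * np.conj(X)
    cross /= np.abs(cross) + 1e-12        # PHAT weighting (assumed, not from the paper)
    cc = np.fft.irfft(cross, n)
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    lags = np.arange(-max_lag, max_lag + 1)
    return lags, cc

def onset_gated_delay(x, y, frame_len=512, max_lag=16, peak_thresh=0.3):
    """Median inter-microphone delay (in samples) over onset frames only.

    A frame counts as an onset when its correlation peak reaches
    peak_thresh and the previous frame's peak did not (a rising edge).
    Returns None when no onset frame is found.
    """
    onset_delays = []
    prev_peak = 0.0
    for start in range(0, len(x) - frame_len + 1, frame_len):
        fx = x[start:start + frame_len]
        fy = y[start:start + frame_len]
        lags, cc = gcc_phat(fx, fy, max_lag)
        k = int(np.argmax(cc))
        peak = float(cc[k])
        if peak >= peak_thresh and prev_peak < peak_thresh:
            onset_delays.append(int(lags[k]))
        prev_peak = peak
    if not onset_delays:
        return None
    return float(np.median(onset_delays))

def delay_to_angle(delay_samples, fs, mic_distance, c=343.0):
    """Far-field (plane-wave) angle in degrees from broadside for a mic pair."""
    s = np.clip(c * delay_samples / (fs * mic_distance), -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))
```

The threshold value and the frame length here are placeholders; in practice they would need tuning to the array geometry and the room, and the paper's own choices may differ.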
