Monaural Azimuth Localization Using Spectral Dynamics of Speech

We address the task of localizing speech signals in the horizontal plane using monaural cues. We show that the monaural cues embedded in speech are efficiently captured by amplitude modulation spectrum patterns. Using these patterns, a linear Support Vector Machine can exploit direction-related information to discriminate and classify sound location at high resolution. We also propose a straightforward and robust way of integrating information from the two ears: each ear is treated as an independent processor, and the information is integrated at the decision level, which resolves ambiguity to a large extent.
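The decision-level integration described above can be illustrated with a minimal sketch. It assumes that each ear's classifier outputs one decision score per candidate azimuth and that the two ears are combined by summing those scores; the abstract does not specify the combination rule, so the summation, the function name `fuse_decisions`, the azimuth grid, and all score values are hypothetical and chosen only for illustration.

```python
from typing import Dict


def fuse_decisions(left_scores: Dict[int, float],
                   right_scores: Dict[int, float]) -> int:
    """Return the azimuth (in degrees) with the highest summed score.

    Each argument maps a candidate azimuth to the decision score produced
    by that ear's classifier. Summing the scores is an assumed fusion rule,
    not one confirmed by the source.
    """
    if left_scores.keys() != right_scores.keys():
        raise ValueError("both ears must score the same set of azimuths")
    combined = {az: left_scores[az] + right_scores[az] for az in left_scores}
    return max(combined, key=combined.get)


# Hypothetical scores: the left ear alone barely prefers -30 over +30,
# while the right ear clearly favors +30.
left = {-60: -0.5, -30: 1.0, 0: 0.1, 30: 0.9, 60: -0.7}
right = {-60: -1.2, -30: -0.8, 0: 0.0, 30: 1.1, 60: 0.4}

left_only = max(left, key=left.get)      # decision from the left ear alone
fused = fuse_decisions(left, right)      # decision after combining both ears
print(left_only, fused)
```

In this made-up case the left ear on its own is nearly tied between two directions, and adding the right ear's scores separates them. Other fusion rules, such as multiplying calibrated probabilities or voting, would fit the same description.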
