A probability model for interaural phase difference

In this paper, we derive a probability model for interaural phase differences at individual spectrogram points. Such a model can combine observations across arbitrary time and frequency regions in a structured way and does not make any assumptions about the characteristics of the sound sources. In experiments with speech from twenty speakers in simulated reverberant environments, this probabilistic method predicted the correct interaural delay of a signal more accurately than generalized cross-correlation methods.

[1]  A. Zeiberg,et al.  Lateralization of complex binaural stimuli: a weighted-image model. , 1988, The Journal of the Acoustical Society of America.

[2]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[3]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[4]  B. Shinn-Cunningham,et al.  Neural representation of source direction in reverberant space , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[5]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[6]  Barbara G. Shinn-Cunningham,et al.  Auditory Localization in Rooms: Acoustic Analysis and Behavior , 2002 .

[7]  Barbara G Shinn-Cunningham,et al.  Localizing nearby sound sources in a classroom: binaural room impulse responses. , 2005, The Journal of the Acoustical Society of America.

[8]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .