The Technology of Binaural Listening & Understanding : Paper ICA 2016-445 Exploiting envelope fluctuations to achieve robust extraction and intelligent integration of binaural cues

The human auditory system achieves remarkably robust communication performance, even in complex environments featuring multiple talkers, distracting noises, echoes, and reverberation. Although the neural mechanisms of this facility are not well understood, many studies point to the importance of binaural and spatial cues present at sound onset or during other fluctuations of the temporal envelope. Specifically, transient increases in the amplitude envelope appear to trigger the sampling of binaural information, independent of binaural-cue type or frequency range. This paper begins with a review of the psychophysical and neural evidence for such a triggering process, and an exploration of signal-processing algorithms that mimic and/or exploit that process. Such algorithms can be applied in two key directions of importance to communication acoustics: First, temporal envelopes are used to guide the strategic application of spatial cues in spatial sound synthesis for human listeners. Second, temporal fluctuations are used to guide the extraction of spatial cues from binaural recordings and intelligently group those cues into temporally and spatially coherent binaural proto-objects. These applications provide critical tests of the triggering hypothesis, the general role of temporal envelope fluctuations in binaural hearing, and the neural mechanisms of integrated binaural perception. Further, they provide powerful tools for the design of efficient audio communication systems and devices that interface with human participants in real or virtual spatial settings. Supported by NIH DC011548.

[1]  E R Hafter,et al.  Detection of interaural differences of time in trains of high-frequency clicks as a function of interclick interval and number. , 1983, The Journal of the Acoustical Society of America.

[2]  Andrew D Brown,et al.  Temporal weighting functions for interaural time and level differences. III. Temporal weighting for lateral position judgments. , 2013, The Journal of the Acoustical Society of America.

[3]  R. Wickesberg,et al.  Delayed, frequency-specific inhibition in the cochlear nuclei of mice: a mechanism for monaural echo suppression , 1990, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[4]  Jerry V. Tobias,et al.  Effective Onset Duration of Auditory Stimuli , 1959 .

[5]  Andrew D Brown,et al.  Temporal weighting of interaural time and level differences in high-rate click trains. , 2010, The Journal of the Acoustical Society of America.

[6]  Andrew D Brown,et al.  Temporal weighting of binaural cues revealed by detection of dynamic interaural differences in high-rate Gabor click trains. , 2010, The Journal of the Acoustical Society of America.

[7]  T Houtgast,et al.  Lateralization threshold of a signal in noise. , 1968, The Journal of the Acoustical Society of America.

[8]  Mathias Dietz,et al.  Emphasis of spatial cues in the temporal fine structure during the rising segments of amplitude-modulated sounds , 2013, Proceedings of the National Academy of Sciences.

[9]  P. Heil Representation of Sound Onsets in the Auditory System , 2001, Audiology and Neurotology.

[10]  G Christopher Stecker,et al.  Nonuniform temporal weighting of interaural time differences in 500 Hz tones. , 2014, The Journal of the Acoustical Society of America.

[11]  Anna C. Diedesch,et al.  Temporal weighting of binaural information at low frequencies: Discrimination of dynamic interaural time and level differences. , 2015, The Journal of the Acoustical Society of America.

[12]  G Christopher Stecker,et al.  Temporal weighting functions for interaural time and level differences. IV. Effects of carrier frequency. , 2014, The Journal of the Acoustical Society of America.

[13]  G. C. Stecker,et al.  Temporal weighting functions for interaural time and level differences. II. The effect of binaurally synchronous temporal jitter. , 2011, The Journal of the Acoustical Society of America.

[14]  E R Hafter,et al.  Detection of interaural differences of intensity in trains of high-frequency clicks as a function of interclick interval and number. , 1983, The Journal of the Acoustical Society of America.

[15]  K Saberi,et al.  Observer weighting of interaural delays in filtered impulses , 1996, Perception & psychophysics.

[16]  R L Freyman,et al.  Onset dominance in lateralization. , 1997, The Journal of the Acoustical Society of America.