Hearing out repeating elements in randomly varying multitone sequences: a case of streaming?: From Sensory Processing to Perception

One of the most essential functions of perception is to detect and track signals of interest amid other, potentially distracting stimuli. Understanding the mechanisms behind this function is important not just from a theoretical perspective, but also because of its potential applications in artificial scene analysis. An important aspect of auditory scene analysis is the organization of sounds into streams (Bregman 1990). While the vast majority of studies on auditory streaming have used repeating sound sequences, perhaps one of the most informative findings regarding the potential usefulness of streaming in everyday life comes from a study that involved randomly varying tones. Kidd, Mason, Deliwala, Woods, and Colburn (1994) measured detection thresholds for a fixed-frequency target tone presented simultaneously with multiple other spectral components, the frequencies of which varied unpredictably across trials; a situation known to produce large amounts of informational masking (Neff and Green 1987). Kidd et al. found that if the target-plus-masker bursts were repeated identically several times, so that they formed an unchanging sequence, large informational masking effects remained. However, if the masker component frequencies were allowed to vary from one burst to the next within the course of the stimulus sequence, as illustrated schematically in Fig. 1, thresholds were substantially reduced. Kidd et al. explained this outcome as being due to the repeating target tones being perceptually organized into a coherent auditory stream, which stood out from the randomly varying background. From this point of view, Kidd et al.’s results appear to demonstrate that stream segregation can greatly alleviate informational masking effects, which might otherwise Fig. 1. Schematic spectro gram showing a sequence of regularly repeating tones amid random masker tones. Time F re qu en cy

[1]  H S Colburn,et al.  Reducing informational masking by sound segregation. , 1994, The Journal of the Acoustical Society of America.

[2]  A S Bregman,et al.  Auditory streaming is cumulative. , 1978, Journal of experimental psychology. Human perception and performance.

[3]  Albert S. Bregman,et al.  Effects of time intervals and tone durations on auditory stream segregation , 2000, Perception & psychophysics.

[4]  Laurent Demany,et al.  On the binding of successive sounds: perceiving shifts in nonperceived pitches. , 2005, The Journal of the Acoustical Society of America.

[5]  D. M. Green,et al.  Masking produced by spectral uncertainty with multicomponent maskers , 1987, Perception & psychophysics.

[6]  C. Schreiner,et al.  Sequence sensitivity of neurons in cat primary auditory cortex. , 2000, Cerebral cortex.

[7]  Stephen McAdams,et al.  Auditory signal processing : physiology, psychoacoustics, and models , 2005 .

[8]  J. Rauschecker,et al.  Perceptual Organization of Tone Sequences in the Auditory Cortex of Awake Macaques , 2005, Neuron.

[9]  Rhodri Cusack,et al.  Performance measures of auditory organization , 2005 .

[10]  C S Watson,et al.  Factors in the discrimination of tonal patterns. I. Component frequency, temporal position, and silent intervals. , 1975, The Journal of the Acoustical Society of America.

[11]  Mitchell Steinschneider,et al.  Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey , 2001, Hearing Research.

[12]  A. Bregman Auditory Scene Analysis , 2008 .

[13]  Gerald Kidd,et al.  Multiple bursts, multiple looks, and stream coherence in the release from informational masking. , 2003, The Journal of the Acoustical Society of America.