A Detection-Theoretic Analysis of Auditory Streaming and Its Relation to Auditory Masking

Research on hearing has long been challenged with understanding our exceptional ability to hear out individual sounds in a mixture (the so-called cocktail party problem). Two general approaches to the problem have been taken using sequences of tones as stimuli. The first has focused on our tendency to hear sequences, sufficiently separated in frequency, split into separate cohesive streams (auditory streaming). The second has focused on our ability to detect a change in one sequence, ignoring all others (auditory masking). The two phenomena are clearly related, but that relation has never been evaluated analytically. This article offers a detection-theoretic analysis of the relation between multitone streaming and masking that underscores the expected similarities and differences between these phenomena and the predicted outcome of experiments in each case. The key to establishing this relation is the function linking performance to the information divergence of the tone sequences, DKL (a measure of the statistical separation of their parameters). A strong prediction is that streaming and masking of tones will be a common function of DKL provided that the statistical properties of sequences are symmetric. Results of experiments are reported supporting this prediction.

[1]  Ching-Ju Liu,et al.  A method for evaluating the relation between sound source segregation and masking. , 2011, The Journal of the Acoustical Society of America.

[2]  Alexandra Bendixen,et al.  Regular patterns stabilize auditory streams. , 2010, The Journal of the Acoustical Society of America.

[3]  Ching-Ju Liu,et al.  Level dominance in sound source identification. , 2008, The Journal of the Acoustical Society of America.

[4]  R. Thompson A Note on Restricted Maximum Likelihood Estimation with an Alternative Outlier Model , 1985 .

[5]  Brian C. J. Moore Basic Aspects of Hearing : Physiology and Perception , 2013 .

[6]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[7]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[8]  R A Lutfi,et al.  Informational masking by everyday sounds. , 1999, The Journal of the Acoustical Society of America.

[9]  R A Lutfi A model of auditory pattern analysis based on component-relative-entropy. , 1993, The Journal of the Acoustical Society of America.

[10]  Ching-Ju Liu,et al.  A new approach to sound source segregation. , 2013, Advances in experimental medicine and biology.

[11]  Alexandra Bendixen,et al.  The effects of rhythm and melody on auditory stream segregation. , 2014, The Journal of the Acoustical Society of America.

[12]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[13]  Robert A. Lutfi,et al.  Correlation coefficients and correlation ratios as estimates of observer weights in multiple‐observation tasks , 1995 .

[14]  Lynn R. Gilbertson,et al.  The information-divergence hypothesis of informational masking. , 2013, The Journal of the Acoustical Society of America.

[15]  D. Wang,et al.  Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , 2008, IEEE Trans. Neural Networks.

[16]  T. Dau,et al.  Auditory stream formation affects comodulation masking release retroactively. , 2009, The Journal of the Acoustical Society of America.

[17]  R A Lutfi,et al.  Effect of masker harmonicity on informational masking. , 2000, The Journal of the Acoustical Society of America.

[18]  G. Kidd,et al.  Similarity, uncertainty, and masking in the identification of nonspeech auditory patterns. , 2002, The Journal of the Acoustical Society of America.

[19]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[20]  Robert A. Lutfi Comment on ‘‘Analysis of weights in multiple observation tasks’’ [J. Acoust. Soc. Am. 86, 1743–1746 (1989)] , 1992 .

[21]  A. Oxenham,et al.  Objective and Subjective Psychophysical Measures of Auditory Stream Integration and Segregation , 2010, Journal of the Association for Research in Otolaryngology.

[22]  Yi Shen,et al.  Toward an objective measure for a "stream segregation" task. , 2012, The Journal of the Acoustical Society of America.

[23]  R A Lutfi,et al.  Effect of component-relative-entropy on the discrimination of simultaneous tone complexes. , 1994, The Journal of the Acoustical Society of America.

[24]  I. Winkler,et al.  Different roles of similarity and predictability in auditory stream segregation , 2013 .

[25]  Shihab Shamma,et al.  Temporal coherence versus harmonicity in auditory stream formation. , 2013, The Journal of the Acoustical Society of America.

[26]  B. Shinn-Cunningham,et al.  Informational masking: counteracting the effects of stimulus uncertainty by decreasing target-masker similarity. , 2003, The Journal of the Acoustical Society of America.

[27]  H S Colburn,et al.  Reducing informational masking by sound segregation. , 1994, The Journal of the Acoustical Society of America.

[28]  C. K. Ogden A Source Book Of Gestalt Psychology , 2013 .

[29]  Douglas Johnson,et al.  Stream Segregation and Peripheral Channeling , 1991 .

[30]  Brian C J Moore,et al.  Properties of auditory stream formation , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[31]  R A Lutfi,et al.  Nonmonotonicity of informational masking. , 1998, The Journal of the Acoustical Society of America.

[32]  B. Berg,et al.  Observer efficiency and weights in a multiple observation task. , 1990, The Journal of the Acoustical Society of America.

[33]  Jungmee Lee,et al.  Auditory streaming of tones of uncertain frequency, level, and duration. , 2015, The Journal of the Acoustical Society of America.

[34]  L. V. Noorden Temporal coherence in the perception of tone sequences , 1975 .

[35]  T Dau,et al.  Modulation detection interference: effects of concurrent and sequential streaming. , 2001, The Journal of the Acoustical Society of America.

[36]  M. Wertheimer,et al.  Gestalt Theory , 2019, Theories and Applications of Counseling and Psychotherapy: Relevance Across Cultures and Settings.

[37]  Gerald Kidd,et al.  Multiple bursts, multiple looks, and stream coherence in the release from informational masking. , 2003, The Journal of the Acoustical Society of America.

[38]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[39]  Stephen McAdams,et al.  Postrecognition of interleaved melodies as an indirect measure of auditory stream formation. , 2003, Journal of experimental psychology. Human perception and performance.

[40]  B C Moore,et al.  The role of spectral and periodicity cues in auditory stream segregation, measured using a temporal discrimination task. , 1999, The Journal of the Acoustical Society of America.

[41]  Guy J. Brown,et al.  Fundamentals of Computational Auditory Scene Analysis , 2006 .

[42]  Z. Govindarajulu,et al.  A brief survey of nonparametric statistics , 1976 .

[43]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[44]  Brian C J Moore Psychoacoustics of normal and impaired hearing. , 2002, British medical bulletin.

[45]  Shihab A. Shamma,et al.  Hearing out repeating elements in randomly varying multitone sequences: a case of streaming?: From Sensory Processing to Perception , 2007 .