Integration and segregation in auditory streaming

We aim to capture the perceptual dynamics of auditory streaming using a neurally inspired model of auditory processing. Traditional approaches view streaming as a competition between streams, realized within a tonotopically organized neural network. In contrast, we view streaming as a dynamic integration process that resides at locations other than the sensory-specific neural subsystems. This process is realized in the synchronization of neural ensembles or in the existence of informational convergence zones. Our approach uses two interacting dynamical systems. The first system responds to incoming acoustic stimuli and transforms them into spatiotemporal neural field dynamics. The second system is a classification system that is coupled to the neural field and evolves to a stationary state. These stationary states are identified with a single perceptual stream or with multiple streams. Several results in human perception are modelled, including the temporal coherence and fission boundaries [L.P.A.S. van Noorden, Temporal coherence in the perception of tone sequences, Ph.D. Thesis, Eindhoven University of Technology, The Netherlands, 1975] and the crossing of motions [A.S. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound, MIT Press, 1990]. Our model predicts phenomena such as the existence of two streams with the same pitch, which cannot be explained by traditional stream competition models. An experimental study is performed to demonstrate the existence of this phenomenon. The model elucidates possible mechanisms that may underlie perceptual phenomena. © 2005 Elsevier B.V. All rights reserved. PACS: 43.66.Ba
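The two-system architecture described in the abstract can be illustrated with a toy sketch. This is not the model's actual set of equations, which the abstract does not give. Everything here is an assumption made for illustration: a leaky tonotopic field driven by Gaussian tone inputs, a scalar double-well classifier `y`, a coupling through the ratio of mid-point to peak field activity, and all parameter values. The sketch only shows how a classifier coupled to a field can settle into one of two stationary states depending on the frequency separation of alternating tones. It does not model the tempo dependence and makes no attempt to reproduce the measured coherence and fission boundaries.

```python
import math


def simulate(delta_f, n_units=61, sigma=3.0, tone_dur=0.1, n_tones=20,
             tau_field=0.05, tau_class=0.2, gain=4.0, r0=0.5, dt=0.001):
    """Toy two-system sketch (all names and values are illustrative assumptions).

    Returns the final classifier state y:
    y > 0 is read as one integrated stream, y < 0 as two segregated streams.
    """
    centre = (n_units - 1) / 2.0
    f_a = centre - delta_f / 2.0          # tonotopic position of tone A
    f_b = centre + delta_f / 2.0          # tonotopic position of tone B
    mid = int(round(centre))              # field unit halfway between A and B
    u = [0.0] * n_units                   # system 1: field activity
    y = 0.0                               # system 2: classifier state
    steps_per_tone = int(round(tone_dur / dt))

    for k in range(n_tones):
        f = f_a if k % 2 == 0 else f_b    # alternating A-B-A-B sequence
        stim = [math.exp(-0.5 * ((x - f) / sigma) ** 2) for x in range(n_units)]
        for _ in range(steps_per_tone):
            # System 1: leaky integration of the input (forward Euler step).
            u = [ui + dt * (-ui + si) / tau_field for ui, si in zip(u, stim)]
            # Coupling: how strongly the field bridges the two tone positions.
            r = u[mid] / (max(u) + 1e-12)
            # System 2: double-well dynamics biased by the field measure.
            y += dt * (y - y ** 3 + gain * (r - r0)) / tau_class
    return y


if __name__ == "__main__":
    for delta_f in (2.0, 20.0):
        y_final = simulate(delta_f)
        label = "one stream" if y_final > 0 else "two streams"
        print(f"delta_f = {delta_f:4.1f} units -> {label}")
```

In this sketch a small separation leaves the two activity bumps merged, so the classifier relaxes to the positive state, while a large separation leaves a gap between them and drives it to the negative state. Identifying perceptual outcomes with the stationary states of the second system follows the abstract. The particular bias term is only one simple way to couple the two systems.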
