A Visionary Approach to Listening: Determining The Role Of Vision In Auditory Scene Analysis

To recognize and understand the auditory environment, the listener must first separate sounds that arise from different sources and capture each event. This process is known as auditory scene analysis. The aim of this thesis is to investigate whether and how visual information can influence auditory scene analysis. The thesis consists of four chapters. Firstly, I reviewed the literature to give a clear framework about the impact of visual information on the analysis of complex acoustic environments. In chapter II, I examined psychophysically whether temporal coherence between auditory and visual stimuli was sufficient to promote auditory stream segregation in a mixture. I have found that listeners were better able to report brief deviants in an amplitude modulated target stream when a visual stimulus changed in size in a temporally coherent manner than when the visual stream was coherent with the non-target auditory stream. This work demonstrates that temporal coherence between auditory and visual features can influence the way people analyse an auditory scene. In chapter III, the integration of auditory and visual features in auditory cortex was examined by recording neuronal responses in awake and anaesthetised ferret auditory cortex in response to the modified stimuli used in Chapter II. I demonstrated that temporal coherence between auditory and visual stimuli enhances the neural representation of a sound and influences which sound a neuron represents in a sound mixture. Visual stimuli elicited reliable changes in the phase of the local field potential which provides mechanistic insight into this finding. Together these findings provide evidence that early cross modal integration underlies the behavioural effects in chapter II. Finally, in chapter IV, I investigated whether training can influence the ability of listeners to utilize visual cues for auditory stream analysis and showed that this ability improved by training listeners to detect auditory-visual temporal coherence.

[1]  L P Shapiro,et al.  "How to milk a coat:" the effects of semantic and acoustic information on phoneme categorization. , 1998, The Journal of the Acoustical Society of America.

[2]  P. Walker,et al.  Stroop Interference Based on the Multimodal Correlates of Haptic Size and Auditory Pitch , 1985, Perception.

[3]  M. Mesulam,et al.  From sensation to cognition. , 1998, Brain : a journal of neurology.

[4]  F. Ohl,et al.  Non-sensory cortical and subcortical connections of the primary auditory cortex in Mongolian gerbils: Bottom-up and top-down processing of neuronal information via field AI , 2008, Brain Research.

[5]  Charles Spence,et al.  Perception of audiovisual speech synchrony for native and non-native language , 2010, Brain Research.

[6]  V. Ramachandran,et al.  Synaesthesia? A window into perception, thought and language , 2001 .

[7]  Cesare Parise,et al.  Synesthetic congruency modulates the temporal ventriloquism effect , 2008, Neuroscience Letters.

[8]  Y. Sugita Neuronal correlates of auditory induction in the cat cortex , 1997, Neuroreport.

[9]  J. Driver,et al.  Multisensory Interplay Reveals Crossmodal Influences on ‘Sensory-Specific’ Brain Regions, Neural Responses, and Judgments , 2008, Neuron.

[10]  John J. Foxe,et al.  Multisensory auditory-visual interactions during early sensory processing in humans: a high-density electrical mapping study. , 2002, Brain research. Cognitive brain research.

[11]  Barbara G Shinn-Cunningham,et al.  A sound element gets lost in perceptual competition , 2007, Proceedings of the National Academy of Sciences.

[12]  N. Mesgarani,et al.  Selective cortical representation of attended speaker in multi-talker speech perception , 2012, Nature.

[13]  M. Steinschneider,et al.  Neurophysiological evidence for context-dependent encoding of sensory input in human auditory cortex , 2006, Brain Research.

[14]  D. Hartley,et al.  Multisensory Training Improves Auditory Spatial Processing following Bilateral Cochlear Implantation , 2014, The Journal of Neuroscience.

[15]  A. Samuel Does lexical information influence the perceptual restoration of phonemes , 1996 .

[16]  C. Spence,et al.  Crossmodal correspondences: Innate or learned? , 2012, i-Perception.

[17]  N. Logothetis,et al.  Visual modulation of neurons in auditory cortex. , 2008, Cerebral cortex.

[18]  Mounya Elhilali,et al.  Temporal coherence and the streaming of complex sounds. , 2013, Advances in experimental medicine and biology.

[19]  A. Puce,et al.  Neuronal oscillations and visual amplification of speech , 2008, Trends in Cognitive Sciences.

[20]  Mitchell Steinschneider,et al.  Neural Correlates of Auditory Scene Analysis Based on Inharmonicity in Monkey Primary Auditory Cortex , 2010, The Journal of Neuroscience.

[21]  Rachel N. Denison,et al.  Temporal Structure and Complexity Affect Audio-Visual Correspondence Detection , 2013, Front. Psychology.

[22]  E Ahissar,et al.  Temporal frequency of whisker movement. II. Laminar organization of cortical representations. , 2001, Journal of neurophysiology.

[23]  J. Fritz,et al.  Active listening: Task-dependent plasticity of spectrotemporal receptive fields in primary auditory cortex , 2005, Hearing Research.

[24]  A. Bregman,et al.  Primary auditory stream segregation and perception of order in rapid sequences of tones. , 1971, Journal of experimental psychology.

[25]  J. Driver Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading , 1996, Nature.

[26]  A. Oxenham,et al.  Sequential stream segregation in the absence of spectral cues. , 1999, The Journal of the Acoustical Society of America.

[27]  Christoph Kayser,et al.  Multisensory interactions in primate auditory cortex: fMRI and electrophysiology , 2009, Hearing Research.

[28]  Torsten Rahne,et al.  Visual cues can modulate integration and segregation of objects in auditory scene analysis , 2007, Brain Research.

[29]  B. Shinn-Cunningham,et al.  Selective Attention in Normal and Impaired Hearing , 2008, Trends in amplification.

[30]  Susan Denham,et al.  Multistability in auditory stream segregation: a predictive coding view , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[31]  W. Ritter,et al.  An investigation of the auditory streaming effect using event-related brain potentials. , 1999, Psychophysiology.

[32]  C. Micheyl,et al.  Auditory stream segregation on the basis of amplitude-modulation rate. , 2002, The Journal of the Acoustical Society of America.

[33]  J. Rauschecker,et al.  Attention‐related modulation of activity in primary and secondary auditory cortex , 1997, Neuroreport.

[34]  Adrian K. C. Lee,et al.  Using neuroimaging to understand the cortical mechanisms of auditory selective attention , 2014, Hearing Research.

[35]  M. Chait,et al.  Neural Correlates of Auditory Figure-Ground Segregation Based on Temporal Coherence , 2016, Cerebral cortex.

[36]  S. Shamma,et al.  Interaction between Attention and Bottom-Up Saliency Mediates the Representation of Foreground and Background in an Auditory Scene , 2009, PLoS biology.

[37]  E. Vogel,et al.  Sensory gain control (amplification) as a mechanism of selective attention: electrophysiological and neuroimaging evidence. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[38]  E. Bullmore,et al.  Activation of auditory cortex during silent lipreading. , 1997, Science.

[39]  Thilo Womelsdorf,et al.  A Role of Phase-Resetting in Coordinating Large Scale Neural Networks During Attention and Goal-Directed Behavior , 2016, Front. Syst. Neurosci..

[40]  Antje Ihlefeld,et al.  Disentangling the effects of spatial cues on selection and formation of auditory objects. , 2008, The Journal of the Acoustical Society of America.

[41]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[42]  Ankoor S. Shah,et al.  An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. , 2005, Journal of neurophysiology.

[43]  Kathleen S Rockland,et al.  Multisensory convergence in calcarine visual areas in macaque monkey. , 2003, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[44]  I. Nelken,et al.  Neurons and Objects: The Case of Auditory Cortex , 2008, Front. Neurosci..

[45]  I. Nelken,et al.  Physiological and Anatomical Evidence for Multisensory Interactions in Auditory Cortex , 2006, Cerebral cortex.

[46]  J. Swets Signal detection and recognition by human observers : contemporary readings , 1964 .

[47]  S. Shamma,et al.  Segregation of complex acoustic scenes based on temporal coherence , 2013, eLife.

[48]  M. Wallace,et al.  The effects of visual training on multisensory temporal processing , 2013, Experimental Brain Research.

[49]  P F Seitz,et al.  The use of visible speech cues for improving auditory detection of spoken sentences. , 2000, The Journal of the Acoustical Society of America.

[50]  Claude Alain,et al.  Concurrent Sound Segregation Is Enhanced in Musicians , 2009, Journal of Cognitive Neuroscience.

[51]  Joost X. Maier,et al.  Multisensory Integration of Dynamic Faces and Voices in Rhesus Monkey Auditory Cortex , 2005 .

[52]  Robert D. Melara,et al.  Interaction between synesthetically corresponding dimensions. , 1987 .

[53]  Pablo Fuentealba,et al.  Cell Type-Specific Tuning of Hippocampal Interneuron Firing during Gamma Oscillations In Vivo , 2007, The Journal of Neuroscience.

[54]  M. Wallace,et al.  Generalization of multisensory perceptual learning , 2016, Scientific Reports.

[55]  I. Winkler,et al.  Preattentive auditory context effects , 2003, Cognitive, affective & behavioral neuroscience.

[56]  Kerry M. M. Walker,et al.  Interdependent Encoding of Pitch, Timbre, and Spatial Location in Auditory Cortex , 2009, The Journal of Neuroscience.

[57]  B. Postle,et al.  Top-down control of the phase of alpha-band oscillations as a mechanism for temporal prediction , 2015, Proceedings of the National Academy of Sciences.

[58]  H. Bülthoff,et al.  Merging the senses into a robust percept , 2004, Trends in Cognitive Sciences.

[59]  D. Senkowski,et al.  The multifaceted interplay between attention and multisensory integration , 2010, Trends in Cognitive Sciences.

[60]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[61]  M. Daneman,et al.  How young and old adults listen to and remember speech in noise. , 1995, The Journal of the Acoustical Society of America.

[62]  T. Rahne,et al.  Visual cues release the temporal coherence of auditory objects in auditory scene analysis , 2009, Brain Research.

[63]  Shihab Shamma,et al.  Temporal coherence versus harmonicity in auditory stream formation. , 2013, The Journal of the Acoustical Society of America.

[64]  K. Reinikainen,et al.  Selective attention enhances the auditory 40-Hz transient response in humans , 1993, Nature.

[65]  H. Scheich,et al.  Multisensory processing via early cortical stages: Connections of the primary auditory cortical field with other sensory systems , 2006, Neuroscience.

[66]  Charles Spence,et al.  ‘When Birds of a Feather Flock Together’: Synesthetic Correspondences Modulate Audiovisual Integration in Non-Synesthetes , 2009, PloS one.

[67]  Jeremy Marozeau,et al.  The Effect of Visual Cues on Auditory Stream Segregation in Musicians and Non-Musicians , 2010, PloS one.

[68]  Kostas Giannakis,et al.  A comparative evaluation of auditory-visual mappings for sound visualisation , 2006, Organised Sound.

[69]  Renee Timmers,et al.  Beethoven’s last piano sonata and those who follow crocodiles: Cross-domain mappings of auditory pitch in a musical context , 2010, Cognition.

[70]  S. Hillyard,et al.  Electrical Signs of Selective Attention in the Human Brain , 1973, Science.

[71]  Kerry M. M. Walker,et al.  Spectral timbre perception in ferrets: discrimination of artificial vowels under different listening conditions. , 2013, The Journal of the Acoustical Society of America.

[72]  C. Schroeder,et al.  Neuronal Oscillations and Multisensory Interaction in Primary Auditory Cortex , 2007, Neuron.

[73]  H. Scheich,et al.  Nonauditory Events of a Behavioral Procedure Activate Auditory Cortex of Highly Trained Monkeys , 2005, The Journal of Neuroscience.

[74]  Xiao-Jing Wang Neurophysiological and computational principles of cortical rhythms in cognition. , 2010, Physiological reviews.

[75]  Christopher T. Lovelace,et al.  An irrelevant light enhances auditory detection in humans: a psychophysical analysis of multisensory integration in stimulus detection. , 2003, Brain research. Cognitive brain research.

[76]  Kostas Giannakis,et al.  4 Imaging Soundscapes : Identifying Cognitive Associations between Auditory and Visual Dimensions , 2000 .

[77]  J. Navarra,et al.  Spatial recoding of sound: Pitch-varying auditory cues modulate up/down visual spatial attention , 2012 .

[78]  I. Winkler,et al.  Object representation in the human auditory system , 2006, The European journal of neuroscience.

[79]  Aaron R. Seitz,et al.  Benefits of multisensory learning , 2008, Trends in Cognitive Sciences.

[80]  Virginie van Wassenhove,et al.  Auditory-visual fusion in speech perception in children with cochlear implants. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[81]  Jessica M. Foxton,et al.  Effects of attention and unilateral neglect on auditory stream segregation. , 2001, Journal of experimental psychology. Human perception and performance.

[82]  Yoshitaka Nakajima,et al.  Illusory recouplings of onsets and terminations of glide tone components , 2000, Perception & psychophysics.

[83]  Brian C J Moore,et al.  Properties of auditory stream formation , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[84]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[85]  Michael J. Crosse,et al.  Congruent Visual Speech Enhances Cortical Entrainment to Continuous Auditory Speech in Noise-Free Conditions , 2015, The Journal of Neuroscience.

[86]  Charles Spence,et al.  Audiovisual crossmodal correspondences and sound symbolism: a study using the implicit association test , 2012, Experimental Brain Research.

[87]  S. Shamma,et al.  Temporal coherence and attention in auditory scene analysis , 2011, Trends in Neurosciences.

[88]  Christoph Kayser,et al.  A Precluding But Not Ensuring Role of Entrained Low-Frequency Oscillations for Auditory Perception , 2012, The Journal of Neuroscience.

[89]  I. Nelken,et al.  Processing of complex stimuli and natural scenes in the auditory cortex , 2004, Current Opinion in Neurobiology.

[90]  Mary A. Peterson,et al.  Memory and Learning in Figure-Ground Perception , 2003 .

[91]  B. Shinn-Cunningham Object-based auditory and visual attention , 2008, Trends in Cognitive Sciences.

[92]  Salvador Soto-Faraco,et al.  Attention to touch weakens audiovisual speech integration , 2007, Experimental Brain Research.

[93]  Salvador Soto-Faraco,et al.  Assessing the role of attention in the audiovisual integration of speech , 2010, Inf. Fusion.

[94]  D. Burr,et al.  The Ventriloquist Effect Results from Near-Optimal Bimodal Integration , 2004, Current Biology.

[95]  William E Cooper,et al.  Hierarchical coding in speech timing , 1978, Cognitive Psychology.

[96]  Jean-Luc Schwartz,et al.  No, There Is No 150 ms Lead of Visual Speech on Auditory Speech, but a Range of Audiovisual Asynchronies Varying from Small Audio Lead to Large Audio Lag , 2014, PLoS Comput. Biol..

[97]  J Miller,et al.  Channel interaction and the redundant-targets effect in bimodal divided attention. , 1991, Journal of experimental psychology. Human perception and performance.

[98]  H. Scheich,et al.  A multilevel and cross-modal approach towards neuronal mechanisms of auditory streaming , 2008, Brain Research.

[99]  M. Chait,et al.  Detecting and representing predictable structure during auditory scene analysis , 2016, eLife.

[100]  K. Grant,et al.  Auditory-visual speech recognition by hearing-impaired subjects: consonant recognition, sentence recognition, and auditory-visual integration. , 1998, The Journal of the Acoustical Society of America.

[101]  Frédéric Berthommier,et al.  Binding and unbinding the auditory and visual streams in the McGurk effect. , 2012, The Journal of the Acoustical Society of America.

[102]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[103]  D. Poeppel,et al.  Temporal context in speech processing and attentional stream selection: A behavioral and neural perspective , 2012, Brain and Language.

[104]  Jean Rouat,et al.  Audiovisual correspondence between musical timbre and visual shapes , 2014, Front. Hum. Neurosci..

[105]  Mitchell Steinschneider,et al.  Neural Representation of Concurrent Harmonic Sounds in Monkey Primary Auditory Cortex: Implications for Models of Auditory Scene Analysis , 2014, The Journal of Neuroscience.

[106]  A S Bregman,et al.  Resetting the pitch-analysis system. 2. Role of sudden onsets and offsets in the perception of individual components in a cluster of overlapping tones. , 1994, The Journal of the Acoustical Society of America.

[107]  N. Logothetis,et al.  Neuronal correlates of subjective visual perception. , 1989, Science.

[108]  Frédéric Berthommier,et al.  The effect of lip-reading on primary stream segregation. , 2011, The Journal of the Acoustical Society of America.

[109]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[110]  B. Stein,et al.  Enhancement of Perceived Visual Intensity by Auditory Stimuli: A Psychophysical Analysis , 1996, Journal of Cognitive Neuroscience.

[111]  S. David,et al.  Emergent Selectivity for Task-Relevant Stimuli in Higher-Order Auditory Cortex , 2014, Neuron.

[112]  Asif A. Ghazanfar,et al.  The Natural Statistics of Audiovisual Speech , 2009, PLoS Comput. Biol..

[113]  Asif A Ghazanfar,et al.  Dynamic faces speed up the onset of auditory cortical spiking responses during vocal detection , 2013, Proceedings of the National Academy of Sciences.

[114]  G. Karmos,et al.  Entrainment of Neuronal Oscillations as a Mechanism of Attentional Selection , 2008, Science.

[115]  Anina N. Rich,et al.  Cross-Modality Correspondence between Pitch and Spatial Location Modulates Attentional Orienting , 2012, Perception.

[116]  Dylan M. Jones,et al.  Does auditory streaming require attention? Evidence from attentional selectivity in short-term memory. , 2003, Journal of experimental psychology. Human perception and performance.

[117]  Robert S. Harper,et al.  Pitch and the Vertical Localization of Sound , 1959 .

[118]  M. Giard,et al.  Auditory-Visual Integration during Multimodal Object Recognition in Humans: A Behavioral and Electrophysiological Study , 1999, Journal of Cognitive Neuroscience.

[119]  Joshua G. W. Bernstein,et al.  Auditory and auditory-visual intelligibility of speech in fluctuating maskers for normal-hearing and hearing-impaired listeners. , 2009, The Journal of the Acoustical Society of America.

[120]  J. Snyder,et al.  Adaptation reveals multiple levels of representation in auditory stream segregation. , 2009, Journal of experimental psychology. Human perception and performance.

[121]  D. Maurer,et al.  The shape of boubas: sound-shape correspondences in toddlers and adults. , 2006, Developmental science.

[122]  L E Marks,et al.  On associations of light and sound: the mediation of brightness, pitch, and loudness. , 1974, The American journal of psychology.

[123]  Birger Kollmeier,et al.  Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests. , 2002, The Journal of the Acoustical Society of America.

[124]  E. Yund,et al.  Attentional modulation of human auditory cortex , 2004, Nature Neuroscience.

[125]  Scott Sinnett,et al.  The effect of attention on the illusory capture of motion in bimodal stimuli , 2008, Brain Research.

[126]  J. Snyder,et al.  Toward a neurophysiological theory of auditory stream segregation. , 2007, Psychological bulletin.

[127]  Philip T Quinlan,et al.  Garner and congruence effects in the speeded classification of bimodal signals. , 2002, Journal of experimental psychology. Human perception and performance.

[128]  J. Ward,et al.  Visualized voices: A case study of audio-visual synesthesia , 2012, Neurocase.

[129]  J. Fritz,et al.  Rapid task-related plasticity of spectrotemporal receptive fields in primary auditory cortex , 2003, Nature Neuroscience.

[130]  Mounya Elhilali,et al.  Task Difficulty and Performance Induce Diverse Adaptive Patterns in Gain and Shape of Primary Auditory Cortical Receptive Fields , 2009, Neuron.

[131]  J. M. Ackroff,et al.  Auditory Induction: Perceptual Synthesis of Absent Sounds , 1972, Science.

[132]  G. A. Miller,et al.  The Intelligibility of Interrupted Speech , 1948 .

[133]  R. Hari,et al.  Seeing speech: visual information from lip movements modifies activity in the human auditory cortex , 1991, Neuroscience Letters.

[134]  D H Whalen,et al.  Limits on phonetic integration in duplex perception , 1996, Perception & psychophysics.

[135]  I H Bernstein,et al.  Effects of some variations in auditory input upon visual choice reaction time. , 1971, Journal of experimental psychology.

[136]  O. Bertrand,et al.  Effects of Selective Attention on the Electrophysiological Representation of Concurrent Sounds in the Human Auditory Cortex , 2007, The Journal of Neuroscience.

[137]  John J. Foxe,et al.  Object‐based attention is multisensory: co‐activation of an object's representations in ignored sensory modalities , 2007, The European journal of neuroscience.

[138]  Detection of across-frequency differences in fundamental frequency , 1992 .

[139]  D. Poeppel,et al.  Temporal window of integration in auditory-visual speech perception , 2007, Neuropsychologia.

[140]  Sibylle C. Herholz,et al.  Evidence for Training-Induced Plasticity in Multisensory Brain Structures: An MEG Study , 2012, PloS one.

[141]  Gary L. Dannenbring,et al.  The effect of continuity on auditory stream segregation , 1973 .

[142]  S. Campanella,et al.  Integrating face and voice in person perception , 2007, Trends in Cognitive Sciences.

[143]  Marcia Grabowecky,et al.  Changes in auditory frequency guide visual–spatial attention , 2011, Cognition.

[144]  L. V. Noorden Temporal coherence in the perception of tone sequences , 1975 .

[145]  Teemu Rinne,et al.  Functional Maps of Human Auditory Cortex: Effects of Acoustic Features and Attention , 2009, PloS one.

[146]  B. Moore,et al.  Primitive stream segregation of tone sequences without differences in fundamental frequency or passband. , 2002, The Journal of the Acoustical Society of America.

[147]  J. Rauschecker,et al.  Perceptual Organization of Tone Sequences in the Auditory Cortex of Awake Macaques , 2005, Neuron.

[148]  Jan W. H. Schnupp,et al.  Plasticity of Temporal Pattern Codes for Vocalization Stimuli in Primary Auditory Cortex , 2006, The Journal of Neuroscience.

[149]  S. Shamma,et al.  Temporal Coherence in the Perceptual Organization and Cortical Representation of Auditory Scenes , 2009, Neuron.

[150]  Virginia Best,et al.  Object continuity enhances selective auditory attention , 2008, Proceedings of the National Academy of Sciences.

[151]  S. Palmer,et al.  A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. , 2012, Psychological bulletin.

[152]  Stefano Panzeri,et al.  Visual Enhancement of the Information Representation in Auditory Cortex , 2010, Current Biology.

[153]  I. Howard,et al.  Human Spatial Orientation , 1966 .

[154]  H. Kennedy,et al.  Anatomical Evidence of Multimodal Integration in Primate Striate Cortex , 2002, The Journal of Neuroscience.

[155]  J. Obleser,et al.  Frequency modulation entrains slow neural oscillations and optimizes human listening behavior , 2012, Proceedings of the National Academy of Sciences.

[156]  John J. Foxe,et al.  Auditory-somatosensory multisensory processing in auditory association cortex: an fMRI study. , 2002, Journal of neurophysiology.

[157]  G. A. Miller,et al.  The intelligibility of speech as a function of the context of the test materials. , 1951, Journal of experimental psychology.

[158]  L E Marks,et al.  On cross-modal similarity: auditory-visual interactions in speeded discrimination. , 1987, Journal of experimental psychology. Human perception and performance.

[159]  Erik Blaser,et al.  Tracking an object through feature space , 2000, Nature.

[160]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[161]  Adrian K. C. Lee,et al.  Defining Auditory-Visual Objects: Behavioral Tests and Physiological Mechanisms , 2016, Trends in Neurosciences.

[162]  P. Iverson,et al.  Auditory stream segregation by musical timbre: effects of static and dynamic acoustic attributes. , 1995, Journal of experimental psychology. Human perception and performance.

[163]  I. Winkler,et al.  The role of attention in the formation of auditory streams , 2007, Perception & psychophysics.

[164]  A. Samuel,et al.  Perceptual learning for speech , 2009, Attention, perception & psychophysics.

[165]  Claude Alain Breaking the wave: Effects of attention and learning on concurrent sound perception , 2007, Hearing Research.

[166]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[167]  M. Chait,et al.  Brain Bases for Auditory Stimulus-Driven Figure–Ground Segregation , 2011, The Journal of Neuroscience.

[168]  Yu He,et al.  Hearing Two Things at Once: Neurophysiological Indices of Speech Segregation and Identification , 2005, Journal of Cognitive Neuroscience.

[169]  Q. Summerfield Some preliminaries to a comprehensive account of audio-visual speech perception. , 1987 .

[170]  Nicolas Grimault,et al.  Effect of spectral smearing on the perceptual segregation of vowel sequences , 2007, Hearing Research.

[171]  Charles Spence,et al.  Multisensory synesthetic interactions in the speeded classification of visual size , 2006, Perception & psychophysics.

[172]  Barbara G Shinn-Cunningham,et al.  Nothing Is Irrelevant in a Noisy World: Sensory Illusions Reveal Obligatory within-and across-Modality Integration , 2012, The Journal of Neuroscience.

[173]  Douglas Johnson,et al.  Stream Segregation and Peripheral Channeling , 1991 .

[174]  S. Rosen Temporal information in speech: acoustic, auditory and linguistic aspects. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[175]  P Bertelson,et al.  Directing spatial attention towards the illusory location of a ventriloquized sound. , 2001, Acta psychologica.

[176]  T. Picton,et al.  Bottom-up and top-down influences on auditory scene analysis: evidence from event-related brain potentials. , 2001, Journal of experimental psychology. Human perception and performance.

[177]  P. Bertelson,et al.  The ventriloquist effect does not depend on the direction of deliberate visual attention , 2000, Perception & psychophysics.

[178]  R. Melara,et al.  Dimensional interaction between color and pitch. , 1989, Journal of experimental psychology. Human perception and performance.

[179]  Lawrence E. Marks,et al.  Visual-auditory interaction in speeded classification: Role of stimulus difference , 1995, Perception & psychophysics.

[180]  Chris I. Baker,et al.  Integration of Visual and Auditory Information by Superior Temporal Sulcus Neurons Responsive to the Sight of Actions , 2005, Journal of Cognitive Neuroscience.

[181]  Anne Treisman,et al.  Natural cross-modal mappings between visual and auditory features. , 2011, Journal of vision.

[182]  Nima Mesgarani,et al.  Phoneme representation and classification in primary auditory cortex. , 2008, The Journal of the Acoustical Society of America.

[183]  Kerry M. M. Walker,et al.  Pitch discrimination by ferrets for simple and complex sounds. , 2009, The Journal of the Acoustical Society of America.

[184]  James A. O'Sullivan,et al.  Evidence for Neural Computations of Temporal Coherence in an Auditory Scene and Their Enhancement during Active Listening , 2015, The Journal of Neuroscience.

[185]  K. O’Connor,et al.  Illusory Sound Perception in Macaque Monkeys , 2003, The Journal of Neuroscience.

[186]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[187]  R. Quian Quiroga,et al.  Unsupervised Spike Detection and Sorting with Wavelets and Superparamagnetic Clustering , 2004, Neural Computation.

[188]  Huriye Atilgan,et al.  The role of spectral cues in timbre discrimination by ferrets and humans. , 2015, The Journal of the Acoustical Society of America.

[189]  Lynne E. Bernstein,et al.  Auditory speech detection in noise enhanced by lipreading , 2004, Speech Commun..

[190]  D. Poeppel,et al.  Auditory Cortex Tracks Both Auditory and Visual Stimulus Dynamics Using Low-Frequency Neuronal Phase Modulation , 2010, PLoS biology.

[191]  E. Sussman,et al.  Organization of sequential sounds in auditory memory , 2005, Neuroreport.

[192]  O. Reiser,et al.  Principles Of Gestalt Psychology , 1936 .

[193]  Georg M. Klump,et al.  Stimulus Familiarity Affects Perceptual Restoration in the European Starling (Sturnus vulgaris) , 2009, PloS one.

[194]  Charles Spence,et al.  ‘Bouba’ and ‘Kiki’ in Namibia? A remote culture make similar shape–sound matches, but different shape–taste matches to westerners , 2013 .

[195]  F. Lepore,et al.  Audiovisual fusion and cochlear implant proficiency. , 2010, Restorative neurology and neuroscience.

[196]  Adrian K. C. Lee,et al.  Auditory selective attention is enhanced by a task-irrelevant temporally coherent visual stimulus in human listeners , 2015, eLife.

[197]  H Scheich,et al.  Functional organization of auditory cortex in the Mongolian gerbil (Meriones unguiculatus). III. Anatomical subdivisions and corticocortical connections , 2000, The European journal of neuroscience.