The incongruency advantage for environmental sounds presented in natural auditory scenes.

The effect of context on the identification of common environmental sounds (e.g., dogs barking or cars honking) was tested by embedding them in familiar auditory background scenes (e.g., street ambience, restaurants). Initial results with listeners trained on both the scenes and the sounds to be identified showed that accuracy was about five percentage points higher for sounds that were contextually incongruous with the background scene (e.g., a rooster crowing in a hospital), a significant advantage. Further studies with naive (untrained) listeners showed that this incongruency advantage (IA) is level-dependent: there is no advantage for incongruent sounds at Sound/Scene ratios (So/Sc) below -7.5 dB, but accuracy is about five percentage points higher for incongruent sounds at greater So/Sc. Testing a new group of trained listeners on a larger corpus of sounds and scenes showed that the effect is robust and not confined to a specific stimulus set. Modeling using spectral-temporal measures showed that neither analyses based on acoustic features nor semantic assessments of sound-scene congruency can account for this difference. This indicates that the IA is a complex effect, possibly arising from the sensitivity of the auditory system to new and unexpected events under particular listening conditions.
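To make the So/Sc manipulation concrete, the sketch below scales a target sound so that its RMS level sits a chosen number of dB above or below the RMS level of a background scene, then mixes the two. The abstract does not state how So/Sc was computed, so the RMS-based definition, the use of the whole scene as the level reference, and all function names here are assumptions made for illustration, not the authors' procedure.

```python
import math

def rms(samples):
    """Root-mean-square amplitude of a non-empty sequence of samples."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))

def gain_for_ratio(sound, scene, ratio_db):
    """Linear gain that puts `sound` at `ratio_db` dB relative to `scene`.

    Assumption: the Sound/Scene ratio is taken as 20*log10(rms_sound / rms_scene),
    with the scene level measured over the entire scene.
    """
    return rms(scene) / rms(sound) * 10 ** (ratio_db / 20)

def embed(sound, scene, ratio_db, offset=0):
    """Mix the scaled sound into a copy of the scene, starting at sample `offset`.

    Samples of the sound that would run past the end of the scene are dropped.
    Returns the mixed signal and the gain that was applied to the sound.
    """
    gain = gain_for_ratio(sound, scene, ratio_db)
    mixed = list(scene)
    for i, x in enumerate(sound):
        if offset + i < len(mixed):
            mixed[offset + i] += gain * x
    return mixed, gain

# Hypothetical toy signals standing in for a recorded sound and scene.
sound = [0.5, -0.5] * 50
scene = [0.1, -0.1] * 200
mixed, gain = embed(sound, scene, ratio_db=-7.5, offset=100)
```

Under this definition, a value of -7.5 dB means the embedded sound's RMS amplitude is roughly 0.42 times that of the scene. Measuring the scene level only over the segment that overlaps the sound would be an equally plausible choice and would give different gains for scenes whose level fluctuates.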
