Categorization of Extremely Brief Auditory Stimuli: Domain-Specific or Domain-General Processes?

The present study investigated the minimum amount of auditory stimulation that allows differentiation of spoken voices, instrumental music, and environmental sounds. Three new findings were reported. 1) All stimuli were categorized above chance level with 50 ms-segments. 2) When a peak-level normalization was applied, music and voices started to be accurately categorized with 20 ms-segments. When the root-mean-square (RMS) energy of the stimuli was equalized, voice stimuli were better recognized than music and environmental sounds. 3) Further psychoacoustical analyses suggest that the categorization of extremely brief auditory stimuli depends on the variability of their spectral envelope in the used set. These last two findings challenge the interpretation of the voice superiority effect reported in previously published studies and propose a more parsimonious interpretation in terms of an emerging property of auditory categorization processes.

[1]  Mark H. Johnson Subcortical face processing , 2005, Nature Reviews Neuroscience.

[2]  Roy D. Patterson,et al.  Neural Representation of Auditory Size in the Human Voice and in Sounds from Other Resonant Sources , 2007, Current Biology.

[3]  Stefan R Schweinberger,et al.  Human brain potential correlates of voice priming and voice recognition , 2001, Neuropsychologia.

[4]  Micah M. Murray,et al.  Rapid Brain Discrimination of Sounds of Objects , 2006, The Journal of Neuroscience.

[5]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[6]  Eliezer Rapoport,et al.  Singing, Mind and Brain - Unit Pulse, Rhythm, Emotion and Expression , 1996, Joint International Conference on Cognitive and Systematic Musicology.

[8]  R. Zatorre,et al.  Voice-selective areas in human auditory cortex , 2000, Nature.

[9]  Carol L. Krumhansl,et al.  Plink: "Thin Slices" of Music , 2010 .

[10]  Neil A. Macmillan,et al.  Detection Theory: A User's Guide , 1991 .

[11]  M. Coltheart,et al.  Modularity of music processing , 2003, Nature Neuroscience.

[12]  Hideki Kawahara,et al.  In the ear of the beholder: neural correlates of adaptation to voice gender , 2009, The European journal of neuroscience.

[13]  Clara Suied,et al.  Characteristics of human voice processing , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[14]  Pascal Belin,et al.  Is voice processing species-specific in human auditory cortex? An fMRI study , 2004, NeuroImage.

[15]  J. Fodor The Modularity of mind. An essay on faculty psychology , 1986 .

[16]  Paul Iverson,et al.  Name that tune: Identifying popular recordings from brief excerpts , 1999, Psychonomic bulletin & review.

[17]  Emmanuel Bigand,et al.  The Time Course of Emotional Responses to Music , 2005, Annals of the New York Academy of Sciences.

[18]  J. Fodor,et al.  The Modularity of Mind: An Essay on Faculty Psychology , 1984 .

[19]  P. D. Eimas,et al.  Evidence for Representations of Perceptually Similar Natural Categories by 3-Month-Old and 4-Month-Old Infants , 1993, Perception.

[20]  I. Peretz,et al.  Music and modularity , 1989 .

[21]  G Thierry,et al.  Language and Brain: What is Up? What is Coming Up? , 2001, Journal of clinical and experimental neuropsychology.

[22]  P. Belin,et al.  Electrophysiological markers of voice familiarity , 2006, The European journal of neuroscience.

[23]  I. Wambacq,et al.  Non-voluntary and voluntary processing of emotional prosody: an event-related potentials study , 2004, Neuroreport.

[24]  F Grosjean,et al.  Spoken word recognition processes and the gating paradigm , 1980, Perception & psychophysics.

[25]  B. Moore Frequency Selectivity in Hearing , 1987 .

[26]  Pascal Belin,et al.  Electrophysiological evidence for an early processing of human voices , 2009, BMC Neuroscience.

[27]  Barbara Tillmann,et al.  Judging familiarity and emotion from very brief musical excerpts , 2010, Psychonomic bulletin & review.

[28]  P D Eimas,et al.  Development of exclusivity in perceptually based categories of young infants. , 1994, Journal of experimental child psychology.

[29]  S. Bentin,et al.  Processing specificity for human voice stimuli: electrophysiological evidence , 2001, Neuroreport.

[30]  P. Belin Voice processing in human and non-human primates , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[31]  R. Näätänen,et al.  Preattentive voice discrimination by the human brain as indexed by the mismatch negativity , 2001, Neuroscience Letters.

[32]  Burkhard Maess,et al.  Dissociation of human and computer voices in the brain: Evidence for a preattentive gestalt‐like perception , 2003, Human brain mapping.

[33]  J. Ballas Common factors in the identification of an assortment of brief everyday sounds. , 1993, Journal of experimental psychology. Human perception and performance.

[34]  E. T. Possing,et al.  Human temporal lobe activation by speech and nonspeech sounds. , 2000, Cerebral cortex.

[35]  E. Bigand,et al.  Catégorisation asymétrique de séquences de hauteurs musicales , 2008 .

[36]  P. Alku,et al.  Electromagnetic recordings reveal latency differences in speech and tone processing in humans. , 1999, Brain research. Cognitive brain research.

[37]  E. Bigand,et al.  The Contribution of Local Features to Familiarity Judgments in Music , 2009, Annals of the New York Academy of Sciences.