Speech versus song: multiple pitch-sensitive areas revealed by a naturally occurring musical illusion.

It is normally obvious to listeners whether a human vocalization is intended to be heard as speech or song. However, the 2 signals are remarkably similar acoustically. A naturally occurring boundary case between speech and song has been discovered in which a spoken phrase sounds as if it were sung when isolated and repeated. In the present study, an extensive search of audiobooks uncovered additional similar examples, which were contrasted with samples from the same corpus that do not sound like song, despite containing clear prosodic pitch contours. Using functional magnetic resonance imaging, we show that hearing these 2 closely matched stimuli is not associated with differences in the response of early auditory areas. Rather, we find that a network of 8 regions, including the anterior superior temporal gyrus (STG) just anterior to Heschl's gyrus and the right midposterior STG, responds more strongly to speech perceived as song than to mere speech. This network overlaps a number of areas previously associated with pitch extraction and song production, confirming that phrases originally intended to be heard as speech can, under certain circumstances, be heard as song. Our results suggest that song processing, compared with speech processing, makes increased demands on pitch processing and auditory-motor integration.
