Cortical processing of pitch: Model-based encoding and decoding of auditory fMRI responses to real-life sounds

ABSTRACT Pitch is a perceptual attribute related to the fundamental frequency (or periodicity) of a sound. So far, the cortical processing of pitch has been investigated mostly using synthetic sounds. However, the complex harmonic structure of natural sounds may require different mechanisms for the extraction and analysis of pitch. This study investigated the neural representation of pitch in human auditory cortex using model‐based encoding and decoding analyses of high‐field (7 T) functional magnetic resonance imaging (fMRI) data collected while participants listened to a wide range of real‐life sounds. Specifically, we modeled the fMRI responses as a function of the sounds' perceived pitch height and salience (related to the fundamental frequency and the harmonic structure, respectively), which we estimated with a computational pitch‐extraction algorithm (de Cheveigné and Kawahara, 2002). First, using single‐voxel fMRI encoding, we identified a pitch‐coding region in the antero‐lateral Heschl's gyrus (HG) and adjacent superior temporal gyrus (STG). In these regions, the pitch representation model combining height and salience predicted the fMRI responses better than other models of acoustic processing and, in the right hemisphere, better than pitch representations based on height or salience alone. Second, we used model‐based decoding to show that multi‐voxel response patterns of the identified regions are more informative of perceived pitch than the remainder of the auditory cortex. Further multivariate analyses showed that complementing a multi‐resolution spectro‐temporal sound representation with pitch produces a small but significant improvement in the decoding of complex sounds from fMRI response patterns.
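As an illustration of how pitch height and salience might be estimated from a sound, the following is a minimal sketch of the core steps of the YIN algorithm (de Cheveigné and Kawahara, 2002): the difference function, its cumulative mean normalization, and an absolute threshold. It is a simplified illustration rather than the study's actual pipeline. The function name `yin_frame`, the parameter defaults, and the definition of salience as one minus the normalized difference at the chosen lag are assumptions made here, and refinements of the published algorithm such as parabolic interpolation are omitted.

```python
import numpy as np

def yin_frame(frame, sr, fmin=60.0, fmax=1000.0, threshold=0.1):
    """Return (f0_hz, salience) for one frame using a simplified YIN.

    Hypothetical helper: parameter defaults and the salience definition
    (1 - normalized difference at the chosen lag) are assumptions.
    """
    frame = np.asarray(frame, dtype=float)
    tau_min = int(sr / fmax)           # shortest lag searched
    tau_max = int(sr / fmin)           # longest lag searched
    w = len(frame) - tau_max           # integration window length
    if w <= 0:
        raise ValueError("frame too short for the requested fmin")

    # Difference function d(tau) = sum_j (x[j] - x[j + tau])^2
    d = np.zeros(tau_max + 1)
    for tau in range(1, tau_max + 1):
        diff = frame[:w] - frame[tau:tau + w]
        d[tau] = np.dot(diff, diff)

    # Cumulative mean normalized difference d'(tau), with d'(0) = 1
    cmnd = np.ones(tau_max + 1)
    running_sum = np.cumsum(d[1:])
    lags = np.arange(1, tau_max + 1)
    cmnd[1:] = d[1:] * lags / np.maximum(running_sum, 1e-12)

    # First lag below the threshold, refined to its local minimum;
    # fall back to the global minimum if no lag is below the threshold.
    search = cmnd[tau_min:tau_max + 1]
    below = np.where(search < threshold)[0]
    if below.size:
        tau = int(below[0]) + tau_min
        while tau + 1 <= tau_max and cmnd[tau + 1] < cmnd[tau]:
            tau += 1
    else:
        tau = int(np.argmin(search)) + tau_min

    f0 = sr / tau
    salience = float(np.clip(1.0 - cmnd[tau], 0.0, 1.0))
    return f0, salience
```

Applied to a periodic frame (for example a steady tone), this sketch should return a lag near the period and a salience close to 1, while a noise frame should yield a much lower salience.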
In sum, this work extends model‐based fMRI encoding and decoding methods, previously employed to examine the representation and processing of acoustic sound features in the human auditory system, to the representation and processing of a relevant perceptual attribute such as pitch. Taken together, the results of our model‐based encoding and decoding analyses indicate that the pitch of complex real‐life sounds is extracted and processed in lateral HG/STG regions, at locations consistent with those indicated in several previous fMRI studies using synthetic sounds. Within these regions, pitch‐related sound representations reflect the modulatory combination of the height and the salience of the pitch percept.

HIGHLIGHTS
Pitch processing is analyzed with a model‐based fMRI encoding/decoding approach.
The perceived pitch is modeled as a combination of height and salience.
Lateral HG and STG responses reflect pitch height and salience.
Pitch information improves the decoding of the fMRI responses to natural sounds.
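The single‐voxel model comparison summarized in the abstract can be illustrated with a small sketch: competing feature models are fit with ridge regression and ranked by how well they predict held‐out responses. Everything below is hypothetical and uses synthetic data. The helper names, the regularization value, the five‐fold split, and the choice to represent the combined model as height, salience, and their product are assumptions for illustration, not the study's actual feature space or fitting procedure.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge weights: (X'X + lam * I)^-1 X'y."""
    n_feat = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_feat), X.T @ y)

def cv_prediction_accuracy(X, y, lam=1.0, n_folds=5, seed=0):
    """Mean correlation between predicted and held-out responses."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    scores = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
        # Standardize features with training-set statistics only
        mu = X[train_idx].mean(axis=0)
        sd = X[train_idx].std(axis=0) + 1e-12
        X_train = (X[train_idx] - mu) / sd
        X_test = (X[test_idx] - mu) / sd
        y_mean = y[train_idx].mean()
        w = ridge_fit(X_train, y[train_idx] - y_mean, lam)
        pred = X_test @ w + y_mean
        scores.append(np.corrcoef(pred, y[test_idx])[0, 1])
    return float(np.mean(scores))

# Synthetic single-voxel example (all values are made up for illustration):
# the simulated response depends on the product of height and salience.
rng = np.random.default_rng(42)
n_sounds = 200
height = rng.uniform(0.0, 1.0, n_sounds)
salience = rng.uniform(0.0, 1.0, n_sounds)
voxel = height * salience + 0.1 * rng.standard_normal(n_sounds)

models = {
    "height": height[:, None],
    "salience": salience[:, None],
    "combined": np.column_stack([height, salience, height * salience]),
}
accuracy = {name: cv_prediction_accuracy(X, voxel) for name, X in models.items()}
```

In this toy setting the combined model should predict the simulated voxel better than either single‐attribute model, which mirrors the kind of comparison reported for the right hemisphere but says nothing about the real data.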

[1]  M. Schönwiesner,et al.  Spectro-temporal modulation transfer function of single voxels in the human auditory cortex measured with high-resolution fMRI , 2009, Proceedings of the National Academy of Sciences.

[2]  Christopher J. Plack,et al.  The effect of stimulus context on pitch representations in the human auditory cortex , 2010, NeuroImage.

[3]  Essa Yacoub,et al.  Encoding of Natural Sounds at Multiple Spectral and Temporal Resolutions in the Human Auditory Cortex , 2014, PLoS Comput. Biol..

[4]  C. Plack Oxford Handbook of Auditory Science: Hearing , 2010 .

[5]  Daniel Bendor,et al.  Dual-Pitch Processing Mechanisms in Primate Auditory Cortex , 2012, The Journal of Neuroscience.

[6]  Elia Formisano,et al.  An anatomical and functional topography of human auditory cortical areas , 2014, Front. Neurosci..

[7]  Ray Meddis,et al.  Virtual pitch and phase sensitivity of a computer model of the auditory periphery , 1991 .

[8]  R. Fay,et al.  Pitch : neural coding and perception , 2005 .

[9]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[10]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[11]  Andrew J Oxenham,et al.  Correct tonotopic representation is necessary for complex pitch perception. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  S. Shamma,et al.  Temporal coherence and attention in auditory scene analysis , 2011, Trends in Neurosciences.

[13]  R. Patterson,et al.  Encoding of the temporal regularity of sound in the human brainstem , 2001, Nature Neuroscience.

[14]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[15]  Essa Yacoub,et al.  Reconstructing the spectrotemporal modulations of real-life sounds from fMRI response patterns , 2017, Proceedings of the National Academy of Sciences.

[16]  J. Gallant,et al.  Identifying natural images from human brain activity , 2008, Nature.

[17]  Gene H. Golub,et al.  Generalized cross-validation as a method for choosing a good ridge parameter , 1979, Milestones in Matrix Computation.

[18]  W A Yost,et al.  A time domain description for the pitch strength of iterated rippled noise. , 1996, The Journal of the Acoustical Society of America.

[19]  Masa-aki Sato,et al.  Visual Image Reconstruction from Human Brain Activity using a Combination of Multiscale Local Image Decoders , 2008, Neuron.

[20]  Essa Yacoub,et al.  Spatial organization of frequency preference and selectivity in the human inferior colliculus , 2012, Nature Communications.

[21]  Dave R. M. Langers,et al.  Tonotopic mapping of human auditory cortex , 2014, Hearing Research.

[22]  Timothy D Griffiths,et al.  Mapping Pitch Representation in Neural Ensembles with fMRI , 2012, The Journal of Neuroscience.

[23]  Daniel Bendor,et al.  Cortical representations of pitch in monkeys and humans , 2006, Current Opinion in Neurobiology.

[24]  Powen Ru,et al.  Multiresolution spectrotemporal analysis of complex sounds. , 2005, The Journal of the Acoustical Society of America.

[25]  Jeremy Marozeau,et al.  The dependency of timbre on fundamental frequency , 2003 .

[26]  Kathleen A. Hansen,et al.  Modeling low‐frequency fluctuation and hemodynamic response timecourse in event‐related fMRI , 2008, Human brain mapping.

[27]  Jack L. Gallant,et al.  Encoding and decoding in fMRI , 2011, NeuroImage.

[28]  Andrew J Oxenham,et al.  A Neural Representation of Pitch Salience in Nonprimary Human Auditory Cortex Revealed with Functional Magnetic Resonance Imaging , 2004, The Journal of Neuroscience.

[29]  Ramani Duraiswami,et al.  Neuromimetic Sound Representation for Percept Detection and Manipulation , 2005, EURASIP J. Adv. Signal Process..

[30]  Bruno L. Giordano,et al.  Abstract encoding of auditory objects in cortical activity patterns. , 2013, Cerebral cortex.

[31]  S Shamma,et al.  The case of the missing pitch templates: how harmonic templates emerge in the early auditory system. , 2000, The Journal of the Acoustical Society of America.

[32]  G. Soete,et al.  Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes , 1995, Psychological research.

[33]  Roy D. Patterson,et al.  The relative strength of the tone and noise components in iterated rippled noise , 1996 .

[34]  R. Goebel,et al.  Mirror-Symmetric Tonotopic Maps in Human Primary Auditory Cortex , 2003, Neuron.

[35]  Christopher J Plack,et al.  The human ‘pitch center’ responds differently to iterated noise and Huggins pitch , 2007, Neuroreport.

[36]  Christopher J Plack,et al.  Reexamining the evidence for a pitch-sensitive region: a human fMRI study using iterated ripple noise. , 2012, Cerebral cortex.

[37]  D. Hall,et al.  Pitch Processing Sites in the Human Auditory Brain , 2008, Cerebral cortex.

[38]  R. Goebel,et al.  Processing of Natural Sounds: Characterization of Multipeak Spectral Tuning in Human Auditory Cortex , 2013, The Journal of Neuroscience.

[39]  R. Patterson,et al.  The Processing of Temporal Pitch and Melody Information in Auditory Cortex , 2002, Neuron.

[40]  Josh H. McDermott,et al.  Cortical Pitch Regions in Humans Respond Primarily to Resolved Harmonics and Are Located in Specific Tonotopic Regions of Anterior Auditory Cortex , 2013, The Journal of Neuroscience.

[41]  S Grossberg,et al.  A spectral network model of pitch perception. , 1995, The Journal of the Acoustical Society of America.

[42]  Noël Staeren,et al.  Sound Categories Are Represented as Distributed Patterns in the Human Auditory Cortex , 2009, Current Biology.

[43]  Shihab A Shamma Topographic organization is essential for pitch perception. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Richard S. J. Frackowiak,et al.  Human Primary Auditory Cortex Follows the Shape of Heschl's Gyrus , 2011, The Journal of Neuroscience.

[45]  A. Cheveigné Pitch perception models-a historical review , 2003 .

[46]  J. Rauschecker,et al.  Cortical Representation of Natural Complex Sounds: Effects of Acoustic Features and Auditory Object Category , 2010, The Journal of Neuroscience.

[47]  Rainer Goebel,et al.  Analysis of functional image analysis contest (FIAC) data with BrainVoyager QX: From single‐subject to cortically aligned group general linear model analysis and self‐organizing group independent component analysis , 2006, Human brain mapping.

[48]  Richard S. J. Frackowiak,et al.  Analysis of temporal structure in sound by the human brain , 1998, Nature Neuroscience.

[49]  Andrew J Oxenham,et al.  Revisiting place and temporal theories of pitch. , 2013, Acoustical science and technology.

[50]  Kuansan Wang,et al.  Spectral shape analysis in the central auditory system , 1995, IEEE Trans. Speech Audio Process..

[51]  Edward C. Carterette,et al.  Perceptual and Acoustical Features of Natural and Synthetic Orchestral Instrument Tones , 1999 .

[52]  William J. Talkington,et al.  Human Cortical Organization for Processing Vocalizations Indicates Representation of Harmonic Structure as a Signal Attribute , 2009, The Journal of Neuroscience.

[53]  Mounya Elhilali,et al.  A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation. , 2008, The Journal of the Acoustical Society of America.

[54]  Jonathan D. Cohen,et al.  Improved Assessment of Significant Activation in Functional Magnetic Resonance Imaging (fMRI): Use of a Cluster‐Size Threshold , 1995, Magnetic resonance in medicine.

[55]  A. J. M. Houtsma,et al.  Pitch and timbre : definition, meaning and use , 1997 .

[56]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[57]  Elia Formisano,et al.  Processing of Natural Sounds in Human Auditory Cortex: Tonotopy, Spectral Tuning, and Relation to Voice Sensitivity , 2012, The Journal of Neuroscience.

[58]  D. Bendor,et al.  The neuronal representation of pitch in primate auditory cortex , 2005, Nature.

[59]  Daniel Bendor Does a pitch center exist in auditory cortex? , 2012, Journal of neurophysiology.

[60]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .