Processing the acoustic effect of size in speech sounds

The length of a vocal tract is reflected in the sound it is producing. The length of the vocal tract is correlated with body size and humans are very good at making size judgments based on the acoustic effect of vocal tract length only. Here we investigate the underlying mechanism for processing this main auditory cue to size information in the human brain. Sensory encoding of the acoustic effect of vocal tract length (VTL) depends on a time-stabilized spectral scaling mechanism that is independent of glottal pulse rate (GPR, or voice pitch); we provide evidence that a potential neural correlate for such a mechanism exists in the medial geniculate body (MGB). The perception of the acoustic effect of speaker size is influenced by GPR suggesting an interaction between VTL and GPR processing; such an interaction occurs only at the level of non-primary auditory cortex in planum temporale and anterior superior temporal gyrus. Our findings support a two-stage model for the processing of size information in speech based on an initial stage of sensory analysis as early as MGB, and a neural correlate of the perception of source size in non-primary auditory cortex.

[1]  Richard E. Turner,et al.  The processing and perception of size information in speech sounds. , 2005, The Journal of the Acoustical Society of America.

[2]  A. R. Jennings,et al.  Analysis of the spectral envelope of sounds by the human brain , 2005, NeuroImage.

[3]  Hideki Kawahara,et al.  Underlying Principles of a High-quality Speech Manipulation System STRAIGHT and Its Application to Speech Segregation , 2005, Speech Separation by Humans and Machines.

[4]  T. Griffiths,et al.  The planum temporale as a computational hub , 2002, Trends in Neurosciences.

[5]  Roy D. Patterson,et al.  Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform , 2002, Speech Commun..

[6]  Lee M. Miller,et al.  Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex. , 2002, Journal of neurophysiology.

[7]  S. M. Marcus Acoustic determinants of perceptual center (P-center) location , 1981, Perception & psychophysics.

[8]  Timothy D. Griffiths,et al.  Functional Imaging of Pitch Processing , 2005 .

[9]  P Iverson,et al.  Perceptual interactions between musical pitch and timbre. , 1992, Journal of experimental psychology. Human perception and performance.

[10]  R. Bowtell,et al.  “sparse” temporal sampling in auditory fMRI , 1999, Human brain mapping.

[11]  T. Clutton‐Brock,et al.  Red deer stags use formants as assessment cues during intrasexual agonistic interactions , 2005, Proceedings of the Royal Society B: Biological Sciences.

[12]  C. Schreiner,et al.  Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. , 1988, Journal of neurophysiology.

[13]  Karl J. Friston,et al.  Spatial registration and normalization of images , 1995 .

[14]  Andrew J Oxenham,et al.  A Neural Representation of Pitch Salience in Nonprimary Human Auditory Cortex Revealed with Functional Magnetic Resonance Imaging , 2004, The Journal of Neuroscience.

[15]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .

[16]  R. Patterson,et al.  The Processing of Temporal Pitch and Melody Information in Auditory Cortex , 2002, Neuron.

[17]  R. Patterson,et al.  Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform. , 1995, The Journal of the Acoustical Society of America.

[18]  R. Patterson,et al.  Encoding of the temporal regularity of sound in the human brainstem , 2001, Nature Neuroscience.

[19]  Drew Rendall,et al.  Reliable but weak voice‐formant cues to body size in men but not women , 2005 .

[20]  Roy D Patterson,et al.  Discrimination of speaker size from syllable phrases. , 2005, Journal of the Acoustical Society of America.

[21]  Nina Kraus,et al.  Thalamic asymmetry is related to acoustic signal complexity , 1999, Neuroscience Letters.

[22]  H J Künzel,et al.  How Well Does Average Fundamental Frequency Correlate with Speaker Height and Weight? , 1989, Phonetica.

[23]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[24]  L E Marks,et al.  Interaction among auditory dimensions: Timbre, pitch, and loudness , 1990, Perception & psychophysics.

[25]  Anne-Lise Giraud,et al.  Distinct functional substrates along the right superior temporal sulcus for the processing of voices , 2004, NeuroImage.

[26]  R. Zatorre,et al.  Voice-selective areas in human auditory cortex , 2000, Nature.

[27]  W. Fitch,et al.  Morphology and development of the human vocal tract: a study using magnetic resonance imaging. , 1999, The Journal of the Acoustical Society of America.

[28]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[29]  L Fairchild,et al.  Mate Selection and Behavioral Thermoregulation in Fowler's Toads. , 1981, Science.

[31]  Roy D Patterson,et al.  The interaction of glottal-pulse rate and vocal-tract length in judgements of speaker size, sex, and age. , 2005, The Journal of the Acoustical Society of America.

[32]  Julio González,et al.  Formant frequencies and body size of speaker: a weak relationship in adult humans , 2004, J. Phonetics.

[33]  Jeremy Marozeau,et al.  The dependency of timbre on fundamental frequencya ) , 2003 .

[34]  Terry M. Peters,et al.  3D statistical neuroanatomical models from 305 MRI volumes , 1993, 1993 IEEE Conference Record Nuclear Science Symposium and Medical Imaging Conference.

[35]  C. Schreiner,et al.  Periodicity coding in the inferior colliculus of the cat. II. Topographical organization. , 1988, Journal of neurophysiology.

[36]  A. Dale,et al.  Tonotopic organization in human auditory cortex revealed by progressions of frequency sensitivity. , 2004, Journal of neurophysiology.

[37]  Alan C. Evans,et al.  Event-Related fMRI of the Auditory Cortex , 1998, NeuroImage.