The role of spectral cues in timbre discrimination by ferrets and humans.

Timbre distinguishes sounds of equal loudness, pitch, and duration; however, little is known about the neural mechanisms underlying timbre perception. Such understanding requires animal models such as the ferret in which neuronal and behavioral observation can be combined. The current study asked what spectral cues ferrets use to discriminate between synthetic vowels. Ferrets were trained to discriminate vowels differing in the position of the first (F1) and second formants (F2), inter-formant distance, and spectral centroid. In experiment 1, ferrets responded to probe trials containing novel vowels in which the spectral cues of trained vowels were mismatched. Regression models fitted to behavioral responses determined that F2 and spectral centroid were stronger predictors of ferrets' behavior than either F1 or inter-formant distance. Experiment 2 examined responses to single formant vowels and found that individual spectral peaks failed to account for multi-formant vowel perception. Experiment 3 measured responses to unvoiced vowels and showed that ferrets could generalize vowel identity across voicing conditions. Experiment 4 employed the same design as experiment 1 but with human participants. Their responses were also predicted by F2 and spectral centroid. Together these findings further support the ferret as a model for studying the neural processes underlying timbre perception.

[1]  Brian C J Moore,et al.  Basic auditory processes involved in the analysis of speech sounds , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[2]  A. Liberman,et al.  An Experimental Study of the Acoustic Determinants of Vowel Color; Observations on One- and Two-Formant Vowels Synthesized from Spectrographic Patterns , 1952 .

[3]  James D. Miller Auditory‐perceptual interpretation of the vowel , 1987 .

[4]  J. Fritz,et al.  Rhesus macaques spontaneously perceive formants in conspecific vocalizations. , 2006, The Journal of the Acoustical Society of America.

[5]  S. Kojima,et al.  Comparison of auditory functions in the chimpanzee and human. , 1990, Folia primatologica; international journal of primatology.

[6]  S. Singh,et al.  Perceptual structure of 12 American English vowels. , 1971, The Journal of the Acoustical Society of America.

[7]  M. Yano,et al.  On the effectiveness of whole spectral shape for vowel perception. , 2001, The Journal of the Acoustical Society of America.

[8]  W. Fitch,et al.  Perception of Vocal Tract Resonances by Whooping Cranes Grus americana , 2000 .

[9]  Christian J Sumner,et al.  Auditory nerve fibre responses in the ferret , 2012, The European journal of neuroscience.

[10]  B. Rakerd,et al.  Vowels in consonantal context are perceived more linguistically than are isolated vowels: Evidence from an individual differences scaling study , 1984, Perception & psychophysics.

[11]  J. Fritz,et al.  Rapid task-related plasticity of spectrotemporal receptive fields in primary auditory cortex , 2003, Nature Neuroscience.

[12]  S. Zahorian,et al.  Spectral-shape features versus formants as acoustic correlates for vowels. , 1993, The Journal of the Acoustical Society of America.

[13]  Rebecca J Snyder,et al.  The information content of giant panda, Ailuropoda melanoleuca, bleats: acoustic cues to sex, age and size , 2009, Animal Behaviour.

[14]  C J Darwin,et al.  Listening to speech in the presence of other sounds , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[15]  H Scheich,et al.  Orderly cortical representation of vowels based on formant interaction. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Hideki Kawahara,et al.  Missing-data model of vowel identification. , 1999, The Journal of the Acoustical Society of America.

[17]  S. Shamma,et al.  Spectral-ripple representation of steady-state vowels in primary auditory cortex. , 1998, The Journal of the Acoustical Society of America.

[18]  Shigeru Kiritani,et al.  Vocal-auditory functions in the chimpanzee: Vowel perception , 1989, International Journal of Primatology.

[19]  Michelle R Molis,et al.  Evaluating models of vowel perception. , 2005, The Journal of the Acoustical Society of America.

[20]  M. Loeb,et al.  AUDITORY SENSITIVITY OF THE RHEUSUS MONKEY. , 1965, Journal of comparative and physiological psychology.

[21]  Mark N. Coleman What Do Primates Hear? A Meta-analysis of All Known Nonhuman Primate Behavioral Audiograms , 2009, International Journal of Primatology.

[22]  Stephen McAdams,et al.  Perspectives on the Contribution of Timbre to Musical Structure , 1999, Computer Music Journal.

[23]  Dennis H. Klatt,et al.  Prediction of perceived phonetic distance from critical-band spectra: A first step , 1982, ICASSP.

[24]  A. Bregman,et al.  The perceptual segregation of simultaneous auditory signals: Pulse train segregation and vowel segregation , 1989, Perception & psychophysics.

[25]  Johan J Hanekom,et al.  The relative importance of spectral cues for vowel recognition in severe noise. , 2012, The Journal of the Acoustical Society of America.

[26]  R B Gardner,et al.  Mistuning a harmonic of a vowel: grouping and phase effects on vowel quality. , 1986, The Journal of the Acoustical Society of America.

[27]  J. Bizley,et al.  Neural and behavioral investigations into timbre perception , 2013, Front. Syst. Neurosci..

[28]  S. McAdams,et al.  Acoustic correlates of timbre space dimensions: a confirmatory study using synthetic tones. , 2005, The Journal of the Acoustical Society of America.

[29]  Charles H. Brown,et al.  A multidimensional scaling analysis of vowel discrimination in humans and monkeys , 1997 .

[30]  Kerry M. M. Walker,et al.  Multiplexed and Robust Representations of Sound Features in Auditory Cortex , 2011, The Journal of Neuroscience.

[31]  R. Plomp,et al.  Dimensional analysis of vowel spectra , 1967 .

[32]  J. Kelly,et al.  Hearing in the ferret (Mustela putorius): Thresholds for pure tone detection , 1986, Hearing Research.

[33]  Uwe Firzlaff,et al.  Perception and neural representation of size-variant human vowels in the Mongolian gerbil (Meriones unguiculatus) , 2010, Hearing Research.

[34]  G. Soete,et al.  Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes , 1995, Psychological research.

[35]  Terrance M. Nearey,et al.  Vowel Perception in Normal Speakers , 2012 .

[36]  Auditory nerve representation of naturally-produced vowels with variable acoustics , 2005, Hearing Research.

[37]  Kerry M. M. Walker,et al.  Interdependent Encoding of Pitch, Timbre, and Spatial Location in Auditory Cortex , 2009, The Journal of Neuroscience.

[38]  Gabriel J. L. Beckers,et al.  Zebra finches exhibit speaker-independent phonetic perception of human speech , 2010, Proceedings of the Royal Society B: Biological Sciences.

[39]  Kerry M. M. Walker,et al.  Spectral timbre perception in ferrets: discrimination of artificial vowels under different listening conditions. , 2013, The Journal of the Acoustical Society of America.

[40]  L. Feth,et al.  Bandwidth of spectral resolution for two-formant synthetic vowels and two-tone complex signals. , 2004, The Journal of the Acoustical Society of America.

[41]  M. B. Sachs,et al.  Representation of whispered vowels in discharge patterns of auditory-nerve fibers , 1982, Hearing Research.

[42]  L. Chistovich,et al.  The ‘center of gravity’ effect in vowel spectra and critical distance between the formants: Psychoacoustical study of the perception of vowel-like stimuli , 1979, Hearing Research.

[43]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[44]  C. ten Cate,et al.  Zebra finches and Dutch adults exhibit the same cue weighting bias in vowel perception , 2011, Animal Cognition.

[45]  T. Clutton‐Brock,et al.  Red deer stags use formants as assessment cues during intrasexual agonistic interactions , 2005, Proceedings of the Royal Society B: Biological Sciences.

[46]  R J Dooling,et al.  Hearing in passerine and psittacine birds: a comparative study of absolute and masked auditory thresholds. , 1987, Journal of comparative psychology.

[47]  H. Scheich,et al.  Neuronal discrimination of natural and synthetic vowels in field L of trained mynah birds , 2004, Experimental Brain Research.

[48]  Kerry M. M. Walker,et al.  Auditory Cortex Represents Both Pitch Judgments and the Corresponding Acoustic Cues , 2013, Current Biology.

[49]  R. Plomp,et al.  Perceptual and physical space of vowel sounds. , 1969, The Journal of the Acoustical Society of America.

[50]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .