The Perception of Scale in Vowels

Abstract: Previous reports presented our psychophysical findings on the perception of vowels that had been manipulated to sound as if they were spoken by smaller or larger people, including sizes well beyond the normal range of the population. In addition to that earlier research, this final report includes an experiment showing that speaker size can be extracted from a speech-like sequence of vowels that does not possess any simple spectral cue. We provide a detailed motivation for, and discussion of, scale in vowel sounds. Our results give us confidence that human listeners can extract both vowel type and speaker size from vowel sounds, even when the size and pitch are well beyond normal experience.
