Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech.

Although listeners routinely perceive both the sex and individual identity of talkers from their speech, explanations of these abilities are incomplete. Here, variation in vocal production-related anatomy was assumed to affect vowel acoustics thought to be critical for indexical cueing. Integrating this approach with source-filter theory, patterns of acoustic parameters that should represent sex and identity were identified. Due to sexual dimorphism, the combination of fundamental frequency (F0, reflecting larynx size) and vocal tract length cues (VTL, reflecting body size) was predicted to provide the strongest acoustic correlates of talker sex. Acoustic measures associated with presumed variations in supralaryngeal vocal tract-related anatomy occurring within sex were expected to be prominent in individual talker identity. These predictions were supported by results of analyses of 2500 tokens of the /epsilon/ phoneme, extracted from the naturally produced speech of 125 subjects. Classification by talker sex was virtually perfect when F0 and VTL were used together, whereas talker classification depended primarily on the various acoustic parameters associated with vocal-tract filtering.

[1]  M J Owren,et al.  The role of vocal tract filtering in identity cueing in rhesus monkey (Macaca mulatta) vocalizations. , 1998, The Journal of the Acoustical Society of America.

[2]  S. Whiteside,et al.  Identification of a Speaker's Sex: A Study of Vowels , 1998, Perceptual and motor skills.

[3]  J. W. Horst,et al.  Frequency discrimination of stylized synthetic vowels with a single formant. , 1997, The Journal of the Acoustical Society of America.

[4]  W. Fitch Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques. , 1997, The Journal of the Acoustical Society of America.

[5]  Robert Hagiwara,et al.  DIALECT VARIATION AND FORMANT FREQUENCY : THE AMERICAN ENGLISH VOWELS REVISITED , 1997 .

[6]  Jennifer M. Fellowes,et al.  Talker identification based on phonetic information. , 1997, Journal of experimental psychology. Human perception and performance.

[7]  R. Seyfarth,et al.  The acoustic features of vowel-like grunt calls in chacma baboons (Papio cyncephalus ursinus): implications for production processes and functions. , 1997, The Journal of the Acoustical Society of America.

[8]  P. Lieberman,et al.  Fundamental frequency of phonation and perceived emotional stress. , 1997, The Journal of the Acoustical Society of America.

[9]  H. Traunmüller,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Comparative Study of the Male and Female Whispered and Phonated Versions of the Long Vowels of Swedish , 2022 .

[10]  S. Goldinger Words and voices: episodic traces in spoken word identification and recognition memory. , 1996, Journal of experimental psychology. Learning, memory, and cognition.

[11]  J. Bachorowski,et al.  Vocal Expression of Emotion: Acoustic Properties of Speech Are Associated With Emotional Intensity and Context , 1995 .

[12]  D. Kewley-Port,et al.  Fundamental frequency effects on thresholds for vowel formant discrimination. , 1994, The Journal of the Acoustical Society of America.

[13]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[14]  D Byrd,et al.  Preliminary results on speaker-dependent variation in the TIMIT database. , 1992, The Journal of the Acoustical Society of America.

[15]  Tohru Takagi,et al.  Acoustic parameters of voice individuality and voice-quality control by analysis-synthesis method , 1991, Speech Commun..

[16]  D G Childers,et al.  Gender recognition from speech. Part II: Fine analysis. , 1991, The Journal of the Acoustical Society of America.

[17]  D. Childers,et al.  Gender recognition from speech. Part I: Coarse analysis. , 1991, The Journal of the Acoustical Society of America.

[18]  V C Tartter,et al.  Identifiability of vowels and speakers from whispered syllables , 1991, Perception & psychophysics.

[19]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[20]  I. Titze Physiologic and acoustic differences between male and female voices. , 1989, The Journal of the Acoustical Society of America.

[21]  E T Doherty,et al.  Tape recorder effects on jitter and shimmer extraction. , 1988, Journal of speech and hearing research.

[22]  James D. Miller Auditory‐perceptual interpretation of the vowel , 1987 .

[23]  K. Scherer Vocal affect expression: a review and a model for future research. , 1986, Psychological bulletin.

[24]  T Murry,et al.  Multidimensional analysis of male and female voices. , 1980, The Journal of the Acoustical Society of America.

[25]  T Murry,et al.  Multidimensional classification of normal voice qualities. , 1977, The Journal of the Acoustical Society of America.

[26]  R O Coleman,et al.  A comparison of the contributions of two voice quality characteristics to the perception of maleness and femaleness in the voice. , 1976, Journal of speech and hearing research.

[27]  N. Lass,et al.  Speaker sex identification from voiced, whispered, and filtered isolated vowels. , 1974, The Journal of the Acoustical Society of America.

[28]  R O Coleman,et al.  Speaker identification in the absence of inter-subject differences in glottal source characteristics. , 1973, The Journal of the Acoustical Society of America.

[29]  R O Coleman,et al.  Male and female voice quality and its relationship to vowel formant frequencies. , 1971, Journal of speech and hearing research.

[30]  F. Ingemann,et al.  Identification of the speaker's sex from voiceless fricatives. , 1968, The Journal of the Acoustical Society of America.

[31]  M. F. Schwartz,et al.  Identification of speaker sex from isolated, voiceless fricatives. , 1968, The Journal of the Acoustical Society of America.

[32]  H. Fujisaki,et al.  The roles of pitch and higher formants in the perception of vowels , 1968 .

[33]  K. Stevens,et al.  Development of a Quantitative Description of Vowel Articulation , 1955 .

[34]  H M Hanson,et al.  Glottal characteristics of female speakers: acoustic correlates. , 1997, The Journal of the Acoustical Society of America.

[35]  M Kagoshima,et al.  Effects of Y-24180, a long-acting and potent antagonist to platelet-activating factor, on immediate asthmatic response in guinea pigs. , 1997, Pharmacology.

[36]  D. Maurer,et al.  Intelligibility and spectral differences in high-pitched vowels. , 1996, Folia phoniatrica et logopaedica : official organ of the International Association of Logopedics and Phoniatrics.

[37]  M. Flug,et al.  Behavior of the basement membrane during carcinoma cell invasion in chemically induced carcinomas of the skin. , 1996, Acta anatomica.

[38]  Inger Karlsson,et al.  Female voices in speech synthesis , 1991 .

[39]  D. Broadbent,et al.  Information Conveyed by Vowels , 1957 .

[40]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .