Speaker race identification from acoustic cues in the vocal signal.

One-second acoustic samples were extracted from the mid-portion of sustained /a/ vowels produced by 50 black and 50 white adult males. Each vowel sample from a black subject was randomly paired with a sample from a white subject. From the tape-recorded samples alone, both expert and naive listeners could determine the race of the speaker with 60% accuracy. The accuracy of race identification was independent of the listener's own race, sex, or listening experience. An acoustic analysis of the samples revealed that, although within ranges reported by previous studies of normal voices, the black speakers had greater frequency perturbation, significantly greater amplitude perturbation, and a significantly lower harmonics-to-noise ratio than did the white speakers. The listeners were most successful in distinguishing voice pairs when the differences in vocal perturbation and additive noise were greatest and were least successful when such differences were minimal or absent. Because there were no significant differences in the mean fundamental frequency or formant structure of the voice samples, it is likely that the listeners relied on differences in spectral noise to discriminate the black and white speakers.

[1]  E. Tarone,et al.  Aspects of Intonation in Black English. , 1973 .

[2]  Ralph W. Fasold,et al.  THE RELATION BETWEEN BLACK AND WHITE SPEECH IN THE SOUTH , 1981 .

[3]  Y. Horii Fundamental frequency perturbation observed in sustained phonation. , 1979, Journal of speech and hearing research.

[4]  R F Orlikoff,et al.  The relationship of age and cardiovascular health to certain acoustic characteristics of male voices. , 1990, Journal of speech and hearing research.

[5]  E. Yumoto,et al.  Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness. , 1984, Journal of speech and hearing research.

[6]  L. F. Willems,et al.  Measurement of pitch in speech: an implementation of Goldstein's theory of pitch perception. , 1982, The Journal of the Acoustical Society of America.

[7]  N. Lass,et al.  The effect of filtered speech on speaker race and sex identifications , 1980 .

[8]  Anthony Holbrook,et al.  A Study of the Frequency Reading Fundamental Vocal of Young Black Adults , 1981 .

[9]  Harry Hollien,et al.  Adolescent voice change in southern Negro males , 1962 .

[10]  H. Hollien,et al.  Speaking fundamental frequency and chronologic age in males. , 1972, Journal of speech and hearing research.

[11]  Y. Koike Application of Some Acoustic Measures for the Evaluation of Laryngeal Dysfunction , 1967 .

[12]  J. Baugh Black Street Speech: Its History, Structure, and Survival , 1985 .

[13]  R F Orlikoff Vowel amplitude variation associated with the heart cycle. , 1990, The Journal of the Acoustical Society of America.

[14]  F. Emanuel,et al.  Some waveform and spectral features of vowel roughness. , 1978, Journal of speech and hearing research.

[15]  An analysis of vocal frequency and duration characteristics of selected samples of speech from three American dialect regions , 1951 .

[16]  Eiji Yumoto,et al.  The Quantitative Evaluation of Hoarseness: A New Harmonics to Noise Ratio Method , 1983 .

[17]  T. Baer,et al.  A pitch-synchronous analysis of hoarseness in running speech. , 1988, The Journal of the Acoustical Society of America.

[18]  Doris J. Kistler,et al.  Perceptual dimensions of dysphonic voices , 1984 .

[19]  Ralph W. Fasold,et al.  The study of social dialects in American English , 1974 .

[20]  A. Holbrook,et al.  Fundamental frequency characteristics of young Black adults: spontaneous speaking and oral reading. , 1982, Journal of speech and hearing research.

[21]  Y. Koike,et al.  Some perceptual dimensions and acoustical correlates of pathologic voices. , 1976, Acta oto-laryngologica. Supplementum.

[22]  R. Orlikoff,et al.  The effect of the heartbeat on vocal fundamental frequency perturbation. , 1989, Journal of speech and hearing research.

[23]  T. Murry Vocal tract parameters associated with voice quality and preference , 1988 .

[24]  L H Wells,et al.  A Note on Two Abnormal Laryngeal Muscles in a Zulu. , 1927, Journal of anatomy.

[25]  J. Till,et al.  Effects of initial consonant, pneumotachographic mask, and oral pressure tube on vocal perturbation, harmonics-to-noise, and intensity measurements , 1992 .

[26]  A. Hudson,et al.  Spontaneous speaking fundamental frequency of 6-year-old black children. , 1988, Journal of speech and hearing research.

[27]  J. Kreiman,et al.  Perceptual evaluation of voice quality: review, tutorial, and a framework for future research. , 1993, Journal of speech and hearing research.

[28]  J. L. Dillard Lexicon of Black English , 1979 .

[29]  R L Beckett,et al.  Pitch perturbation as a function of subjective vocal constriction. , 1969, Folia phoniatrica.

[30]  E T Doherty,et al.  Tape recorder effects on jitter and shimmer extraction. , 1988, Journal of speech and hearing research.

[31]  Rupert G. Miller Simultaneous Statistical Inference , 1966 .

[32]  Harvey Fletcher,et al.  Loudness, Pitch and the Timbre of Musical Tones and Their Relation to the Intensity, the Frequency and the Overtone Structure , 1934 .

[33]  W. S. Brown,et al.  Judgments of voice quality and preference: Acoustic interpretations , 1987 .

[34]  J. L. Dillard Black English; Its History and Usage In the United States , 1973 .

[35]  Y. Horii Jitter and shimmer differences among sustained vowel phonations. , 1982, Journal of speech and hearing research.

[36]  Eiji Yumoto Quantitative assessment of the degree of hoarseness , 1988 .

[37]  N. Lass,et al.  The effect of phonetic complexity on speaker race and sex identifications , 1979 .

[38]  George A. Gescheider,et al.  Hearing: Physiological Acoustics, Neural Coding, and Psychoacoustics , 1989 .

[39]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[40]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[41]  W. Gould,et al.  Vocal Shimmer in Sustained Phonation of Normal and Pathologic Voice , 1976, The Annals of otology, rhinology, and laryngology.

[42]  H. Hollien,et al.  Perceived pitch and fundamental frequency comparisons of institutionalized Down's syndrome children. , 1978, Folia phoniatrica.

[43]  Y Horii,et al.  Vocal shimmer in sustained phonation. , 1980, Journal of speech and hearing research.

[44]  E. Mysak Pitch and duration characteristics of older males. , 1959, Journal of speech and hearing research.

[45]  S. Imaizumi Acoustic measures of roughness in pathological voice , 1986 .