The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels

Abstract The aim of the present study was to examine the relative importance of various acoustic features as cues to familiar speaker identification for the vowel /a/. To this aim, a group of 20 speakers was recorded. The speakers' voices were modified using an analysis–synthesis system, which enabled analysis and modification of the glottal waveform, of the fundamental frequency, and of the vocal tract formants. Thirty listeners, very familiar with the speakers’ voices, had to identify the speakers in an open-set, multiple-choice experiment. The results suggest that on average, the contribution of the vocal tract features to the identification process is more important than that of the glottal source features. The exact shape of the glottal waveform was found to be of minor importance. Examination of individual speakers reveals that changes of identical features affect the identification rate of various speakers differently. This finding suggests that for each speaker a different group of acoustic features serves as the cue to the vocal identity.

[1]  K E Cummings,et al.  Analysis of the glottal excitation of emotionally styled and stressed speech. , 1995, The Journal of the Acoustical Society of America.

[2]  Per Hedelin A glottal LPC-vocoder , 1984, ICASSP.

[3]  Inger Karlsson Glottal wave forms for normal female speakers , 1986 .

[4]  John G. Proakis,et al.  Digital Signal Processing: Principles, Algorithms, and Applications , 1992 .

[5]  Per Hedelin High quality glottal LPC-vocoding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[7]  T. Carrell Contributions of Fundamental Frequency, Formant Spacing, and Glottal Waveform to Talker Identification. Research on Speech Perception. Technical Report No. 5. , 1984 .

[8]  P. J. Price,et al.  Male and female voice source characteristics: Inverse filtering results , 1989, Speech Commun..

[9]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[10]  A. Gray,et al.  Least squares glottal inverse filtering from the acoustic speech waveform , 1979 .

[11]  D. Lancker,et al.  Familiar voice recognition: Patterns and parameters. Part I. Recognition of backward voices , 1985 .

[12]  D. Lancker,et al.  Familiar voice recognition: Parameters and patterns. Part II. Recognition of rate-altered voices , 1985 .

[13]  J. Skoglund,et al.  Investigation on the Audibility of Glottal Parameter Variations in Speech Synthesis , 1994 .

[14]  Björn Lindblom,et al.  Frontiers of speech communication research , 1979 .

[15]  S. Hiki,et al.  Multidimensional representation of personal quality of vowels and its acoustical correlates , 1973 .

[16]  M. Sondhi,et al.  Measurement of the glottal waveform. , 1975, The Journal of the Acoustical Society of America.

[17]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[18]  G Papcun,et al.  Long-term memory for unfamiliar voices. , 1989, The Journal of the Acoustical Society of America.

[19]  Joan E. Miller,et al.  Decapitation and Recapitation, a Study of Voice Quality , 1964 .

[20]  Tohru Takagi,et al.  Acoustic parameters of voice individuality and voice-quality control by analysis-synthesis method , 1991, Speech Commun..

[21]  Chin-Hui Lee Speech Science and Technology. , 1993 .

[22]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[23]  R. B. Monsen,et al.  Study of variations in the male and female glottal wave. , 1976, The Journal of the Acoustical Society of America.

[24]  A. Schmidt‐Nielsen,et al.  Identification of known voices as a function of familiarity and narrow‐band coding , 1985 .

[25]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[26]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .