Age-and Gender-Dependent Analysis of Voice Source Characteristics

The effects of age, gender, and vocal tract configurations on the glottal excitation signal are still only partially understood. In this paper we examine some of these effects, and show that the voice source parameters, such as fundamental frequency (F<sub>o</sub>), open quotient (related to H*<sub>1</sub> - H*<sub>2</sub>), and spectral tilt (related to H*<sub>1</sub> - A*<sub>3</sub>) are not only affected by age and gender but are also intercorrelated (the asterisk superscript denotes correction for the influence of various formants). Recordings of 92 male and female speakers from three age groups (8, 15, 20-39) are analyzed. The main observations are: for low-pitched talkers H*<sub>1</sub> - H* <sub>2</sub> (hence, the open quotient) is proportional to F<sub>o</sub>, while for high-pitched talkers H*<sub>1</sub> $H*<sub>2 </sub> is proportional to F<sub>1</sub> (high to low vowels) for F<sub>1 </sub> < 700 Hz. The parameter H*<sub>1</sub> - A*<sub>3</sub> showed a strong dependence on F<sub>2</sub> and F<sub>3</sub> for all talkers and age groups: increasing F<sub>2</sub> or F<sub>3</sub> yielded an increase in H*<sub>1</sub> - A*<sub>3</sub>. Spectral tilt was seen to be vowel dependent and for male talkers, spectral tilt changed dramatically with age. A better understanding of the dependencies of voice source parameters on age and gender will help improve voice source parameter estimation and analysis for a variety of speech processing and medical applications

[1]  Gautam K. Vallabha,et al.  Systematic errors in the formant analysis of steady-state vowels , 2002, Speech Commun..

[2]  W. Fitch,et al.  Voice instabilities due to source-tract interactions , 2006 .

[3]  Abeer Alwan,et al.  An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  J.J.M. Koreman,et al.  Decoding linguistic information in the glottal airflow , 1996 .

[5]  Robert H. Mannell Formant diphone parameter extraction utilising a labelled single-speaker database , 1998, ICSLP.

[6]  Sungbok Lee,et al.  Creation of two children's speech databases , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  Krzysztof Marasek,et al.  Glottal correlates of the word stress and the tense/lax opposition in German , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[8]  E. Chuang,et al.  Glottal characteristics of male speakers: acoustic correlates and comparison with female data. , 1996, The Journal of the Acoustical Society of America.

[9]  Gunnar Fant,et al.  Voice source properties of the speech code , 1996 .

[10]  G. E. Peterson,et al.  Some Basic Considerations in the Analysis of Intonation , 1960 .

[11]  Shrikanth S. Narayanan,et al.  Acoustics of children's speech: developmental changes of temporal and spectral parameters. , 1999, The Journal of the Acoustical Society of America.

[12]  Helen M. Hanson,et al.  Glottal characteristics of male speakers: acoustic correlates and comparison with female data. , 1996 .

[13]  J. Perkell,et al.  Comparisons among aerodynamic, electroglottographic, and acoustic spectral measures of female voice. , 1995, Journal of speech and hearing research.