Estimation of speaker's height and vocal tract length from speech signal

Estimating a speaker’s height and vocal tract length (VTL) from the speech signal has applications in forensics and automatic speech recognition. It has long been suggested that a speaker’s VTL correlates with the speaker’s height on the one hand and with formant frequencies on the other. Until recently, these putative relationships had been examined empirically only in studies with relatively small numbers of speakers. Scattered studies have reported intriguing results on the correlations between speaker height and various acoustic speech parameters. Because suitable databases are lacking, few studies have extensively compared a speaker’s actual VTL with the VTL estimated from the speech signal. This paper analyzes the correlations between various acoustic speech parameters and speaker height for a large number of speakers. It also presents a new method for optimally estimating speaker height and VTL from various acoustic speech parameters.
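To make the link between formant frequencies and VTL concrete, the sketch below uses the textbook uniform-tube approximation, in which a tube closed at the glottis and open at the lips resonates at F_n = (2n - 1)c / (4L). This is not the estimation method proposed in the paper, which the abstract does not describe. The function names, the speed-of-sound constant, and the example formant values are all illustrative assumptions.

```python
from math import sqrt
from statistics import mean

# Assumed speed of sound in the warm, humid air of the vocal tract (cm/s).
SPEED_OF_SOUND_CM_S = 35000.0


def vtl_from_formants(formants_hz):
    """Estimate vocal tract length (cm) from formants F1, F2, ... in Hz.

    Assumes a uniform lossless tube closed at one end, so that
    F_n = (2n - 1) * c / (4 * L). Each formant yields one estimate
    of L, and the estimates are averaged.
    """
    estimates = [
        (2 * n - 1) * SPEED_OF_SOUND_CM_S / (4.0 * f)
        for n, f in enumerate(formants_hz, start=1)
    ]
    return mean(estimates)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical neutral-vowel formants; under the assumptions above
# each of the three formants corresponds to a 17.5 cm tube.
example_vtl_cm = vtl_from_formants([500.0, 1500.0, 2500.0])
```

Given per-speaker VTL estimates and measured heights, `pearson_r(heights, vtl_estimates)` would give the kind of correlation the abstract refers to. Real vocal tracts are not uniform tubes, so estimates of this kind vary with the vowel being spoken.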
