Unsupervised estimation of the human vocal tract length over sentence level utterances

This paper describes a method for unsupervised, gender-independent estimation of the average human vocal tract length from the speech waveform. Results are reported on Fant's (1960) X-ray vowel data and on experiments with multiple sentence utterances from 86 male and 78 female TIMIT speakers, including correlation analyses between the vocal tract length estimates and the speakers' given body heights. The investigated error criteria that permit non-iterative, closed-form estimator solutions are all found to show good speaker clustering potential for both the male and female subgroups.
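To illustrate what a non-iterative, closed-form estimator can look like, the sketch below uses the textbook uniform lossless tube model (closed at the glottis, open at the lips), whose resonances are F_k = (2k - 1) c / (4 L). Minimizing the squared formant error with respect to 1/L gives a linear least-squares problem with a direct solution. This is an assumption-laden illustration, not the error criteria investigated in the paper: the function name `estimate_vtl_cm`, the speed-of-sound value, and the choice of criterion are all hypothetical.

```python
# Minimal sketch: closed-form vocal tract length (VTL) estimate from formants,
# assuming a uniform lossless tube. Not the paper's estimator.

SPEED_OF_SOUND_CM_S = 35000.0  # assumed value for warm, humid air in the tract


def estimate_vtl_cm(formants_hz, c=SPEED_OF_SOUND_CM_S):
    """Return a least-squares VTL estimate in cm from formants F1, F2, ... (Hz).

    Model: F_k = a_k * x, with a_k = (2k - 1) * c / 4 and x = 1 / L.
    Minimizing sum_k (F_k - a_k * x)^2 over x gives
    x = sum(a_k * F_k) / sum(a_k^2), hence L = sum(a_k^2) / sum(a_k * F_k).
    """
    if not formants_hz:
        raise ValueError("at least one formant frequency is required")
    # Model coefficients for the 1st, 2nd, ... resonance of the tube.
    a = [(2 * k - 1) * c / 4.0 for k in range(1, len(formants_hz) + 1)]
    numerator = sum(ak * ak for ak in a)
    denominator = sum(ak * fk for ak, fk in zip(a, formants_hz))
    if denominator <= 0:
        raise ValueError("formant frequencies must be positive")
    return numerator / denominator


if __name__ == "__main__":
    # Formants of an ideal 17.5 cm tube under the assumed speed of sound.
    print(estimate_vtl_cm([500.0, 1500.0, 2500.0]))
```

Because the model is linear in 1/L, no iteration or initial guess is needed. Real formants deviate from the uniform-tube pattern, so an estimate of this kind is only meaningful as an average over many frames or vowels.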

[1] Ronald W. Schafer, et al. Digital Processing of Speech Signals, 1978.

[2] J. Markel, et al. The SIFT algorithm for fundamental frequency estimation, 1972.

[3] Herbert Gish, et al. A parametric approach to vocal tract length normalization, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[4] Louis ten Bosch, et al. A novel feature transformation for vocal tract length normalization in automatic speech recognition, 1998, IEEE Trans. Speech Audio Process.

[5] Burhan F. Necioğlu. Objectively measured descriptors for perceptual characterization of speakers, 1999.

[6] R. Kirlin, et al. A posteriori estimation of vocal tract length, 1978.

[7] Gunnar Fant, et al. Acoustic Theory of Speech Production, 1960.

[8] Thomas P. Barnwell, et al. Perceptual relevance of objectively measured descriptors for speaker characterization, 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[9] Roderick P. Singh. Anatomy of hearing and speech, 1980.

[10] G. E. Peterson, et al. Control Methods Used in a Study of the Vowels, 1951.

[11] W. Fitch. Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques, 1997, The Journal of the Acoustical Society of America.

[12] A. Paige, et al. Calculation of vocal tract length, 1970.