Identification of synthetic vowels based on selected vocal tract area functions.

The purpose of this study was to determine the degree to which synthetic vowel samples based on previously reported vocal tract area functions of eight speakers could be accurately identified by listeners. Vowels were synthesized with a wave-reflection type of vocal tract model coupled to a voice source. A particular vowel was generated by specifying an area function that had been derived from previous magnetic resonance imaging based measurements. The vowel samples were presented to ten listeners in a forced choice paradigm in which they were asked to identify the vowel. Results indicated that the vowels [i], [ae], and [u] were identified most accurately for all of speakers. The identification errors of the other vowels were typically due to confusions with adjacent vowels.

[1]  Brad H Story Time dependence of vocal tract modes during production of vowels and vowel sequences. , 2007, The Journal of the Acoustical Society of America.

[2]  Brad H Story,et al.  Comparison of magnetic resonance imaging-based vocal tract area functions obtained from the same speaker in 1994 and 2002. , 2008, The Journal of the Acoustical Society of America.

[3]  S. Nittrouer Dynamic spectral structure specifies vowels for children and adults. , 2007, The Journal of the Acoustical Society of America.

[4]  I R Titze,et al.  Vocal tract area functions for an adult female speaker based on volumetric imaging. , 1998, The Journal of the Acoustical Society of America.

[5]  J Hillenbrand,et al.  Identification of steady-state vowels synthesized from the Peterson and Barney measurements. , 1993, The Journal of the Acoustical Society of America.

[6]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969, The Journal of the Acoustical Society of America.

[7]  J. Jenkins,et al.  Identification of vowels in “vowelless” syllables , 1983, Perception & psychophysics.

[8]  David M. Howard,et al.  Real-Time Dynamic Articulations in the 2-D Waveguide Mesh Vocal Tract Model , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Brad H Story,et al.  Synergistic modes of vocal tract articulation for American English vowels. , 2005, The Journal of the Acoustical Society of America.

[10]  D. Broadbent,et al.  Information Conveyed by Vowels , 1957 .

[11]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[12]  René Carré,et al.  Perception of Vowel-to-Vowel Transitions with Different Formant Trajectories , 2001, Phonetica.

[13]  J. Jenkins,et al.  Dynamic specification of coarticulated vowels. , 1983, The Journal of the Acoustical Society of America.

[14]  James M Hillenbrand,et al.  Open source software for experiment design and control. , 2005, Journal of speech, language, and hearing research : JSLHR.

[15]  T. M. Nearey Static, dynamic, and relational properties in vowel perception. , 1989, The Journal of the Acoustical Society of America.

[16]  W. Strange,et al.  Dynamic specification of coarticulated vowels spoken in sentence context. , 1989, The Journal of the Acoustical Society of America.

[17]  J. Hillenbrand,et al.  Some effects of duration on vowel recognition. , 2000, The Journal of the Acoustical Society of America.

[18]  B. Story A parametric model of the vocal tract area function for vowel and consonant simulation. , 2005, The Journal of the Acoustical Society of America.

[19]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[20]  Brad H Story,et al.  Technique for "tuning" vocal tract area functions based on acoustic sensitivity functions. , 2006, The Journal of the Acoustical Society of America.