Automatically rating pronunciation through articulatory phonology

Articulatory Phonology links cognitive speech planning to the physical realization of vocal tract constrictions. This link has implications for acoustic and duration modeling of speech that should be useful for automatically predicting subjective ratings of pronunciation quality for nonnative speech. In this work, we compare traditional phoneme models used in automatic speech recognition with analogous models of articulatory gestural pattern vectors, each paired with an associated duration model. On the CDT corpus, the gestural models outperform the phoneme-level baseline in terms of correlation with listener ratings, and the combination of phoneme and gestural models outperforms either one alone. This result also validates previous findings obtained with a similar (but not gesture-based) pseudo-articulatory representation.

Index Terms: pronunciation modeling, nonnative speech, articulatory phonology
