Investigating Pitch Accent Recognition in Non-native Speech

Acquisition of prosody, in addition to vocabulary and grammar, is essential for language learners. However, it has received less attention in instruction. To enable automatic identification and feedback on learners' prosodic errors, we investigate automatic pitch accent labeling for non-native speech. We demonstrate that an acoustic-based context model can achieve accuracies over 79% on binary pitch accent recognition when trained on within-group data. Furthermore, we demonstrate that good accuracies are achieved in cross-group training, where native and near-native training data result in no significant loss of accuracy on non-native test speech. These findings illustrate the potential for automatic feedback in computer-assisted prosody learning.

[1]  Yi Xu,et al.  Maximum speed of pitch change and how it may relate to speech. , 2002, The Journal of the Acoustical Society of America.

[2]  Kristin Precoda,et al.  Prosodic features for automatic text-independent evaluation of degree of nativeness for language learners , 2000, INTERSPEECH.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Dorothy M. Chun SIGNAL ANALYSIS SOFTWARE FOR TEACHING DISCOURSE INTONATION , 1998 .

[5]  Shrikanth S. Narayanan,et al.  Exploiting Acoustic and Syntactic Features for Prosody Labeling in a Maximum Entropy Framework , 2007, HLT-NAACL.

[6]  Ulrike Gut Non-native Speech: A Corpus-based Analysis of Phonological and Phonetic Properties of L2 English and German , 2009 .

[7]  Gina-Anne Levow,et al.  Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features , 2008, IJCNLP.

[8]  Shrikanth S. Narayanan,et al.  Better nonnative intonation scores through prosodic theory , 2008, INTERSPEECH.

[9]  P. Bayerl,et al.  Measuring the reliability of manual annotations of speech corpora , 2004, Speech Prosody 2004.

[10]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[11]  Xuejing Sun,et al.  Pitch accent prediction using ensemble machine learning , 2002, INTERSPEECH.

[12]  Ulrike Gut,et al.  Non-native Speech , 2009 .

[13]  Ulrike Gut,et al.  A Prosodic Corpus of Non-Native Speech , 2002 .

[14]  Stephanie Seneff,et al.  Improved tone recognition by normalizing for coarticulation and intonation effects , 2000, INTERSPEECH.