Intonation-based classification of language proficiency using FDA

State-of-the-art pronunciation tutoring (CAPT) systems are based on ASR technology. Consequently, they can provide a distinguished learning feedback which is focused on phonetic features and the positions of articulation errors. In contrast with the relative success with segmental errors, the acquisition and assessment of second language (L2) prosody is still a challenging problem. Although prosodic parameters like f0 contour or duration measures are usually displayed, the consequential evaluation components are generally missing. Considering the strong variation in speech data, functional data analysis (FDA) is a useful concept which statistically analyses interrelations between principal components (e.g., given accentuation) and their contribution to superimposed forms (e.g., resultingf0 contour). This article describes baseline processing and preliminary results of a pilot study on the intonation-based proficiency classification of German by using FDA methods. The experimental part contains the FDA-based classification results compared to a perceptual classification by German natives. Index Terms: L2 prosody, proficiency, functional data analysis

[1]  Ming Zhou,et al.  Detection of Non-Native Sentences Using Machine-Translated Training Data , 2007, HLT-NAACL.

[2]  Michael Gamon,et al.  A Machine Learning Approach to the Automatic Evaluation of Machine Translation , 2001, ACL.

[3]  P. Boersma Praat : doing phonetics by computer (version 5.1.05) , 2009 .

[4]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[5]  E. Nöth,et al.  Automatic Assessment of Non-Native Prosody for English as L 2 , 2010 .

[6]  Steve J. Young,et al.  Phone-level pronunciation scoring and assessment for interactive language learning , 2000, Speech Commun..

[7]  Shrikanth S. Narayanan,et al.  Better nonnative intonation scores through prosodic theory , 2008, INTERSPEECH.

[8]  Brian North,et al.  Common European Framework of Reference for Languages: learning, teaching, assessment , 2009 .

[9]  Irina Illina,et al.  Foreign accent identification based on prosodic parameters , 2008, INTERSPEECH.

[10]  Lou Boves,et al.  Functional data analysis as a tool for analyzing speech dynamics - a case study on the French word c'était , 2009, INTERSPEECH.

[11]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[12]  Lou Boves,et al.  Joint analysis of f0 and speech rate with Functional Data Analysis , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Rüdiger Hoffmann,et al.  Pronunciation Learning and Foreign Accent Reduction by an Audiovisual Feedback System , 2005, ACII.

[14]  Isabel Trancoso,et al.  A nativeness classifier for TED Talks , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Elmar Nöth,et al.  The Automatic Assessment of Non-native Prosody: Combining Classical Prosodic Analysis with Acoustic Modelling , 2012, INTERSPEECH.

[17]  Marie Frei,et al.  Functional Data Analysis With R And Matlab , 2016 .

[18]  Takehiko Yoshimi,et al.  Automatic Classification of Language Learner Sentences into Native-like or Non-Native-like Based on Word Alignment Distribution , 2009 .

[19]  Prosodic Analysis of German Produced by Russian and Chinese Learners , 2010 .

[20]  Nigel Ward Automatic discovery of simply-composable prosodic elements , 2014 .

[21]  Hongwei Ding,et al.  F 0 Analysis of Chinese Accented German Speech , 2006 .

[22]  Elmar Nöth,et al.  How many labellers revisited - naïves, experts, and real experts , 2011, SLaTE.