The long road from phonological knowledge to phonetic realization: An acoustic account of the temporal composition of Mandarin L2 English

Producing continuous speech in L2 is a challenging task. We accept that the composition of speech tempo involves multiple linguistic levels of contributions. We further hypothesize that respective contributions in the speech signal could be better accounted for through normalization of acoustic contributions, and examined the English phonetic inventory, the way stress type (primary, secondary and tertiary), boundary type (nonphrase final, continuation rise, final rise and final fall), as well as focus status (non-focus, function words, broad focus and narrow focus). Analyses of speech data of L1 vs. Mandarin L2 English not only verified the contribution of each factor examined, but also demonstrate in what explicit ways the temporal composition of Taiwan Mandarin L2 English differs from the L1 norm. In short, a discrepancy between linguistic awareness and phonetic execution leads to difficulties by lower level units such as segments and stress patterns; whereas higher level planning difficulties leads to deviations exhibited in boundary adjustments and realization of broad and narrow focus contrasts. We believe the results shed new light on temporal composition both L1 and L2 English, facilitate better understanding of tempo structure that can be directly applied to L2 teaching and computer aided training.

[1]  Ruili Wang,et al.  Automatic stress exaggeration by prosody modification to assist language learners perceive sentence stress , 2011, International Journal of Speech Technology.

[2]  Chiu-yu Tseng,et al.  Phonetic aspects of content design in AESOP (Asian English Speech cOrpus Project) , 2009, 2009 Oriental COCOSDA International Conference on Speech Database and Assessments.

[3]  Yi Xu,et al.  Speech melody as articulatorily implemented communicative functions , 2005, Speech Commun..

[4]  Shizuka Nakamura Analysis of Relationship between Duration Characteristics and Subjective Evaluation of English Speech by Japanese learners with regard to Contrast of the Stressed to the Unstressed , 2010 .

[5]  Chiu-yu Tseng,et al.  Comparison of English narrow focus production by L1 English, Beijing and Taiwan Mandarin speakers , 2012, 2012 International Conference on Speech Database and Assessments.

[6]  B. Thorén SWEDISH ACCENT - DURATION OF POST-VOCALIC CONSONANTS IN NATIVE SWEDES SPEAKING ENGLISH AND GERMAN , 2007 .

[7]  Niels O. Schiller,et al.  Vowel duration in English as a second language among Javanese learners , 2015, ICPhS.

[8]  Gérard Bailly,et al.  SFC: A trainable prosodic model , 2005, Speech Commun..

[9]  Sumio Ohno,et al.  Analysis and synthesis of fundamental frequency contours of Standard Chinese using the command-response model , 2005, Speech Commun..

[10]  Chiu-yu Tseng,et al.  Fluent speech prosody: Framework and modeling , 2005, Speech Commun..

[11]  Peggy Pik Ki Mok,et al.  Interlanguage influence in cues of narrow focus: A study of Hong Kong English , 2015, ICPhS.

[12]  Chiu-yu Tseng,et al.  Underdifferentiation of English lexical stress contrasts by L2 taiwan speakers , 2013, SLaTE.