Organizing syllables into groups - Evidence from F0 and duration patterns in Mandarin

This study investigated grouping-related F0 patterns in Mandarin by examining the effect of syllable position within a group while controlling for tone, speaking mode, number of syllables in a group, and group position in a sentence. We analyzed syllable duration, F0 displacement, the ratio of peak velocity to F0 displacement (vp/d ratio), and the shape of the F0 velocity profile (parameter C) in sequences of Rising, Falling and High tones. Syllable duration showed the most consistent grouping-related patterns: in a short phrase of 1-4 syllables, duration was longest in the final position, second longest in the initial position, and shortest in the medial positions. In Rising and Falling tone sequences, syllable duration was positively related to F0 displacement but negatively related to vp/d ratio. Sequences consisting of only the High tone, however, showed no duration-matching F0 variations. Modeling simulations with a second-order linear system showed that duration variations alone could generate F0 displacement and vp/d ratio variations comparable to those in the actual data. We interpret the results as evidence that grouping is encoded directly by syllable duration, while the corresponding variations in F0 displacement, vp/d ratio and velocity profile are consequences of duration control.
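The modeling claim can be illustrated with a minimal sketch. The abstract specifies only a second-order linear system. The critical damping, the stiffness value `omega`, the 6-semitone target, and the three syllable durations below are all illustrative assumptions, not the parameters used in the study. The sketch holds the system fixed and varies only the time available for the movement, then reports F0 displacement and vp/d ratio.

```python
import math

def simulate_f0_movement(duration_s, omega=20.0, start_st=0.0,
                         target_st=6.0, step_s=0.0005):
    """Critically damped second-order response toward a pitch target,
    truncated at duration_s. All parameter values are hypothetical.

    Closed form (zero initial velocity):
        x(t) = T + (x0 - T) * (1 + omega*t) * exp(-omega*t)
        v(t) = -omega**2 * (x0 - T) * t * exp(-omega*t)
    Returns (displacement in semitones, vp/d ratio in 1/s).
    """
    c1 = start_st - target_st
    n_steps = max(1, round(duration_s / step_s))
    peak_velocity = 0.0
    # Sample the velocity profile over the syllable to find its peak.
    for i in range(n_steps + 1):
        t = duration_s * i / n_steps
        velocity = -omega ** 2 * c1 * t * math.exp(-omega * t)
        peak_velocity = max(peak_velocity, abs(velocity))
    # Displacement is how far F0 has travelled when the syllable ends.
    end_st = target_st + c1 * (1.0 + omega * duration_s) * math.exp(-omega * duration_s)
    displacement = abs(end_st - start_st)
    return displacement, peak_velocity / displacement

# Hypothetical durations: medial (short), initial, final (long) syllables.
durations_s = (0.12, 0.18, 0.24)
results = [simulate_f0_movement(d) for d in durations_s]
for d, (disp, ratio) in zip(durations_s, results):
    print(f"duration={d:.2f}s  displacement={disp:.2f} st  vp/d={ratio:.2f} 1/s")
```

Under these assumptions, longer syllables yield larger displacement and a smaller vp/d ratio with no change to the target or the system itself, which is the qualitative relation reported for the Rising and Falling tone sequences.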
