On the acoustic correlates of high and low nuclear pitch accents in American English

Earlier findings in Shue et al. (2007, 2008) raised questions about the alignment of nuclear pitch accents in American English, which are addressed here by eliciting both high and low pitch accents in two different target words in several different positions in a single-phrase utterance (early, late but not final, and final) from 20 speakers (10 male, 10 female). Results show that the F"0 peak associated with a high nuclear pitch accent is systematically displaced to an earlier point in the target word if that word is final in the phrase and thus bears the boundary-related tones as well. This effect of tonal crowding holds across speakers, genders and target words, but was not observed for low accents, adding to the growing evidence that low targets behave differently from highs. Analysis of energy shows that, across target words and genders, the average energy level of a target word is greatest at the start of an utterance and decreases with increasing proximity to the utterance boundary. Duration measures confirms the findings of existing literature on main-stress-syllable lengthening, final syllable lengthening, and lengthening associated with pitch accents, and reveals that final syllable lengthening is further enhanced if the final word also carries a pitch accent. Individual speaker analyses found that while most speakers conformed to the general trends for pitch movements there were 2/10 male and 1/10 female speakers who did not. These results show the importance of taking into account prosodic contexts and speaker variability when interpreting correlates to prosodic events such as pitch accents.

[1]  Martine Grice,et al.  Sources of variation in tonal alignment: Evidence from acoustic and kinematic data , 2009, J. Phonetics.

[2]  D. Klatt,et al.  Structure of a phonological rule component for a synthesis-by-rule program , 1976 .

[3]  Laurence White,et al.  Structural influences on accentual lengthening in English , 1999 .

[4]  Francis Nolan,et al.  Pitch accent realization in four varieties of British English , 2000, J. Phonetics.

[5]  J. A. Edwards,et al.  Talking data : transcription and coding in discourse research , 1995 .

[6]  Stefanie Shattuck-Hufnagel,et al.  Glottalization of word-initial vowels as a function of prosodic structure , 1996 .

[7]  Vincent J. van Heuven,et al.  Acoustic correlates of linguistic stress and accent in Dutch and American English , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[8]  Yi Xu,et al.  Maximum speed of pitch change and how it may relate to speech. , 2002, The Journal of the Acoustical Society of America.

[9]  Robert Krones,et al.  A Study of Larynx Height in Speech Using the Thyro-Umbrometer , 1973 .

[10]  Janet Slifka,et al.  Some physiological correlates to regular and irregular phonation at the end of an utterance. , 2006, Journal of voice : official journal of the Voice Foundation.

[11]  Stefanie Shattuck-Hufnagel,et al.  A prosody tutorial for investigators of auditory sentence processing , 1996, Journal of psycholinguistic research.

[12]  Johan Sundberg,et al.  Maximum speed of pitch changes in singers and untrained subjects , 1979 .

[13]  J. Pierrehumbert The phonology and phonetics of English intonation , 1987 .

[14]  Amalia Arvaniti,et al.  Dialectal variation in the rising accents of American English , 2007 .

[15]  Abeer Alwan,et al.  Effects of intonational phrase boundaries on pitch-accented syllables in american English , 2008, INTERSPEECH.

[16]  Abeer Alwan,et al.  Pitch accent versus lexical stress: quantifying acoustic measures related to the voice source , 2007, INTERSPEECH.

[17]  Kim E. A. Silverman,et al.  The timing of prenuclear high accents in English , 1987 .

[18]  Julia Hirschberg,et al.  On the correlation between energy and pitch accent in read English speech , 2006, INTERSPEECH.

[19]  I. Titze Physiologic and acoustic differences between male and female voices. , 1989, The Journal of the Acoustical Society of America.

[20]  D. Robert Ladd,et al.  Phonetic effects of focus and "tonal crowding" in intonation: Evidence from Greek polar questions , 2006, Speech Commun..

[21]  Janet Pierrehumbert,et al.  Gesture, Segment, Prosody: Lenition of |h| and glottal stop , 1992 .

[22]  John Kingston,et al.  Papers in Laboratory Phonology: Index of names , 1990 .

[23]  John J. Ohala,et al.  Speed of Pitch Change , 1973 .

[24]  J. Pierrehumbert,et al.  Intonational structure in Japanese and English , 1986, Phonology.

[25]  Roy D. Patterson,et al.  An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite , 1998, ICSLP.

[26]  Agaath M. C. Sluijter,et al.  Spectral balance as an acoustic correlate of linguistic stress. , 1996, The Journal of the Acoustical Society of America.

[27]  B. Rosner,et al.  Loudness predicts prominence: fundamental frequency lends little. , 2005, The Journal of the Acoustical Society of America.

[28]  Abeer Alwan,et al.  Age, sex, and vowel dependencies of acoustic measures related to the voice source. , 2007, The Journal of the Acoustical Society of America.

[29]  Cecilia Odé,et al.  Neutralization or truncation? The perception of two Russian pitch accents on utterance-final syllables , 2005, Speech Commun..

[30]  C. Fougeron,et al.  Rate effects on French intonation: prosodic organization and phonetic realization , 1998 .

[31]  D. Klatt Linguistic uses of segmental duration in English: acoustic and perceptual evidence. , 1976, The Journal of the Acoustical Society of America.

[32]  Julia Hirschberg,et al.  The intonational Structuring of Discourse , 1986, ACL.

[33]  Matthias Jilka,et al.  The influence of vowel quality features on peak alignment , 2007, INTERSPEECH.

[34]  Stefanie Shattuck-Hufnagel,et al.  Multiple targets of phrase-final lengthening in American English words , 2007, J. Phonetics.