On the reliability of overall intensity and spectral emphasis as acoustic correlates of focal accents in Swedish

This study shows that increases in overall intensity and spectral emphasis are reliable acoustic correlates of focal accents in Swedish. Both are reliable in two senses. First, there are statistically significant differences between focally accented and nonfocal words for a variety of words, in any position in the phrase, and for all speakers in the analyzed materials. Second, both measures are useful for automatic detection of focal accents. Spectral emphasis turns out to be the more reliable correlate of the two: it was less influenced by position in the phrase, word accent, and vowel height, and it proved a better predictor of focal accents both in general and for a majority of the speakers. Finally, the study provides data on overall intensity and spectral emphasis that might prove important in modeling for speech synthesis.
