Voice quality and f0 in prosody: towards a holistic account

This paper discusses the role of voice quality in prosody. Illustrations from the authors' past production and perception data indicate that source parameters other than f0 are an inherent part of prosody, implicated in both its linguistic and paralinguistic functions. While prosodic (intonational) analyses of a language tend to be presented largely in terms of f0 dynamics, the argument here is for an integrative approach, in which f0 and voice quality, two dimensions of the voice source, are treated together and related to the temporal/rhythmic structure of utterances. This should yield a fuller understanding of the nature of prosody and of the underlying production and perceptual correlates of prosodic elements such as pitch accent, declination, focus and phrase boundaries. Such an approach may also serve to bring together the currently fragmented accounts of two core aspects of prosodic functioning: its role (i) in signalling linguistic, contrastive and discourse-related information and (ii) in communicating speaker affect, i.e. mood, emotional state and attitude. While the illustrations presented here provide initial hypotheses, a newly initiated project on Irish prosody will seek to incorporate such a holistic approach to prosodic analysis.
