Duration and spectral balance of intervocalic consonants: A case for efficient communication

Abstract The prosodic structure of speech and the redundancy of words can significantly strengthen or weaken segmental articulation. This paper investigates the acoustic effects of lexical stress, intra-word location, and predictability on sentence internal intervocalic consonants from accented words, using meaningful reading materials from 4157 sentences read by two American English speakers. Consonant duration and spectral balance in such reading materials show reduction in unstressed consonants and in consonants occurring later in the word (Initial vs. Medial vs. Final). Coronal consonants behaved distinctly, which was interpreted as a shift from full to flap or tap articulation in a subset of the phoneme realizations. This shift in articulation, and part of the consonant specific acoustic variation, could be linked to the frequency distribution of consonant classes over the investigated conditions. A higher frequency of occurrence of a consonant class in our corpus and a CELEX word-list was associated with shorter durations and differences in spectral balance that would increase the communicative efficiency of speech.

[1]  D Granville,et al.  Stochastic Suprasegmentals: Relationships between Redundancy, Prosodic Structure and Syllabic Duration , 1999 .

[2]  Alice Turk,et al.  The Smooth Signal Redundancy Hypothesis: A Functional Explanation for Relationships between Redundancy, Prosodic Prominence, and Duration in Spontaneous Speech , 2004, Language and speech.

[3]  K. D. Jong The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation , 1995 .

[4]  Alice Turk,et al.  The domain of accentual lengthening in American English , 1997 .

[5]  J. V. Santen Exploring N -way tables with sums-of-products models , 1993 .

[6]  Florien J. van Beinum,et al.  Efficiency as an organizing principle of natural speech , 1998, ICSLP.

[7]  Louis C. W. Pols,et al.  An acoustic description of consonant reduction , 1999, Speech Commun..

[8]  Colin W. Wightman,et al.  Segmental durations in the vicinity of prosodic phrase boundaries. , 1992, The Journal of the Acoustical Society of America.

[9]  Louis C. W. Pols,et al.  The correlation between consonant identification and the amount of acoustic consonant reduction , 1997, EUROSPEECH.

[10]  Vincent J. van Heuven,et al.  Intensity and vocal effort as cues in the perception of stress , 1995, EUROSPEECH.

[11]  Anne Cutler,et al.  The predominance of strong initial syllables in the English vocabulary , 1987 .

[12]  Dani Byrd,et al.  Intragestural dynamics of multiple prosodic boundaries , 1998 .

[13]  Björn Lindblom,et al.  Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .

[14]  Louis C. W. Pols,et al.  Information structure and efficiency in speech production , 2003, INTERSPEECH.

[15]  Louis C. W. Pols,et al.  An acoustic model of communicative efficiency in consonants and vowels taking into account context distinctiveness , 2003 .

[16]  Jan P. H. van Santen,et al.  Contextual effects on vowel duration , 1992, Speech Commun..

[17]  Agaath M. C. Sluijter,et al.  Spectral balance as an acoustic correlate of linguistic stress. , 1996, The Journal of the Acoustical Society of America.

[18]  H M Hanson,et al.  Glottal characteristics of female speakers: acoustic correlates. , 1997, The Journal of the Acoustical Society of America.

[19]  J. V. Santen,et al.  The analysis of contextual effects on segmental duration , 1990 .

[20]  Florien J. van Beinum,et al.  Vowel reduction and stress , 1987, Speech Commun..

[21]  M. Beckman,et al.  The Interplay Between Prosodic Structure and Coarticulation , 1993, Language and speech.

[22]  L P Shapiro,et al.  "How to milk a coat:" the effects of semantic and acoustic information on phoneme categorization. , 1998, The Journal of the Acoustical Society of America.

[23]  M. Tabain Effects of prosodic boundary on /aC/ sequences: acoustic results. , 2003, The Journal of the Acoustical Society of America.

[24]  X. Wang,et al.  Incorporating Knowledge on Segmental Duration in 'HMM'-based Continuous Speech Recognition , 1997 .

[25]  Marianne Pouplier,et al.  The Dynamics of Error , 2003 .

[26]  Louis C. W. Pols,et al.  Evidence for efficiency in vowel production , 2002, INTERSPEECH.

[27]  David M. Rocke Analysis of Experiments With Missing Data , 1987 .

[28]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .

[29]  V. V. van Heuven,et al.  Spectral balance as a cue in the perception of linguistic stress. , 1997, The Journal of the Acoustical Society of America.

[30]  D. Kemmerer,et al.  Phonotactics and Syllable Stress: Implications for the Processing of Spoken Nonsense Words , 1997, Language and speech.

[31]  V. Zue,et al.  Acoustic study of medial /t,d/ in American English , 1979 .

[32]  P. Keating,et al.  Articulatory strengthening at edges of prosodic domains. , 1997, The Journal of the Acoustical Society of America.

[33]  Louis C. W. Pols,et al.  An acoustic profile of consonant reduction , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[34]  Alice Turk,et al.  The American English Flapping Rule and the Effect of Stress on Stop Consonant Durations , 1992 .

[35]  Louis C. W. Pols,et al.  Frequency effects on vowel reduction in three typologically different languages (dutch, finish, Russian) , 2004, INTERSPEECH.

[36]  Louis C. W. Pols,et al.  Incorporating knowledge on segmental duration in HMM-based continuous speech recognition , 1997 .

[37]  A. Jongman,et al.  Acoustic characteristics of English fricatives. , 2000, The Journal of the Acoustical Society of America.

[38]  Ann Cutler,et al.  Prosody in the Comprehension of Spoken Language: A Literature Review , 1997, Language and speech.

[39]  Louis C. W. Pols,et al.  Effects of stress and lexical structure on speech efficiency , 1999, EUROSPEECH.

[40]  Richard Shillcock,et al.  Proceedings of EUROSPEECH-1991. , 1991 .

[41]  L. Pols,et al.  Formant movements of Dutch vowels in a text, read at normal and fast rate. , 1992, The Journal of the Acoustical Society of America.

[42]  René Carré,et al.  Locus equations in the light of articulatory modeling , 1997 .

[43]  M. Fourakis,et al.  Tempo, stress, and vowel reduction in American English. , 1991, The Journal of the Acoustical Society of America.

[44]  Edda Farnetani The spatial and the temporal dimensions of consonant reduction in conversational Italian , 1995, EUROSPEECH.

[45]  E. Chuang,et al.  Glottal characteristics of male speakers: acoustic correlates and comparison with female data. , 1996, The Journal of the Acoustical Society of America.

[46]  D. V. Bergem Acoustic and Lexical Vowel Reduction , 1995 .

[47]  A. House On Vowel Duration in English , 1961 .

[48]  P. Lieberman Some Effects of Semantic and Grammatical Context on the Production and Perception of Speech , 1963 .

[49]  Richard Sproat,et al.  Multilingual Text-to-Speech Synthesis: The Bell Labs Approach , 1998, CL.

[50]  Louis C. W. Pols,et al.  Formant frequencies of Dutch vowels in a text, read at normal and fast rate , 1990 .

[51]  Jan P. H. van Santen,et al.  Strong interaction between factors influencing consonant duration , 1997, EUROSPEECH.

[52]  N. Umeda Vowel duration in American English. , 1975, The Journal of the Acoustical Society of America.

[53]  Matthew P. Aylett,et al.  Stochastic suprasegmentals: relationships between redundancy, prosodic structure and care of articulation in spontaneous speech , 2000, INTERSPEECH.

[54]  Jan P. H. van Santen Timing in text-to-speech systems , 1993, EUROSPEECH.

[55]  K. D. de Jong The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation. , 1995, The Journal of the Acoustical Society of America.

[56]  Koopmans-Van Beinum,et al.  Vowel contrast reduction : an acoustic and perceptual study of Dutch vowels in various speech conditions , 1980 .

[57]  Stefanie Shattuck-Hufnagel,et al.  Word-boundary-related duration patterns in English , 2000, J. Phonetics.