Amplitude variations in coarticulated vowels.

This paper seeks to characterize the nature, size, and range of acoustic amplitude variation in naturally produced coarticulated vowels in order to determine its potential contribution and relevance to vowel perception. The study is a partial replication and extension of the pioneering work by House and Fairbanks [J. Acoust. Soc. Am. 22, 105-113 (1953)], who reported large variation in vowel amplitude as a function of consonantal context. Eight American English vowels spoken by men and women were recorded in ten symmetrical CVC consonantal contexts. Acoustic amplitude measures included overall rms amplitude, amplitude of the rms peak along with its relative location in the CVC-word, and the amplitudes of individual formants F1-F4 along with their frequencies. House and Fairbanks' amplitude results were not replicated: Neither the overall rms nor the rms peak varied appreciably as a function of consonantal context. However, consonantal context was shown to affect significantly and systematically the amplitudes of individual formants at the vowel nucleus. These effects persisted in the auditory representation of the vowel signal. Auditory spectra showed that the pattern of spectral amplitude variation as a function of contextual effects may still be encoded and represented at early stages of processing by the peripheral auditory system.

[1]  Eric W Healy,et al.  Measuring the critical band for speech. , 2006, The Journal of the Acoustical Society of America.

[2]  R. Miller Auditory Tests with Synthetic Vowels , 1951 .

[3]  B. Granström,et al.  Music and Hearing Quarterly Progress and Status Report Some studies concerning perception of isolated vowels , 2007 .

[4]  P. Mermelstein,et al.  On the relationship between vowel and consonant identification when cued by the same acoustic information , 1978, Perception & psychophysics.

[5]  Gunnar Fant,et al.  The voice source in connected speech , 1997, Speech Commun..

[6]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[7]  Jean-Luc Schwartz,et al.  Does the human auditory system include large scale spectral integration , 1987 .

[8]  A. House,et al.  The Influence of Consonant Environment upon the Secondary Acoustical Characteristics of Vowels , 1953 .

[9]  Ewa Jacewicz,et al.  Vowel Duration in Three American English Dialects. , 2007, American speech.

[10]  D. Fry Duration and Intensity as Physical Correlates of Linguistic Stress , 1954 .

[11]  J. Flanagan,et al.  Difference limen for formant amplitude. , 1957, The Journal of speech and hearing disorders.

[12]  Peter F. Assmann,et al.  Identification of children's and adults' vowels: intrinsic fundamental frequency, fundamental frequency dynamics, and presence of voicing , 2001, J. Phonetics.

[13]  G. E. Peterson,et al.  Duration of Syllable Nuclei in English , 1960 .

[14]  H. V. van Praag,et al.  The effect of milenperone on the aggressive behavior of psychogeriatric patients. A double-blind placebo-controlled study. , 1985, Neuropsychobiology.

[15]  T. M. Nearey,et al.  Effects of consonant environment on vowel formant patterns. , 1997, The Journal of the Acoustical Society of America.

[16]  C. Gobl,et al.  Contextual Variation of the Vowel Voice Source as a Function of Adjacent Consonants , 1993, Language and speech.

[17]  O. Aaltonen The effect of relative amplitude levels of F2 and F3 on the categorization of synthetic vowels , 1985 .

[18]  D. Whalen,et al.  The universality of intrinsic F0 of vowels , 1995 .

[19]  J B Millar,et al.  The Effect of Relative Formant Amplitude on the Perceived Identity of Synthetic Vowels , 1972, Language and speech.

[20]  R. van Hout,et al.  An acoustic description of the vowels of northern and southern standard Dutch II: regional varieties. , 2007, The Journal of the Acoustical Society of America.

[21]  W. A. Ainsworth,et al.  Duration as a Cue in the Recognition of Synthetic Vowels , 1972 .

[22]  Johan Liljencrants,et al.  The Source-Filter Frame of Prominence , 2000, Phonetica.

[23]  Agaath M. C. Sluijter,et al.  Spectral balance as an acoustic correlate of linguistic stress. , 1996, The Journal of the Acoustical Society of America.

[24]  M. Yano,et al.  On the effectiveness of whole spectral shape for vowel perception. , 2001, The Journal of the Acoustical Society of America.

[25]  Ashok Krishnamurthy,et al.  A perceptual auditory spectral centroid model , 1998 .

[26]  Qiguang Lin,et al.  Glottal source‐vocal tract acoustic interaction , 1987 .

[27]  Michael Kiefte,et al.  The relative importance of spectral tilt in monophthongs and diphthongs. , 2005, The Journal of the Acoustical Society of America.

[28]  Patrice Speeter Beddor,et al.  Perception of Temporal and Spectral Information in French Vowels , 1988, Language and speech.

[29]  D H Whalen,et al.  Vowel and consonant judgments are not independent when cued by the same information. , 1987, Perception & psychophysics.

[30]  Robert Allen Fox,et al.  VOWEL SPACE AREAS ACROSS DIALECTS AND GENDER , 2007 .

[31]  Johan Liljencrants,et al.  Formant‐Amplitude Measurements , 1963 .

[32]  W. Ainsworth Duration as a factor in the recognition of synthetic vowels , 1981 .

[33]  T. M. Nearey Static, dynamic, and relational properties in vowel perception. , 1989, The Journal of the Acoustical Society of America.

[34]  Ilse Lehiste,et al.  Vowel Amplitude and Phonemic Stress in American English , 1959 .

[35]  Johan Liljencrants,et al.  Acoustic-phonetic Analysis of Prominence in Swedish , 2000 .

[36]  E. Jacewicz Listener sensitivity to variations in the relative amplitude of vowel formants , 2005 .