Spectro-temporal description of diphthongs in F1-F2-F3 space

Abstract The prevailing approach to the acoustic-phonetic description of the diphthong is based (1) on the two lowest vocal-tract resonance (or formant) frequencies (F1 and F2), considered either individually or jointly in the F1-F2 plane, and (2) on a very sparse representation of the temporal course of these frequencies. While this time-honoured approach has been particularly useful for characterising the initial and final vowels of the diphthong, there appears to have been very little progress beyond the F1-F2 plane as a parametric framework for elucidating the dynamic nature of the vowel-to-vowel transition. By contrast, a more accurate spectro-temporal description of a subset of the Australian English diphthongs ( ) is obtained in this work by considering a detailed temporal representation of the three lowest formant frequencies (F1, F2 and F3). In particular, certain nonlinearity features of the densely sampled F3 contour are highlighted, which appear to have hitherto been either unknown or considered inconsequential to the specification of the diphthong. This finding is shown to contribute a new, three-dimensional (F1-F2-F3) perspective on the acoustic characteristics of the vocalic transition of the diphthong.
