Vowel Normalization. A Perceptual acoustic study of Dutch Vowels

In sociolinguistics, language variation in vowel sounds is typically studied using phonetic transcription. Phonetic transcription is carried out by expert listeners, who are capable of perceptually separating (socio-) linguistic variation from anatomical/physiological speaker-related characteristics. However, phonetic transcription is a very laborious task and its reliability is questionable. In phonetics, vowel normalization procedures were designed to separate linguistic variation from anatomical/physiological variation in acoustic measurements. In this thesis, it is evaluated whether vowel normalization procedures are suitable for use in sociolinguistics, for studying language variation in vowels. It is described how 12 procedures for vowel normalization were compared. This comparison consists of two parts. First, the procedures were evaluated in the acoustic domain by applying them to acoustic (formant and fundamental frequency) measurements of a database of vowels pronounced by 160 Dutch teachers from the Netherlands and Flanders. The procedures were evaluated on how well they preserved (socio-) linguistic variation and minimized anatomical/physiological variation. Second, the normalization procedures were compared in a perceptual-acoustic comparison; a comparison with perceptual judgments made by expert listeners. The normalization procedures were applied to the acoustic measurements of vowel tokens from 20 of the 160 Dutch teachers. These tokens were also judged by expert listeners on their perceived (socio-) linguistic characteristics, i.e., tongue height, tongue position, and lip rounding in a listening experiment. Next, the transformed acoustic measurements were compared to the perceptual judgments using linear regression analysis, to evaluate how well the output of the normalization procedures modeled the perceived (socio-) linguistic characteristics of the vowel tokens. These two comparisons of the 12 normalization procedures show that procedures that incorparate information (e.g., the mean or standard deviation of the formant measurements) about other vowels produced by a speaker are most succesful in the acoustic evaluation as well as in the perceptual-acoustic comparison

[1]  P. Ladefoged,et al.  Generating vocal tract shapes from formant frequencies. , 1978, The Journal of the Acoustical Society of America.

[2]  Hartmut R. Pfitzinger,et al.  An IPA vowel diagram approach to analysing L1 effects on vowel production and perception , 2002, INTERSPEECH.

[3]  T. M. Nearey Phonetic feature systems for vowels , 1978 .

[4]  L. Lisker,et al.  Auditory and Visual Cueing of the [±Rounded] Feature Of Vowels , 1992, Language and speech.

[5]  Hynek Hermansky,et al.  Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain , 1985, Speech Commun..

[6]  Vowel labelling consistency as a measure of familiarity with the phonetic code of a language or dialect , 1989 .

[7]  J. Mullennix,et al.  Some effects of talker variability on spoken word recognition. , 1989, The Journal of the Acoustical Society of America.

[8]  H. Wakita Normalization of vowels by vocal-tract length and its application to vowel identification , 1977 .

[9]  D. Klatt,et al.  Phonetic Ability and Related Anatomy of the Newborn and Adult Human, Neanderthal Man, and the Chimpanzee , 1972 .

[10]  B. Lobanov Classification of Russian Vowels Spoken by Different Speakers , 1971 .

[11]  Keith Johnson,et al.  Contrast and normalization in vowel perception , 1990 .

[12]  D. Broadbent,et al.  Vowel judgements and adaptation level , 1960, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[13]  Gordon E. Peterson,et al.  The Representation of Vowels and Their Movements , 1948 .

[14]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[15]  J. C. Steinberg,et al.  Toward the Specification of Speech , 1950 .

[16]  Rebecca E. Eilers,et al.  Phonetic Expectation and Transcription Validity , 1975, Phonetica.

[17]  R.W.N.M. van Hout,et al.  De uitspraak van het Standaard-Nederlands: variatie en varianten in Vlaanderen en Nederland , 1999 .

[18]  O. Aaltonen The effect of relative amplitude levels of F2 and F3 on the categorization of synthetic vowels , 1985 .

[19]  Terrance M. Nearey,et al.  Evaluation of a strategy for automatic formant tracking , 2002 .

[20]  H. Fujisaki,et al.  The roles of pitch and higher formants in the perception of vowels , 1968 .

[21]  R.A.M.G. van Bezooijen,et al.  Het Standaard Nederlands: een verkennend empirisch onderzoek , 1995 .

[22]  R. Plomp,et al.  Frequency analysis of Dutch vowels from 50 male speakers. , 1973, The Journal of the Acoustical Society of America.

[23]  T. M. Nearey Static, dynamic, and relational properties in vowel perception. , 1989, The Journal of the Acoustical Society of America.

[24]  E. Zwicker,et al.  Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency , 1980 .

[25]  C. Watson,et al.  Formant-frequency discrimination for isolated English vowels. , 1994, The Journal of the Acoustical Society of America.

[26]  D. Weenink CATEGORY ART : A VARIATION ON ADAPTIVE RESONANCE THEORY NEURAL NETWORKS , 1997 .

[27]  Pierre Delattre,et al.  Some Factors of Vowel Duration and Their Cross‐Linguistic Validity , 1962 .

[28]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[29]  Jennifer Coates,et al.  Some problems in the sociolinguistic explanation of sex differences , 1985 .

[30]  Perceptual Tolerances of Isolated Dutch Vowels , 1963 .

[31]  Roeland van Hout,et al.  A comparison between human vowel normalization strategies and acoustic vowel transformation techniques , 2001, INTERSPEECH.

[32]  H. S. Gopal,et al.  A perceptual model of vowel recognition based on the auditory representation of American English vowels. , 1986, The Journal of the Acoustical Society of America.

[33]  E. Fischer‐Jorgensen,et al.  Perceptual Dimensions of Vowels , 1968 .

[34]  D. Broadbent,et al.  Information Conveyed by Vowels , 1957 .

[35]  Dennis H. Klatt,et al.  Prediction of perceived phonetic distance from critical-band spectra: A first step , 1982, ICASSP.

[36]  Terrance M. Nearey Applications of generalized linear modeling to vowel data , 1992, ICSLP.

[37]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[38]  B. Repp Phonetic trading relations and context effects: new experimental evidence for a speech mode of perception. , 1982, Psychological bulletin.

[39]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.

[40]  H. Traunmüller Perceptual dimension of openness in vowels. , 1981, The Journal of the Acoustical Society of America.

[41]  Louis C. W. Pols,et al.  Modelling speaker normalization by adapting the BIAS in a neural net , 1993, EUROSPEECH.

[42]  W. Labov Principles of Linguistic Change: Internal Factors , 1994 .

[43]  Ann K. Syrdal,et al.  Aspects of a model of the auditory representation of american english vowels , 1985, Speech Commun..

[44]  B. Lindblom,et al.  Acoustical consequences of lip, tongue, jaw, and larynx movement. , 1970, The Journal of the Acoustical Society of America.

[45]  Vincent J. van Heuven,et al.  Uitspraak van Nederlandse klinkers in Noordelijk Standaardnederlands en in Zuid-Limburg: een akoestische en perceptieve studie , 1999 .

[46]  A. Maynard Engebretson,et al.  Vowel normalization: Differences between vowels spoken by children, women, and men , 1980 .

[47]  O. Fujimura,et al.  On the Second Spectral Peak of Front Vowels: a Perceptual Study of the Role of the Second and Third Formants , 1967, Language and speech.

[48]  P. Lieberman,et al.  Fundamental frequency and vowel perception. , 1982, The Journal of the Acoustical Society of America.

[49]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[50]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .

[51]  J. Laver,et al.  Variability in Vowel Perception , 1965 .

[52]  W. Labov The intersection of sex and social class in the course of linguistic change , 1990, Language Variation and Change.

[53]  H. P. Jørgensen,et al.  Die gespannten und ungespannten Vokale in der norddeutschen Hochsprache mit einer spezifischen Untersuchung der Struktur ihrer Formantenfrequenzen , 1969 .