Intrinsic normalization and extrinsic denormalization of formant data of vowels

A known speaker-intrinsic normalization procedure scales formant data by the reciprocal of the geometric mean of the first three formant frequencies. This reduces the influence of the talker but distorts the vowel space. The proposed speaker-extrinsic procedure rescales the normalized values by the mean formant values of vowels. When tested on the vowel formant data published by Peterson and Barney, the combined approach yields well-separated vowel clusters by reducing the spread due to talkers. With vowel classification accuracy as the objective measure, the proposed procedure outperforms two top-ranked normalization procedures.
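The two steps can be illustrated with a minimal Python sketch. The intrinsic step follows the description above, under the assumption that the geometric mean is taken per vowel token. The abstract does not give the exact form of the extrinsic step, so the rescaling factor used here (the geometric mean of the reference mean formants) is an assumption for illustration only, and the function names and formant values are hypothetical.

```python
import math


def intrinsic_normalize(f1, f2, f3):
    """Scale a token's formants by the reciprocal of their geometric mean.

    Assumption: the geometric mean is computed per token from F1, F2, F3 (Hz).
    """
    gm = (f1 * f2 * f3) ** (1.0 / 3.0)
    return (f1 / gm, f2 / gm, f3 / gm)


def extrinsic_denormalize(normalized, reference_means):
    """Rescale normalized values using mean formant values of vowels.

    Assumption (not taken from the paper): the scale factor is the geometric
    mean of the three reference mean formants.
    """
    k = math.prod(reference_means) ** (1.0 / 3.0)
    return tuple(v * k for v in normalized)


# Hypothetical token and reference means, in Hz (illustrative values only).
token = (300.0, 2300.0, 3000.0)
reference_means = (500.0, 1500.0, 2500.0)

normalized = intrinsic_normalize(*token)
rescaled = extrinsic_denormalize(normalized, reference_means)
```

After the intrinsic step the three normalized values have a geometric mean of 1, so formant ratios are preserved while the overall talker-dependent scale is removed. Under the assumed extrinsic step, every token is then mapped back to a common frequency scale.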
