Analysis and Synthesis of Speaker Age

Speaker age is an important speaker-specific quality, which was investigated in the two studies presented here. The first study automatically extracted 161 acoustic features from six words produced by 527 speakers, and used normalised mean values to compare the features. Segment duration and sound pressure level (SPL) range were identified as two important acoustic correlates of age. The second study developed a research tool for analysis of speaker age by data-driven formant synthesis and age-weigthed linear interpolation to simulate an age between the ages of any two of four female differently-aged reference speakers. Evaluation of the tool revealed that speaker age may in fact be simulated using formant synthesis. Both studies will be used in further attempts to model and simulate speaker age. (Less)

[1]  K. Hadding-Koch,et al.  Acoustico-phonetic studies in the intonation of southern Swedish , 1963 .

[2]  Chen-Chi Wang,et al.  Voice acoustic analysis of normal Taiwanese adults. , 2004, Journal of the Chinese Medical Association : JCMA.

[3]  Arthur Holmer,et al.  A parametric grammar of Seediq , 1997 .

[4]  Eva Gårding,et al.  Internal juncture in Swedish , 1970 .

[5]  Barbara Gawronska,et al.  An MT oriented model of aspect and article semantics , 1993 .

[6]  Antonis Botinis,et al.  Stress and prosodic structure in Greek : a phonological, acoustic, physiological and perceptual study , 1989 .

[7]  Karina Vamling,et al.  Complementation in Georgian , 1989 .

[8]  G. Bruce Swedish word accents in sentence perspective , 1977 .

[9]  Anita Wagner,et al.  Is voice quality language‐dependent? Acoustic analyses based on speakers of three different languages. , 2003 .

[10]  H. Traunmüller,et al.  Acoustic effects of variation in vocal effort by men, women, and children. , 2000, The Journal of the Acoustical Society of America.

[11]  Hartmut Traunmüller,et al.  Perception of speaker sex, age, and vocal effort , 1997 .

[12]  Anders Eriksson,et al.  The frequency range of the voice fundamental in the speech of male and female adults , 1993 .

[13]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[14]  Mechtild Tronnier Nasals and Nasalisation in Speech Production with Special Emphasis on Methodology and Osaka Japanese , 1998 .

[15]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[16]  Anastasia Karlsson,et al.  Rhythm and intonation in Halh Mongolian , 2005 .

[17]  Caroline Willners,et al.  Antonyms in Context : A Corpus-Based Semantic Analysis of Swedish Descriptive Adjectives , 2001 .

[18]  Gösta Bruce,et al.  Phonetics and phonology of the Swedish dialects - a project presentation and a database demonstrator , 1999 .

[19]  Eva Gårding,et al.  The Scandinavian word accents , 1977 .

[20]  Edward Carney,et al.  Hiss transitions and their perception , 1970 .

[21]  Susanne Schötz,et al.  Perception, Analysis and Synthesis of Speaker Age , 2006 .

[22]  W. Ryan,et al.  Acoustic aspects of the aging voice. , 1972, Journal of gerontology.

[23]  E. Söderpalm,et al.  Speech errors in normal and pathological speech , 1979 .

[24]  Yasuka Nagano-Madsen,et al.  Mora and Prosodic Coordination: A Phonetic Study of Japanese, Eskimo and Yoruba , 1992 .

[25]  Ingmarie Mellenius,et al.  The acquisition of nominal compounding in Swedish , 1997 .

[26]  Johan Frid,et al.  Lexical and Acoustic Modelling of Swedish Prosody , 2003 .

[27]  D. House Tonal perception in speech , 1990 .

[28]  Emilio Rivano Fischer,et al.  Topology and dynamics of interactions : with special reference to Spanish and Mapudungu , 1991 .

[29]  Ulrika Nettelbladt,et al.  Developmental studies of dysphonology in children , 1983 .

[30]  M. Olsson,et al.  Hungarian phonology and morphology , 1996 .

[31]  Elisabeth Zetterholm PhD Abstract. Voice Imitation. A phonetic study of perceptual illusions and acoustic success , 2003 .

[32]  Gisela Håkansson,et al.  Teacher Talk: How Teachers Modify Their Speech When Addressing Learners of Swedish As a Second Language , 1987 .

[33]  Petra Hansson,et al.  Prosodic Phrasing in Spontaneous Swedish , 2003 .

[34]  Hartmut Traunmüller,et al.  Evidence for demodulation in speech perception , 2000, 6th International Conference on Spoken Language Processing (ICSLP 2000).

[35]  Carl-Gustaf Söderberg,et al.  A Typological Study on the Phonetic Structure of English Words with an Instrumental-Phonetic Excursus on English Stress , 1959 .

[36]  T. D. Hanley,et al.  Vocal aging. , 1959, Geriatrics.

[37]  D. Markham Phonetic imitation, accent, and the learner , 1999 .

[38]  J. D. Amerman,et al.  Speech timing strategies in elderly adults , 1992 .

[39]  Susanne Schötz Speaker Age: A First Step From Analysis To Synthesis , 2003 .

[40]  Velta Ruke-Dravina,et al.  Mehrsprachigkeit im Vorschulalter , 1969 .

[41]  Rolf Carlson,et al.  Experiments with voice modelling in speech synthesis , 1991, Speech Commun..

[42]  Jan-Olof Svantesson,et al.  Kammu phonology and morphology , 1983 .

[43]  Christian A. Müller,et al.  Zweistufige kontextsensitive Sprecherklassifikation am Beispiel von Alter und Geschlecht , 2005 .

[44]  Anna Flyman Mattsson Teaching, Learning, and Student Output : A Study of French in the Classroom , 2003 .

[45]  Hong Gao,et al.  The Physical Foundation of the Patterning of Physical Action Verbs : A Study of Chinese Verbs , 2001 .

[46]  Christer Johansson,et al.  A View From Language: Growth of Language in Individuals and Populations , 1997 .

[47]  J. C. Liljencrants The OVE III speech synthesizer , 1968 .

[48]  P. Boersma Praat : doing phonetics by computer (version 4.4.24) , 2006 .

[49]  Susanne Schötz,et al.  Stimulus duration and type in perception of female and male speaker age , 2005, INTERSPEECH.

[50]  H. Traunmüller,et al.  Paralinguistic Variation and Invariance in the Characteristic Frequencies of Vowels , 1988, Phonetica.

[51]  P. Green,et al.  Consonant-Vowel Transitions. A Spectrographic Study , 1959 .

[52]  Steve An Xue, Dimitar Deliyski EFFECTS OF AGING ON SELECTED ACOUSTIC VOICE PARAMETERS: PRELIMINARY NORMATIVE DATA AND EDUCATIONAL IMPLICATIONS , 2001 .

[53]  Olle Engstrand,et al.  Effects of sex and age in the Arjeplog dialect: a listening test and measurements of preaspiration and VOT , 2002 .

[54]  R. Winkler,et al.  The Aging Voice: an Acoustic, Electroglottographic and Perceptive Analysis of Male and Female Voices , 2003 .