A review of the acoustic and linguistic properties of children's speech

In this paper, we review the acoustic and linguistic properties of children's speech, covering both read and spontaneous speech. First, we present the effect of developmental changes on the absolute values and variability of acoustic correlates in read speech for children aged 6 and up. We then review spontaneous verbal child-machine interaction and present results from recent studies. We discuss age trends in acoustic, linguistic, and interaction parameters, such as sentence duration, filled pauses, politeness and frustration markers, and modality usage, and point out some differences between child-machine and human-human interaction. Finally, we discuss the implications for acoustic modeling, linguistic modeling, and the design of spoken dialogue systems for children.
