Detection of Total Syllables and Canonical Syllables in Infant Vocalizations

During the first two years of life, human infants produce increasing numbers of speech-like (canonical) syllables. Both basic research on child speech development and clinical work assessing a child’s pre-speech capabilities stand to benefit from efficient, accurate, and consistent methods for counting the syllables present in a given infant utterance. To date, there have been only a few attempts to perform syllable counting in infant vocalizations automatically, and thorough comparisons to human listener counts are lacking. We apply four existing, openly available systems for detecting syllabic, consonant, or vowel elements in vocalizations and apply them to a set of infant utterances individually and in combination. With the automated methods, we obtain canonical syllable counts that correlate well enough with trained human listener counts to replicate the pattern of increasing canonical syllable frequency as infants get older. However, agreement between the automated methods and human listener canonical syllable counts is considerably weaker than human listeners’ agreement with each other. On the other hand, automatic identification of syllable-like units of any type (canonical and non-canonical both included) match human listeners’ judgments quite well. Interestingly, these total syllable counts also increase with infant age.

[1]  J. Locke,et al.  Linguistic significance of babbling: evidence from a tracheostomized infant , 1990, Journal of Child Language.

[2]  D K Oller,et al.  Speech-like vocalizations in infancy: an evaluation of potential risk factors , 1994, Journal of Child Language.

[3]  Kenneth N Stevens,et al.  Toward a model for lexical access based on acoustic landmarks and distinctive features. , 2002, The Journal of the Acoustical Society of America.

[4]  Rachel E. Stark STAGES OF SPEECH DEVELOPMENT IN THE FIRST YEAR OF LIFE 1 1This work was supported by NINCDS Grant NS.09628 and by NICHD Grant HD. 11970. , 1980 .

[5]  C. Stoel-Gammon,et al.  Babbling development of hearing-impaired and normally hearing subjects. , 1986, The Journal of speech and hearing disorders.

[6]  Heather L. Ramsdell,et al.  A weighted reliability measure for phonetic transcription. , 2006, Journal of speech, language, and hearing research : JSLHR.

[7]  L. Roug,et al.  Phonetic development in early infancy: a study of four Swedish children during the first eighteen months of life , 1989, Journal of Child Language.

[8]  Suneeti Nathani Iyer,et al.  Prelinguistic Vocalizations in Infants and Toddlers with Hearing Loss , 2010 .

[9]  Linda J. Ferrier,et al.  Vocalization age as a clinical tool , 2002, INTERSPEECH.

[10]  Rachel E. Stark,et al.  Speech development in a child after decannulation: Further evidence that babbling facilitates later speech development , 1993 .

[11]  Susan L. Denham An auditory model for the detection of perceptual onsets and beat tracking in singing , 2007 .

[12]  Nivja H. Jong,et al.  Praat script to detect syllable nuclei and measure speech rate automatically , 2009, Behavior research methods.

[13]  D K Oller,et al.  Automated vocal analysis of naturalistic recordings from children with autism, language delay, and typical development , 2010, Proceedings of the National Academy of Sciences.

[14]  D K Oller,et al.  The role of audition in infant babbling. , 1988, Child development.

[15]  Elena Patten,et al.  Vocal Patterns in Infants with Autism Spectrum Disorder: Canonical Babbling Status and Vocalization Frequency , 2014, Journal of Autism and Developmental Disorders.

[16]  Paula Menyuk,et al.  Predicting Phonological Development , 1986 .

[17]  Dongxin Xu,et al.  Automated analysis of child phonetic production using naturalistic recordings. , 2014, Journal of speech, language, and hearing research : JSLHR.

[18]  Harriet J. Fell,et al.  SpeechMark: Landmark Detection Tool for Speech Analysis , 2012, INTERSPEECH.

[19]  A. Warlaumont,et al.  Learning to Produce Syllabic Speech Sounds via Reward-Modulated Neural Plasticity , 2016, PloS one.

[20]  Sue L. Denham,et al.  Robust sound classification through the representation of similarity using response fields derived from stimuli during early experience , 2005, Biological Cybernetics.

[21]  Barbara L. Davis,et al.  Early Vocal Patterns in Infants with Varied Hearing Levels , 2005 .

[22]  D. Oller THE EMERGENCE OF THE SOUNDS OF SPEECH IN INFANCY , 1980 .

[23]  Harriet J. Fell,et al.  Automated tools for identifying syllabic landmark clusters that reflect changes in articulation , 2011, MAVEBA.

[24]  Florien J. Koopmans-van Beinum,et al.  Early Stages in the Development of Speech Movements , 1986 .

[25]  D K Oller,et al.  Development of precursors to speech in infants exposed to two languages , 1997, Journal of Child Language.

[26]  Sue L. Denham,et al.  Model cortical responses for the detection of perceptual onsets and beat tracking in singing , 2009, Connect. Sci..

[27]  Suneeti Nathani Iyer,et al.  Prelinguistic Vocal Development in Infants with Typical Hearing and Infants with Severe-to-Profound Hearing Loss. , 2008, The Volta review.

[28]  D. Kimbrough Oller,et al.  AN ACOUSTIC PHONETIC CATALOG , 2013 .

[29]  Harriet J. Fell,et al.  Vocalization analysis tools , 2005, MAVEBA.