Detecting Abnormal Word Utterances in Children With Autism Spectrum Disorders

Abnormal prosody is often evident in the voice intonations of individuals with autism spectrum disorders. We compared a machine-learning-based voice analysis with human hearing judgments made by 10 speech therapists for classifying children with autism spectrum disorders (n = 30) and typical development (n = 51). Using stimuli limited to single-word utterances, machine-learning-based voice analysis was superior to speech therapist judgments. There was a significantly higher true-positive than false-negative rate for machine-learning-based voice analysis but not for speech therapists. Results are discussed in terms of some artificiality of clinician judgments based on single-word utterances, and the objectivity machine-learning-based voice analysis adds to judging abnormal prosody.

[1]  Yael Adini,et al.  Abnormal Speech Spectrum and Increased Pitch Variability in Young Autistic Children , 2011, Front. Hum. Neurosci..

[2]  Duane G. Watson,et al.  An acoustic analysis of prosody in high-functioning autism , 2009 .

[3]  L. Kanner Autistic disturbances of affective contact. , 1968, Acta paedopsychiatrica.

[4]  F. Volkmar,et al.  Speech and prosody characteristics of adolescents and adults with high-functioning autism and Asperger syndrome. , 2001, Journal of speech, language, and hearing research : JSLHR.

[5]  Raymond D. Kent Hearing and Believing , 1996 .

[6]  M. Dorahy Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5) , 2014 .

[7]  Björn W. Schuller,et al.  The INTERSPEECH 2009 emotion challenge , 2009, INTERSPEECH.

[8]  Donna Erickson,et al.  Sounds of melody—Pitch patterns of speech in autism , 2010, Neuroscience Letters.

[9]  Tetsuya Takiguchi,et al.  Speech intonation in children with autism spectrum disorder , 2014, Brain and Development.

[10]  Kyle Sterrett,et al.  An Item Response Theory Evaluation of the Autism Diagnostic Interview-Revised (ADI-R) , 2019 .

[11]  Sue Peppé,et al.  Prosody in autism spectrum disorders: a critical review. , 2003, International journal of language & communication disorders.

[12]  Sadaoki Furui,et al.  Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..

[13]  I. Lorge,et al.  Childhood schizophrenia; symposium, 1955. V. A study of speech patterns in a group of schizophrenic children. , 1956, The American journal of orthopsychiatry.

[14]  D. Skuse Asperger's Syndrome: A Guide for Parents and Professionals , 1998 .

[15]  Kathleen Hubbard,et al.  Intonation and Emotion in Autistic Spectrum Disorders , 2007, Journal of psycholinguistic research.

[16]  A. Christophe,et al.  Newborns' Cry Melody Is Shaped by Their Native Language , 2009, Current Biology.

[17]  Katherine E Henson,et al.  Risk of Suicide After Cancer Diagnosis in England , 2018, JAMA psychiatry.

[18]  C. Woolger,et al.  Wechsler Intelligence Scale for Children-Third Edition (wisc-iii) , 2001 .

[19]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[20]  Emily Tucker Prud'hommeaux,et al.  Computational prosodic markers for autism. , 2010, Autism : the international journal of research and practice.

[21]  Helen Tager-Flusberg,et al.  On the nature of linguistic functioning in early infantile autism , 1981, Journal of autism and developmental disorders.

[22]  C. Gillberg,et al.  Asperger syndrome--some epidemiological considerations: a research note. , 1989, Journal of child psychology and psychiatry, and allied disciplines.

[23]  V. Reddy,et al.  Relations Among Detection of Syllable Stress, Speech Abnormalities, and Communicative Ability in Adults With Autism Spectrum Disorders. , 2016, Journal of speech, language, and hearing research : JSLHR.

[24]  Rhea Paul,et al.  Perception and Production of Prosody by Speakers with Autism Spectrum Disorders , 2005, Journal of autism and developmental disorders.

[25]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[26]  C. Baltaxe,et al.  Use of contrastive stress in normal, aphasic, and autistic children. , 1984, Journal of speech and hearing research.