Automatic Assessment of Language Ability in Children with and without Typical Development

This study describes a fully automated method of expressive language assessment based on children's vocal responses to a sentence repetition task (SRT), a language test that taps into core language skills. The proposed method automatically transcribes the vocal responses using a test-specific automatic speech recognition system. From the transcriptions, a regression model predicts the gold standard test scores provided by speech-language pathologists. Our preliminary experimental results on audio recordings of 104 children (43 with typical development and 61 with a neurodevelopmental disorder) verify the feasibility of the proposed automatic method for predicting gold standard scores on this language test, with an averaged mean absolute error of 6.52 between observed and predicted ratings (observed scores ranged from 0 to 90, with a mean of 49.56).

Clinical relevance: We describe the use of fully automatic voice-based scoring in language assessment, including the clinical impact this development may have on the field of speech-language pathology. The automated test also creates a technological foundation for the computerization of a broad array of tests for voice-based language assessment.
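To make the evaluation metric concrete, the following is a minimal Python sketch of two pieces such a pipeline might involve: a transcript-level feature and the mean absolute error between observed and predicted scores. The word-level edit distance feature and the function names `word_edit_distance` and `mean_absolute_error` are illustrative assumptions; the abstract does not specify which features the regression model uses, and the example values are invented for demonstration only.

```python
def word_edit_distance(target, response):
    """Levenshtein distance over word tokens (hypothetical feature).

    Counts the word insertions, deletions, and substitutions needed to
    turn the child's transcribed response into the target sentence.
    """
    ref = target.lower().split()
    hyp = response.lower().split()
    # prev[j] holds the distance between the first i-1 reference words
    # and the first j response words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def mean_absolute_error(observed, predicted):
    """Average absolute difference between observed and predicted scores."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("score lists must be non-empty and of equal length")
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


# Invented example values, not data from the study.
errors = word_edit_distance("the dog chased the cat", "the dog chase cat")
mae = mean_absolute_error([50, 40, 60], [45, 48, 60])
```

In this sketch, a lower edit distance indicates a more faithful repetition, and the mean absolute error is expressed in the same units as the test score, which is why the reported value of 6.52 is interpreted against the 0 to 90 score range.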
