Automatic analysis of speech quality for aphasia treatment

Aphasia is a common language disorder which can severely affect an individual's ability to communicate with others. Aphasia rehabilitation requires intensive practice accompanied by appropriate feedback, the latter of which is difficult to satisfy outside of therapy. In this paper we take a first step towards developing an intelligent system capable of providing feedback to patients with aphasia through the automation of two typical therapeutic exercises, sentence building and picture description. We describe the natural speech corpus collected from our interaction with clients in the University of Michigan Aphasia Program (UMAP). We develop classifiers to automatically estimate speech quality based on human perceptual judgment. Our automatic prediction yields accuracies comparable to the average human evaluator. Our feature selection process gives insights into the factors that influence human evaluation. The results presented in this work provide support for the feasibility of this type of system.

[1]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[2]  Dimitra Vergyri,et al.  Learning diagnostic models using speech and language measures , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[4]  A. Kertesz,et al.  The Aphasia Quotient: The Taxonomic Approach to Measurement of Aphasic Disability , 2004, Canadian Journal of Neurological Sciences / Journal Canadien des Sciences Neurologiques.

[5]  Robert Teasell,et al.  Therapeutic Interventions for Aphasia Initiated More than Six Months Post Stroke: A Review of the Evidence , 2012, Topics in stroke rehabilitation.

[6]  Richard C Katz Computers in the treatment of chronic aphasia. , 2010, Seminars in speech and language.

[7]  M P Black,et al.  Automatic Prediction of Children's Reading Ability for High-Level Literacy Assessment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Walter Huber,et al.  Supervised home training of dialogue skills in chronic aphasia: a randomized parallel group study. , 2011, Journal of speech, language, and hearing research : JSLHR.

[9]  Hao Wang,et al.  Predicting gradation of L2 English mispronunciations using crowdsourced ratings and phonological rules , 2013, SLaTE.

[10]  R. Teasell,et al.  Intensity of Aphasia Therapy, Impact on Recovery , 2003, Stroke.

[11]  Frank Rudzicz,et al.  Using text and acoustic features to diagnose progressive aphasia and its subtypes , 2013, INTERSPEECH.

[12]  Brian Roark,et al.  Spoken Language Derived Measures for Detecting Mild Cognitive Impairment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[15]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Isabel Trancoso,et al.  Automatic word naming recognition for treatment and assessment of aphasia , 2012, INTERSPEECH.

[17]  Serguei V. S. Pakhomov,et al.  Computerized Analysis of Speech and Language to Identify Psycholinguistic Correlates of Frontotemporal Lobar Degeneration , 2010, Cognitive and behavioral neurology : official journal of the Society for Behavioral and Cognitive Neurology.

[18]  Luís C. Oliveira,et al.  Jitter Estimation Algorithms for Detection of Pathological Voices , 2009, EURASIP J. Adv. Signal Process..