Towards Automatic Speech-Language Assessment for Aphasia Rehabilitation

Aphasia is a common neurological disorder that can severely impact a person’s communication abilities. Speech-based technology has the potential to reinforce traditional aphasia therapy through the development of automatic speech-language assessment systems. Such systems can provide clinicians with supplementary information to assist with progress monitoring and treatment planning, and can provide support for on-demand auxiliary treatment. However, current technology cannot support this type of application due to two major limitations. First, the majority of speech-language assessment techniques assume the availability of manually labeled transcripts, which are time consuming to obtain and typically not available in real-world clinical applications. Second, automatic speech recognition (ASR) traditionally has poor performance on aphasic speech, resulting in inaccurate transcripts that prevent the automation of these techniques. The focus of this dissertation is on the development of computational methods that can accurately assess aphasic speech across a range of clinically-relevant dimensions without the need for manual transcripts. The dissertation is organized into three parts: • Part I: The first part focuses on novel techniques for assessing qualitative aspects of intelligibility in constrained aphasic speech. In this problem setup, speech production occurs in controlled environments, lexical content is restricted, and the target prompt for each utterance is known. While the speech-language impairments associated with aphasia often prevent exact verbalization of the prompts, this constraint greatly simplifies ASR and allows for more accurate transcript generation. We show that transcripts for constrained aphasic speech can be generated automatically with

[1]  Oscar Saz-Torralba,et al.  Tools and Technologies for Computer-Aided Speech and Language Therapy , 2009, Speech Commun..

[2]  Shrikanth S. Narayanan,et al.  Predicting children's reading ability using evaluator-informed features , 2009, INTERSPEECH.

[3]  T. Olsen,et al.  Aphasia after Stroke: Type, Severity and Prognosis , 2003, Cerebrovascular Diseases.

[4]  L. Tan,et al.  Measuring prosodic deficits in oral discourse by speakers with fluent aphasia , 2015 .

[5]  Leora R Cherney,et al.  Communication partner training in aphasia: a systematic review. , 2010, Archives of physical medicine and rehabilitation.

[6]  Kaisheng Yao,et al.  KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Florian Metze,et al.  Extracting deep bottleneck features using stacked auto-encoders , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Jan Jantzen,et al.  The Aphasia Database on the Web: Description of a Model for Problems of Classification in Medicine. , 2000 .

[9]  Frank Rudzicz,et al.  Using text and acoustic features to diagnose progressive aphasia and its subtypes , 2013, INTERSPEECH.

[10]  Souvik Kundu,et al.  Speaker-aware training of LSTM-RNNS for acoustic modelling , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Alexander I. Rudnicky,et al.  Using Virtual Technology to Promote Functional Communication in Aphasia: Preliminary Evidence From Interactive Dialogues With Human and Virtual Clinicians. , 2015, American Journal of Speech-Language Pathology.

[12]  L. Manheim,et al.  Patient-reported changes in communication after computer-based script training for aphasia. , 2009, Archives of physical medicine and rehabilitation.

[13]  Mark J. F. Gales,et al.  Transcription of multi-genre media archives using out-of-domain data , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[14]  Jean-Pierre Martens,et al.  Automated Intelligibility Assessment of Pathological Speech Using Phonological Features , 2009, EURASIP J. Adv. Signal Process..

[15]  M. Albert,et al.  Manual of Aphasia and Aphasia Therapy , 2013 .

[16]  Brian MacWhinney,et al.  AphasiaBank: A Resource for Clinicians , 2012, Seminars in Speech and Language.

[17]  R H Brookshire,et al.  Presence, completeness, and accuracy of main concepts in the connected speech of non-brain-damaged adults and adults with aphasia. , 1995, Journal of speech and hearing research.

[18]  Lukás Burget,et al.  Simplification and optimization of i-vector extraction , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Viveka Lyberg Åhlander,et al.  Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders , 2009, Logopedics, phoniatrics, vocology.

[20]  Seyed Omid Sadjadi,et al.  The IBM 2016 Speaker Recognition System , 2016, Odyssey.

[21]  W. Ziegler,et al.  Telediagnostic assessment of intelligibility in dysarthria: a pilot investigation of MVP-online. , 2008, Journal of communication disorders.

[22]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[23]  J. Martens,et al.  Speech technology-based assessment of phoneme intelligibility in dysarthria. , 2009, International journal of language & communication disorders.

[24]  H. Stadthagen-González,et al.  The Bristol norms for age of acquisition, imageability, and familiarity , 2006, Behavior research methods.

[25]  Kathleen C. Fraser,et al.  Automated classification of primary progressive aphasia subtypes from narrative speech transcripts , 2014, Cortex.

[26]  Linda J. Ferrier,et al.  Dysarthric speakers' intelligibility and speech characteristics in relation to computer speech recognition , 1995 .

[27]  Donald A. Robin,et al.  Treatment guidelines for acquired apraxia of speech , 2006 .

[28]  Dong Yu,et al.  Improved Bottleneck Features Using Pretrained Deep Neural Networks , 2011, INTERSPEECH.

[29]  Liang Lu,et al.  Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[30]  M. Garrett,et al.  Lexical retrieval and its breakdown in aphasia and developmental language impairment , 2013 .

[31]  James R. Glass,et al.  Learning Lexicons From Speech Using a Pronunciation Mixture Model , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[32]  R. R. Robey A meta-analysis of clinical outcomes in the treatment of aphasia. , 1998, Journal of speech, language, and hearing research : JSLHR.

[33]  Peter Bell,et al.  Complementary tasks for context-dependent deep neural network acoustic models , 2015, INTERSPEECH.

[34]  Heidi Christensen,et al.  Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).

[35]  Rosalind C. Kaye,et al.  Computer-based script training for aphasia: emerging themes from post-treatment interviews. , 2011, Journal of communication disorders.

[36]  Heather Harris Wright,et al.  Lexical diversity for adults with and without aphasia across discourse elicitation tasks , 2011, Aphasiology.

[37]  Geoffrey E. Hinton,et al.  Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[38]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[39]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[40]  Hong Kook Kim,et al.  Dysarthric Speech Recognition Error Correction Using Weighted Finite State Transducers Based on Context-Dependent Pronunciation Variation , 2012, ICCHP.

[41]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[42]  Thomas Hain,et al.  Automatic assessment of English learner pronunciation using discriminative classifiers , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[43]  Isabel Trancoso,et al.  Automatic word naming recognition for treatment and assessment of aphasia , 2012, INTERSPEECH.

[44]  Jen-Tzung Chien,et al.  Automatic speech recognition for acoustical analysis and assessment of cantonese pathological voice and speech , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[45]  Foad Hamidi,et al.  CanSpeak: A Customizable Speech Interface for People with Dysarthric Speech , 2010, ICCHP.

[46]  Siti Salwah Salim,et al.  Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker , 2015, Expert Syst. Appl..

[47]  R. Teasell,et al.  Rehabilitation of Aphasia: More Is Better , 2003, Topics in stroke rehabilitation.

[48]  Alexandre Allauzen,et al.  Using Dynamic Time Warping to Compute Prosodic Similarity Measures , 2011, INTERSPEECH.

[49]  Katarina L Haley,et al.  Toward a quantitative basis for assessment and diagnosis of apraxia of speech. , 2012, Journal of speech, language, and hearing research : JSLHR.

[50]  Heidi Christensen,et al.  Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech , 2013, INTERSPEECH.

[51]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[52]  Haipeng Wang,et al.  Analysis of auto-aligned and auto-segmented oral discourse by speakers with aphasia: A preliminary study on the acoustic parameter of duration. , 2013, Procedia, social and behavioral sciences.

[53]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[54]  George Saon,et al.  Speaker adaptation of neural network acoustic models using i-vectors , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[55]  Prisca Stenneken,et al.  Diagnosing residual aphasia using spontaneous speech analysis , 2012 .

[56]  Tomi Kinnunen,et al.  i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[57]  Dong Yu,et al.  Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[58]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[59]  Dimitra Vergyri,et al.  Learning diagnostic models using speech and language measures , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[60]  Andrew W. Senior,et al.  Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.

[61]  B. Rockstroh,et al.  Long-Term Stability of Improved Language Functions in Chronic Aphasia After Constraint-Induced Aphasia Therapy , 2005, Stroke.

[62]  Haihua Xu,et al.  Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[63]  Seyed Omid Sadjadi,et al.  Speaker age estimation on conversational telephone speech using senone posterior based i-vectors , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[64]  Kenneth Ward Church,et al.  Deep neural network features and semi-supervised training for low resource speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[65]  Steve J. Young,et al.  Phone-level pronunciation scoring and assessment for interactive language learning , 2000, Speech Commun..

[66]  M. Schwartz,et al.  Semantic Factors in Verb Retrieval: An Effect of Complexity , 1998, Brain and Language.

[67]  Björn W. Schuller,et al.  The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing , 2016, IEEE Transactions on Affective Computing.

[68]  Nick Miller,et al.  Association between objective measurement of the speech intelligibility of young people with dysarthria and listener ratings of ease of understanding , 2014, International journal of speech-language pathology.

[69]  M P Black,et al.  Automatic Prediction of Children's Reading Ability for High-Level Literacy Assessment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[70]  Emily Mower Provost,et al.  Automatic analysis of speech quality for aphasia treatment , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[71]  A. Kertesz The Western Aphasia Battery , 1982 .

[72]  C M Shewan,et al.  Reliability and validity characteristics of the Western Aphasia Battery (WAB). , 1980, The Journal of speech and hearing disorders.

[73]  James Carmichael,et al.  A speech-controlled environmental control system for people with severe dysarthria. , 2007, Medical engineering & physics.

[74]  Jasha Droppo,et al.  Multi-task learning in deep neural networks for improved phoneme recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[75]  Lyndsey Nickels,et al.  Therapy for naming disorders: Revisiting, revising, and reviewing , 2002 .

[76]  John H. L. Hansen,et al.  UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[77]  James R. Glass,et al.  Pronunciation assessment via a comparison-based system , 2013, SLaTE.

[78]  Tara N. Sainath,et al.  Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[79]  Sree Hari Krishnan Parthasarathi,et al.  Robust i-vector based adaptation of DNN acoustic model for speech recognition , 2015, INTERSPEECH.

[80]  Gary Weismer,et al.  Direct magnitude estimates of speech intelligibility in dysarthria: effects of a chosen standard. , 2002, Journal of speech, language, and hearing research : JSLHR.

[81]  Mari Ostendorf,et al.  TOBI: a standard for labeling English prosody , 1992, ICSLP.

[82]  Martha Danly,et al.  Speech prosody in Broca's aphasia , 1982, Brain and Language.

[83]  Katarina L. Haley,et al.  Temporal and spectral properties of voiceless fricatives in aphasia and apraxia of speech , 2002 .

[84]  Larry Boles,et al.  Conversational treatment in mild aphasia: A case study , 2009 .

[85]  Dong Yu,et al.  Large vocabulary continuous speech recognition with context-dependent DBN-HMMS , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[86]  Patrick Kenny,et al.  Eigenvoice modeling with sparse training data , 2005, IEEE Transactions on Speech and Audio Processing.

[87]  Stephen J. Cox,et al.  Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers , 2009, EURASIP J. Adv. Signal Process..

[88]  Antonio Bonafonte,et al.  Deep Neural Networks for i-Vector Language Identification of Short Utterances in Cars , 2016, INTERSPEECH.

[89]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[90]  Serguei V. S. Pakhomov,et al.  Computerized Analysis of Speech and Language to Identify Psycholinguistic Correlates of Frontotemporal Lobar Degeneration , 2010, Cognitive and behavioral neurology : official journal of the Society for Behavioral and Cognitive Neurology.

[91]  Carlos Gussenhoven,et al.  Durational variability in speech and the Rhythm Class Hypothesis , 2002 .

[92]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[93]  Emily Mower Provost,et al.  Improving Automatic Recognition of Aphasic Speech with AphasiaBank , 2016, INTERSPEECH.

[94]  James R. Glass,et al.  Feature-based Pronunciation Modeling for Speech Recognition , 2004, HLT-NAACL.

[95]  Peter Bell,et al.  Regularization of context-dependent deep neural networks with context-independent multi-task training , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[96]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[97]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .

[98]  R. Bastiaanse,et al.  Analysing the spontaneous speech of aphasic speakers , 2004 .

[99]  Florin Curelaru,et al.  Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).

[100]  Emily Mower Provost,et al.  Automatic Paraphasia Detection from Aphasic Speech: A Preliminary Study , 2017, INTERSPEECH.

[101]  Mark Hasegawa-Johnson,et al.  State-Transition Interpolation and MAP Adaptation for HMM-based Dysarthric Speech Recognition , 2010, SLPAT@NAACL.

[102]  Dong Yu,et al.  Automatic Speech Recognition: A Deep Learning Approach , 2014 .

[103]  Visar Berisha,et al.  Modeling pathological speech perception from data with similarity labels , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[104]  Mark Hasegawa-Johnson,et al.  Acoustic model adaptation using in-domain background models for dysarthric speech recognition , 2013, Comput. Speech Lang..

[105]  Margaret Forbes,et al.  AphasiaBank: Methods for studying discourse , 2011, Aphasiology.

[106]  James R. Glass,et al.  Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[107]  Daniel P. W. Ellis,et al.  Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[108]  Nick Miller,et al.  Prevalence and pattern of perceived intelligibility changes in Parkinson’s disease , 2007, Journal of Neurology, Neurosurgery, and Psychiatry.

[109]  Jack Gandour,et al.  Dysprosody in Broca's aphasia: A case study , 1989, Brain and Language.

[110]  James R. Glass,et al.  A Comparison-based Approach to Mispronunciation Detection by , 2012 .

[111]  Georg Heigold,et al.  Sequence discriminative distributed training of long short-term memory recurrent neural networks , 2014, INTERSPEECH.

[112]  K. Willmes,et al.  The psychometric properties of the English language version of the Aachen Aphasia Test (EAAT) , 2000 .

[113]  K. Hacioglu,et al.  TESTING SUPRASEGMENTAL ENGLISH THROUGH PARROTING , 2010 .

[114]  Emily Mower Provost,et al.  Modeling pronunciation, rhythm, and intonation for automatic assessment of speech quality in aphasia rehabilitation , 2014, INTERSPEECH.

[115]  Eric Fosler-Lussier,et al.  Articulatory feature-based pronunciation modeling , 2016, Comput. Speech Lang..

[116]  Elmar Nöth,et al.  Automatic scoring of the intelligibility in patients with cancer of the oral cavity , 2007, INTERSPEECH.

[117]  Katharine H. Odell,et al.  Perceptual characteristics of consonant production by apraxic speakers. , 1990, The Journal of speech and hearing disorders.

[118]  P. Green,et al.  STARDUST – Speech Training And Recognition for Dysarthric Users of Assistive Technology , 2003 .

[119]  A. Kertesz,et al.  The Aphasia Quotient: The Taxonomic Approach to Measurement of Aphasic Disability , 2004, Canadian Journal of Neurological Sciences / Journal Canadien des Sciences Neurologiques.

[120]  Elmar Nöth,et al.  Towards robust automatic evaluation of pathologic telephone speech , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[121]  N. Miller Measuring up to speech intelligibility. , 2013, International journal of language & communication disorders.

[122]  F. Ramus,et al.  Correlates of linguistic rhythm in the speech signal , 1999, Cognition.

[123]  Shrikanth S. Narayanan,et al.  Improvements in predicting children's overall reading ability by modeling variability in evaluators' subjective judgments , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[124]  Andreas Stolcke,et al.  SRILM at Sixteen: Update and Outlook , 2011 .

[125]  G. Mcreddie Aphasia , 1868, The Indian medical gazette.

[126]  Martina Piefke,et al.  Basic parameters of spontaneous speech as a sensitive method for measuring change during the course of aphasia. , 2008, International journal of language & communication disorders.

[127]  Emily Mower Provost,et al.  Automatic Assessment of Speech Intelligibility for Individuals With Aphasia , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[128]  Phil D. Green,et al.  Speech technology for e-inclusion of people with physical disabilities and disordered speech , 2005, INTERSPEECH.

[129]  Naveen Kumar,et al.  Automatic intelligibility classification of sentence-level pathological speech , 2015, Comput. Speech Lang..

[130]  Kun Li,et al.  Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[131]  Dong Yu,et al.  Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[132]  Frank Rudzicz,et al.  Automatic speech recognition in the diagnosis of primary progressive aphasia , 2013, SLPAT.

[133]  Leora R Cherney,et al.  Computerized script training for aphasia: preliminary results. , 2008, American journal of speech-language pathology.

[134]  Frank Rudzicz,et al.  Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech , 2011, Canadian Conference on AI.

[135]  Andrew W. Senior,et al.  Improving DNN speaker independence with I-vector inputs , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[136]  Nicolas Côté Integral and Diagnostic Intrusive Prediction of Speech Quality , 2011, T-Labs Series in Telecommunication Services.

[137]  Swathi Kiran,et al.  Effect of Verb Network Strengthening Treatment (VNeST) on lexical retrieval of content words in sentences in persons with aphasia , 2009, Aphasiology.

[138]  R. Logie,et al.  Age-of-acquisition, imagery, concreteness, familiarity, and ambiguity measures for 1,944 words , 1980 .

[139]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[140]  Isabel Trancoso,et al.  Automatic word naming recognition for an on-line aphasia treatment system , 2013, Comput. Speech Lang..

[141]  Edward Gibson,et al.  Inter-transcriber reliability for two systems of prosodic annotation: ToBI (Tones and Break Indices) and RaP (Rhythm and Pitch) , 2012 .

[142]  S. Blumstein,et al.  Production deficits in aphasia: A voice-onset time analysis , 1980, Brain and Language.

[143]  Patrick Kenny,et al.  Mixture of PLDA Models in i-vector Space for Gender-Independent Speaker Recognition , 2011, INTERSPEECH.

[144]  Brigitte Rockstroh,et al.  Intensive language training in the rehabilitation of chronic aphasia: Efficient training by laypersons , 2007, Journal of the International Neuropsychological Society.

[145]  H. Goodglass Boston diagnostic aphasia examination , 2013 .

[146]  Heidi Christensen,et al.  A comparative study of adaptive, automatic recognition of disordered speech , 2012, INTERSPEECH.

[147]  Brian Roark,et al.  Spoken Language Derived Measures for Detecting Mild Cognitive Impairment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[148]  Frank Rudzicz,et al.  Treatment intensity and childhood apraxia of speech. , 2015, International journal of language & communication disorders.

[149]  Peter F. Halpin,et al.  Online crowdsourcing for efficient rating of speech: a validation study. , 2015, Journal of communication disorders.

[150]  Shrikanth S. Narayanan,et al.  Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment , 2007, INTERSPEECH.

[151]  Oscar Saz-Torralba,et al.  Verifying pronunciation accuracy from speakers with neuromuscular disorders , 2008, INTERSPEECH.

[152]  Heidi Christensen,et al.  Learning speaker-specific pronunciations of disordered speech , 2013, INTERSPEECH.

[153]  Frank Rudzicz,et al.  Phonological features in discriminative classification of dysarthric speech , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[154]  Marc Brys,et al.  Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English , 2009 .

[155]  Julie L. Wambaugh,et al.  A Critical Review of Acoustic Analyses of Aphasic and/or Apraxic Speech , 1996 .

[156]  Frank K. Soong,et al.  A Two-Pass Framework of Mispronunciation Detection and Diagnosis for Computer-Aided Pronunciation Training , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[157]  L. Murray,et al.  Functional measures of naming in aphasia: Word retrieval in confrontation naming versus connected speech , 2003 .

[158]  B Hallowell,et al.  A multinational comparison of aphasia management practices. , 2000, International journal of language & communication disorders.

[159]  R. Teasell,et al.  Intensity of Aphasia Therapy, Impact on Recovery , 2003, Stroke.

[160]  Khe Chai Sim,et al.  Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems , 2010, INTERSPEECH.

[161]  Yong Wang,et al.  Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers , 2015, Speech Commun..