Towards Automatic Speech-Language Assessment for Aphasia Rehabilitation
[1] Oscar Saz-Torralba,et al. Tools and Technologies for Computer-Aided Speech and Language Therapy , 2009, Speech Commun..
[2] Shrikanth S. Narayanan,et al. Predicting children's reading ability using evaluator-informed features , 2009, INTERSPEECH.
[3] T. Olsen,et al. Aphasia after Stroke: Type, Severity and Prognosis , 2003, Cerebrovascular Diseases.
[4] L. Tan,et al. Measuring prosodic deficits in oral discourse by speakers with fluent aphasia , 2015 .
[5] Leora R Cherney,et al. Communication partner training in aphasia: a systematic review. , 2010, Archives of physical medicine and rehabilitation.
[6] Kaisheng Yao,et al. KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[7] Florian Metze,et al. Extracting deep bottleneck features using stacked auto-encoders , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] Jan Jantzen,et al. The Aphasia Database on the Web: Description of a Model for Problems of Classification in Medicine. , 2000 .
[9] Frank Rudzicz,et al. Using text and acoustic features to diagnose progressive aphasia and its subtypes , 2013, INTERSPEECH.
[10] Souvik Kundu,et al. Speaker-aware training of LSTM-RNNs for acoustic modelling , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Alexander I. Rudnicky,et al. Using Virtual Technology to Promote Functional Communication in Aphasia: Preliminary Evidence From Interactive Dialogues With Human and Virtual Clinicians. , 2015, American Journal of Speech-Language Pathology.
[12] L. Manheim,et al. Patient-reported changes in communication after computer-based script training for aphasia. , 2009, Archives of physical medicine and rehabilitation.
[13] Mark J. F. Gales,et al. Transcription of multi-genre media archives using out-of-domain data , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[14] Jean-Pierre Martens,et al. Automated Intelligibility Assessment of Pathological Speech Using Phonological Features , 2009, EURASIP J. Adv. Signal Process..
[15] M. Albert,et al. Manual of Aphasia and Aphasia Therapy , 2013 .
[16] Brian MacWhinney,et al. AphasiaBank: A Resource for Clinicians , 2012, Seminars in Speech and Language.
[17] R H Brookshire,et al. Presence, completeness, and accuracy of main concepts in the connected speech of non-brain-damaged adults and adults with aphasia. , 1995, Journal of speech and hearing research.
[18] Lukás Burget,et al. Simplification and optimization of i-vector extraction , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Viveka Lyberg Åhlander,et al. Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders , 2009, Logopedics, phoniatrics, vocology.
[20] Seyed Omid Sadjadi,et al. The IBM 2016 Speaker Recognition System , 2016, Odyssey.
[21] W. Ziegler,et al. Telediagnostic assessment of intelligibility in dysarthria: a pilot investigation of MVP-online. , 2008, Journal of communication disorders.
[22] B. MacWhinney. The CHILDES project: tools for analyzing talk , 1992 .
[23] J. Martens,et al. Speech technology-based assessment of phoneme intelligibility in dysarthria. , 2009, International journal of language & communication disorders.
[24] H. Stadthagen-González,et al. The Bristol norms for age of acquisition, imageability, and familiarity , 2006, Behavior research methods.
[25] Kathleen C. Fraser,et al. Automated classification of primary progressive aphasia subtypes from narrative speech transcripts , 2014, Cortex.
[26] Linda J. Ferrier,et al. Dysarthric speakers' intelligibility and speech characteristics in relation to computer speech recognition , 1995 .
[27] Donald A. Robin,et al. Treatment guidelines for acquired apraxia of speech , 2006 .
[28] Dong Yu,et al. Improved Bottleneck Features Using Pretrained Deep Neural Networks , 2011, INTERSPEECH.
[29] Liang Lu,et al. Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[30] M. Garrett,et al. Lexical retrieval and its breakdown in aphasia and developmental language impairment , 2013 .
[31] James R. Glass,et al. Learning Lexicons From Speech Using a Pronunciation Mixture Model , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[32] R. R. Robey. A meta-analysis of clinical outcomes in the treatment of aphasia. , 1998, Journal of speech, language, and hearing research : JSLHR.
[33] Peter Bell,et al. Complementary tasks for context-dependent deep neural network acoustic models , 2015, INTERSPEECH.
[34] Heidi Christensen,et al. Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[35] Rosalind C. Kaye,et al. Computer-based script training for aphasia: emerging themes from post-treatment interviews. , 2011, Journal of communication disorders.
[36] Heather Harris Wright,et al. Lexical diversity for adults with and without aphasia across discourse elicitation tasks , 2011, Aphasiology.
[37] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[38] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[39] Ewan Klein,et al. Natural Language Processing with Python , 2009 .
[40] Hong Kook Kim,et al. Dysarthric Speech Recognition Error Correction Using Weighted Finite State Transducers Based on Context-Dependent Pronunciation Variation , 2012, ICCHP.
[41] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[42] Thomas Hain,et al. Automatic assessment of English learner pronunciation using discriminative classifiers , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[43] Isabel Trancoso,et al. Automatic word naming recognition for treatment and assessment of aphasia , 2012, INTERSPEECH.
[44] Jen-Tzung Chien,et al. Automatic speech recognition for acoustical analysis and assessment of Cantonese pathological voice and speech , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[45] Foad Hamidi,et al. CanSpeak: A Customizable Speech Interface for People with Dysarthric Speech , 2010, ICCHP.
[46] Siti Salwah Salim,et al. Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker , 2015, Expert Syst. Appl..
[47] R. Teasell,et al. Rehabilitation of Aphasia: More Is Better , 2003, Topics in stroke rehabilitation.
[48] Alexandre Allauzen,et al. Using Dynamic Time Warping to Compute Prosodic Similarity Measures , 2011, INTERSPEECH.
[49] Katarina L Haley,et al. Toward a quantitative basis for assessment and diagnosis of apraxia of speech. , 2012, Journal of speech, language, and hearing research : JSLHR.
[50] Heidi Christensen,et al. Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech , 2013, INTERSPEECH.
[51] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[52] Haipeng Wang,et al. Analysis of auto-aligned and auto-segmented oral discourse by speakers with aphasia: A preliminary study on the acoustic parameter of duration. , 2013, Procedia, social and behavioral sciences.
[53] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[54] George Saon,et al. Speaker adaptation of neural network acoustic models using i-vectors , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[55] Prisca Stenneken,et al. Diagnosing residual aphasia using spontaneous speech analysis , 2012 .
[56] Tomi Kinnunen,et al. i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[57] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[58] Andrew W. Senior,et al. Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.
[59] Dimitra Vergyri,et al. Learning diagnostic models using speech and language measures , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.
[60] Andrew W. Senior,et al. Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.
[61] B. Rockstroh,et al. Long-Term Stability of Improved Language Functions in Chronic Aphasia After Constraint-Induced Aphasia Therapy , 2005, Stroke.
[62] Haihua Xu,et al. Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[63] Seyed Omid Sadjadi,et al. Speaker age estimation on conversational telephone speech using senone posterior based i-vectors , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[64] Kenneth Ward Church,et al. Deep neural network features and semi-supervised training for low resource speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[65] Steve J. Young,et al. Phone-level pronunciation scoring and assessment for interactive language learning , 2000, Speech Commun..
[66] M. Schwartz,et al. Semantic Factors in Verb Retrieval: An Effect of Complexity , 1998, Brain and Language.
[67] Björn W. Schuller,et al. The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing , 2016, IEEE Transactions on Affective Computing.
[68] Nick Miller,et al. Association between objective measurement of the speech intelligibility of young people with dysarthria and listener ratings of ease of understanding , 2014, International journal of speech-language pathology.
[69] M P Black,et al. Automatic Prediction of Children's Reading Ability for High-Level Literacy Assessment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[70] Emily Mower Provost,et al. Automatic analysis of speech quality for aphasia treatment , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[71] A. Kertesz. The Western Aphasia Battery , 1982 .
[72] C M Shewan,et al. Reliability and validity characteristics of the Western Aphasia Battery (WAB). , 1980, The Journal of speech and hearing disorders.
[73] James Carmichael,et al. A speech-controlled environmental control system for people with severe dysarthria. , 2007, Medical engineering & physics.
[74] Jasha Droppo,et al. Multi-task learning in deep neural networks for improved phoneme recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[75] Lyndsey Nickels,et al. Therapy for naming disorders: Revisiting, revising, and reviewing , 2002 .
[76] John H. L. Hansen,et al. UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[77] James R. Glass,et al. Pronunciation assessment via a comparison-based system , 2013, SLaTE.
[78] Tara N. Sainath,et al. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[79] Sree Hari Krishnan Parthasarathi,et al. Robust i-vector based adaptation of DNN acoustic model for speech recognition , 2015, INTERSPEECH.
[80] Gary Weismer,et al. Direct magnitude estimates of speech intelligibility in dysarthria: effects of a chosen standard. , 2002, Journal of speech, language, and hearing research : JSLHR.
[81] Mari Ostendorf,et al. ToBI: a standard for labeling English prosody , 1992, ICSLP.
[82] Martha Danly,et al. Speech prosody in Broca's aphasia , 1982, Brain and Language.
[83] Katarina L. Haley,et al. Temporal and spectral properties of voiceless fricatives in aphasia and apraxia of speech , 2002 .
[84] Larry Boles,et al. Conversational treatment in mild aphasia: A case study , 2009 .
[85] Dong Yu,et al. Large vocabulary continuous speech recognition with context-dependent DBN-HMMs , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[86] Patrick Kenny,et al. Eigenvoice modeling with sparse training data , 2005, IEEE Transactions on Speech and Audio Processing.
[87] Stephen J. Cox,et al. Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers , 2009, EURASIP J. Adv. Signal Process..
[88] Antonio Bonafonte,et al. Deep Neural Networks for i-Vector Language Identification of Short Utterances in Cars , 2016, INTERSPEECH.
[89] Ian H. Witten,et al. The WEKA data mining software: an update , 2009, SKDD.
[90] Serguei V. S. Pakhomov,et al. Computerized Analysis of Speech and Language to Identify Psycholinguistic Correlates of Frontotemporal Lobar Degeneration , 2010, Cognitive and behavioral neurology : official journal of the Society for Behavioral and Cognitive Neurology.
[91] Carlos Gussenhoven,et al. Durational variability in speech and the Rhythm Class Hypothesis , 2002 .
[92] Michael I. Jordan,et al. On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.
[93] Emily Mower Provost,et al. Improving Automatic Recognition of Aphasic Speech with AphasiaBank , 2016, INTERSPEECH.
[94] James R. Glass,et al. Feature-based Pronunciation Modeling for Speech Recognition , 2004, HLT-NAACL.
[95] Peter Bell,et al. Regularization of context-dependent deep neural networks with context-independent multi-task training , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[96] Fuhui Long,et al. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[97] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .
[98] R. Bastiaanse,et al. Analysing the spontaneous speech of aphasic speakers , 2004 .
[99] Florin Curelaru,et al. Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).
[100] Emily Mower Provost,et al. Automatic Paraphasia Detection from Aphasic Speech: A Preliminary Study , 2017, INTERSPEECH.
[101] Mark Hasegawa-Johnson,et al. State-Transition Interpolation and MAP Adaptation for HMM-based Dysarthric Speech Recognition , 2010, SLPAT@NAACL.
[102] Dong Yu,et al. Automatic Speech Recognition: A Deep Learning Approach , 2014 .
[103] Visar Berisha,et al. Modeling pathological speech perception from data with similarity labels , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[104] Mark Hasegawa-Johnson,et al. Acoustic model adaptation using in-domain background models for dysarthric speech recognition , 2013, Comput. Speech Lang..
[105] Margaret Forbes,et al. AphasiaBank: Methods for studying discourse , 2011, Aphasiology.
[106] James R. Glass,et al. Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[107] Daniel P. W. Ellis,et al. Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[108] Nick Miller,et al. Prevalence and pattern of perceived intelligibility changes in Parkinson’s disease , 2007, Journal of Neurology, Neurosurgery, and Psychiatry.
[109] Jack Gandour,et al. Dysprosody in Broca's aphasia: A case study , 1989, Brain and Language.
[110] James R. Glass,et al. A Comparison-based Approach to Mispronunciation Detection , 2012 .
[111] Georg Heigold,et al. Sequence discriminative distributed training of long short-term memory recurrent neural networks , 2014, INTERSPEECH.
[112] K. Willmes,et al. The psychometric properties of the English language version of the Aachen Aphasia Test (EAAT) , 2000 .
[113] K. Hacioglu,et al. TESTING SUPRASEGMENTAL ENGLISH THROUGH PARROTING , 2010 .
[114] Emily Mower Provost,et al. Modeling pronunciation, rhythm, and intonation for automatic assessment of speech quality in aphasia rehabilitation , 2014, INTERSPEECH.
[115] Eric Fosler-Lussier,et al. Articulatory feature-based pronunciation modeling , 2016, Comput. Speech Lang..
[116] Elmar Nöth,et al. Automatic scoring of the intelligibility in patients with cancer of the oral cavity , 2007, INTERSPEECH.
[117] Katharine H. Odell,et al. Perceptual characteristics of consonant production by apraxic speakers. , 1990, The Journal of speech and hearing disorders.
[118] P. Green,et al. STARDUST – Speech Training And Recognition for Dysarthric Users of Assistive Technology , 2003 .
[119] A. Kertesz,et al. The Aphasia Quotient: The Taxonomic Approach to Measurement of Aphasic Disability , 2004, Canadian Journal of Neurological Sciences / Journal Canadien des Sciences Neurologiques.
[120] Elmar Nöth,et al. Towards robust automatic evaluation of pathologic telephone speech , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).
[121] N. Miller. Measuring up to speech intelligibility. , 2013, International journal of language & communication disorders.
[122] F. Ramus,et al. Correlates of linguistic rhythm in the speech signal , 1999, Cognition.
[123] Shrikanth S. Narayanan,et al. Improvements in predicting children's overall reading ability by modeling variability in evaluators' subjective judgments , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[124] Andreas Stolcke,et al. SRILM at Sixteen: Update and Outlook , 2011 .
[125] G. Mcreddie. Aphasia , 1868, The Indian medical gazette.
[126] Martina Piefke,et al. Basic parameters of spontaneous speech as a sensitive method for measuring change during the course of aphasia. , 2008, International journal of language & communication disorders.
[127] Emily Mower Provost,et al. Automatic Assessment of Speech Intelligibility for Individuals With Aphasia , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[128] Phil D. Green,et al. Speech technology for e-inclusion of people with physical disabilities and disordered speech , 2005, INTERSPEECH.
[129] Naveen Kumar,et al. Automatic intelligibility classification of sentence-level pathological speech , 2015, Comput. Speech Lang..
[130] Kun Li,et al. Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[131] Dong Yu,et al. Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[132] Frank Rudzicz,et al. Automatic speech recognition in the diagnosis of primary progressive aphasia , 2013, SLPAT.
[133] Leora R Cherney,et al. Computerized script training for aphasia: preliminary results. , 2008, American journal of speech-language pathology.
[134] Frank Rudzicz,et al. Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech , 2011, Canadian Conference on AI.
[135] Andrew W. Senior,et al. Improving DNN speaker independence with I-vector inputs , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[136] Nicolas Côté. Integral and Diagnostic Intrusive Prediction of Speech Quality , 2011, T-Labs Series in Telecommunication Services.
[137] Swathi Kiran,et al. Effect of Verb Network Strengthening Treatment (VNeST) on lexical retrieval of content words in sentences in persons with aphasia , 2009, Aphasiology.
[138] R. Logie,et al. Age-of-acquisition, imagery, concreteness, familiarity, and ambiguity measures for 1,944 words , 1980 .
[139] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[140] Isabel Trancoso,et al. Automatic word naming recognition for an on-line aphasia treatment system , 2013, Comput. Speech Lang..
[141] Edward Gibson,et al. Inter-transcriber reliability for two systems of prosodic annotation: ToBI (Tones and Break Indices) and RaP (Rhythm and Pitch) , 2012 .
[142] S. Blumstein,et al. Production deficits in aphasia: A voice-onset time analysis , 1980, Brain and Language.
[143] Patrick Kenny,et al. Mixture of PLDA Models in i-vector Space for Gender-Independent Speaker Recognition , 2011, INTERSPEECH.
[144] Brigitte Rockstroh,et al. Intensive language training in the rehabilitation of chronic aphasia: Efficient training by laypersons , 2007, Journal of the International Neuropsychological Society.
[145] H. Goodglass. Boston diagnostic aphasia examination , 2013 .
[146] Heidi Christensen,et al. A comparative study of adaptive, automatic recognition of disordered speech , 2012, INTERSPEECH.
[147] Brian Roark,et al. Spoken Language Derived Measures for Detecting Mild Cognitive Impairment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[148] Frank Rudzicz,et al. Treatment intensity and childhood apraxia of speech. , 2015, International journal of language & communication disorders.
[149] Peter F. Halpin,et al. Online crowdsourcing for efficient rating of speech: a validation study. , 2015, Journal of communication disorders.
[150] Shrikanth S. Narayanan,et al. Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment , 2007, INTERSPEECH.
[151] Oscar Saz-Torralba,et al. Verifying pronunciation accuracy from speakers with neuromuscular disorders , 2008, INTERSPEECH.
[152] Heidi Christensen,et al. Learning speaker-specific pronunciations of disordered speech , 2013, INTERSPEECH.
[153] Frank Rudzicz,et al. Phonological features in discriminative classification of dysarthric speech , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[154] Marc Brys,et al. Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English , 2009 .
[155] Julie L. Wambaugh,et al. A Critical Review of Acoustic Analyses of Aphasic and/or Apraxic Speech , 1996 .
[156] Frank K. Soong,et al. A Two-Pass Framework of Mispronunciation Detection and Diagnosis for Computer-Aided Pronunciation Training , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[157] L. Murray,et al. Functional measures of naming in aphasia: Word retrieval in confrontation naming versus connected speech , 2003 .
[158] B Hallowell,et al. A multinational comparison of aphasia management practices. , 2000, International journal of language & communication disorders.
[159] R. Teasell,et al. Intensity of Aphasia Therapy, Impact on Recovery , 2003, Stroke.
[160] Khe Chai Sim,et al. Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems , 2010, INTERSPEECH.
[161] Yong Wang,et al. Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers , 2015, Speech Commun..