Systematic review of virtual speech therapists for speech disorders

A systematic review on virtual speech therapists (VSTs) was presented.The comparison of VSTs and traditional speech therapy was discussed.We analyzed intervention methods used in VSTs such as articulation therapy.Hearing impairments was observed as the most frequent disorder targeted by VSTs.We explored 3D virtual heads and games as efficient therapy delivery approaches. In this paper, a systematic review of relevant published studies on computer-based speech therapy systems or virtual speech therapists (VSTs) for people with speech disorders is presented. We structured this work based on the PRISMA framework. The advancements in speech technology and the increased number of successful real-world projects in this area point to a thriving market for VSTs in the near future; however, there is no standard roadmap to pinpoint how these systems should be designed, implemented, customized, and evaluated with respect to the various speech disorders. The focus of this systematic review is on articulation and phonological impairments. This systematic review addresses three research questions: what types of articulation and phonological disorders do VSTs address, how effective are virtual speech therapists, and what technological elements have been utilized in VST projects. The reviewed papers were sourced from comprehensive digital libraries, and were published in English between 2004 and 2014. All the selected studies involve computer-based intervention in the form of a VST regarding articulation or phonological impairments, followed by qualitative and/or quantitative assessments. To generate this review, we encountered several challenges. Studies were heterogeneous in terms of disorders, type and frequency of therapy, sample size, level of functionality, etc. Thus, overall conclusions were difficult to draw. Commonly, publications with rigorous study designs did not describe the technical elements used in their VST, and publications that did describe technical elements had poor study designs. Despite this heterogeneity, the selected studies reported the effectiveness of computers as a more engaging type of intervention with more tools to enrich the intervention programs, particularly when it comes to children; however, it was emphasized that virtual therapists should not drive the intervention but must be used as a medium to deliver the intervention planned by speech-language pathologists. Based on the reviewed papers, VSTs are significantly effective in training people with a variety of speech disorders; however, it cannot be claimed that a consensus exists in the superiority of VSTs over speech-language pathologists regarding rehabilitation outcomes. Our review shows that hearing-impaired cases were the most frequently addressed disorder in the reviewed studies. Automatic speech recognition, speech corpus, and speech synthesizers were the most popular technologies used in the VSTs.

[1]  Gail T. Gillon,et al.  Phonological Awareness: From Research to Practice , 2003 .

[2]  Joy F. Rosenberg,et al.  Discrimination training of phonemic contrasts enhances phonological processing in mainstream school children , 2005, Brain and Language.

[3]  Michelle L Byrnes,et al.  Acceptance and Commitment Therapy for adults who stutter: psychosocial adjustment and speech fluency. , 2012, Journal of fluency disorders.

[4]  Yvonne Wren,et al.  A comparison between computer and tabletop delivery of phonology therapy , 2008, International journal of speech-language pathology.

[5]  Diane Austin,et al.  The percentage of consonants correct (PCC) metric: extensions and reliability data. , 1997, Journal of speech, language, and hearing research : JSLHR.

[6]  Eliane Segers,et al.  Computer-supported phonological awareness intervention for kindergarten children with specific language impairment. , 2004, Language, speech, and hearing services in schools.

[7]  M. Wild,et al.  Using computer‐aided instruction to support the systematic practice of phonological skills in beginning readers , 2009 .

[8]  Stefan Gheorghe Pentiuc,et al.  Improving Computer Based Speech Therapy Using a Fuzzy Expert System , 2010, Comput. Informatics.

[9]  Isabel Trancoso,et al.  Automatic word naming recognition for an on-line aphasia treatment system , 2013, Comput. Speech Lang..

[10]  S. Rodger,et al.  Comparative content review of children's participation measures using the International Classification of Functioning, Disability and Health-Children and Youth. , 2014, Archives of physical medicine and rehabilitation.

[11]  Sascha Fagel,et al.  A 3-d virtual head as a tool for speech therapy for children , 2008, INTERSPEECH.

[12]  Uta Frith,et al.  Phonological Assessment Battery (Manual and Test Materials) , 1997 .

[13]  Jennifer George,et al.  Developing multimedia interfaces for speech therapy , 2010, Universal Access in the Information Society.

[14]  Oscar Saz-Torralba,et al.  Tools and Technologies for Computer-Aided Speech and Language Therapy , 2009, Speech Commun..

[15]  Anita R. Bowles,et al.  Technologies for foreign language learning: a review of technology types and their effectiveness , 2014 .

[16]  S. Mcleod,et al.  A systematic review of the association between childhood speech impairment and participation across the lifespan , 2009 .

[17]  J Bamford,et al.  The BKB (Bamford-Kowal-Bench) sentence lists for partially-hearing children. , 1979, British journal of audiology.

[18]  Tonya N. Davis,et al.  Using iPods(®) and iPads(®) in teaching programs for individuals with developmental disabilities: a systematic review. , 2013, Research in developmental disabilities.

[19]  Curt D. Furberg,et al.  Approaches to data analyses of clinical trials. , 2012, Progress in cardiovascular diseases.

[20]  Ratchadawan Nimnual,et al.  Interactive multimedia courseware of vowel training for the hearing impaired , 2010, ICCAS 2010.

[21]  Ramanjot Kang,et al.  A Sensitive Period for the Impact of Hearing Loss on Auditory Perception , 2014, The Journal of Neuroscience.

[22]  Björn Granström,et al.  Visualization of speech and audio for hearing-impaired persons , 2008 .

[23]  Yi-Ping Phoebe Chen,et al.  Acoustic feature selection for automatic emotion recognition from speech , 2009, Inf. Process. Manag..

[24]  Maria Cecília Bevilacqua,et al.  Software use in the (re)habilitation of hearing impaired children. , 2012, Jornal da Sociedade Brasileira de Fonoaudiologia.

[25]  M. S. Hawley,et al.  A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment , 2013, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[26]  M. Ferguson,et al.  Efficacy of Individual Computer-Based Auditory Training for People with Hearing Loss: A Systematic Review of the Evidence , 2013, PloS one.

[27]  Birgit May-Mederake Early intervention and assessment of speech and language development in young children with cochlear implants. , 2012, International journal of pediatric otorhinolaryngology.

[28]  Olle Bälter,et al.  Design Recommendations for a Computer-Based Speech Training System Based on End User Interviews , 2005 .

[29]  Yi-Ping Phoebe Chen,et al.  Model-based approach to spatial-temporal sampling of video clips for video object detection by classification , 2014, J. Vis. Commun. Image Represent..

[30]  A. R. Tyler Expert Systems Research Trends , 2007 .

[31]  Rafael Prikladnicki,et al.  Speech Recognition for Voice-Based Machine Translation , 2014, IEEE Software.

[32]  Yvonne E Wren,et al.  Computer-Based Interventions , 2010 .

[33]  B. Berg Qualitative Research Methods for the Social Sciences , 1989 .

[34]  Jianping Fan,et al.  Incorporating feature hierarchy and boosting to achieve more effective classifier training and concept-oriented video summarization and skimming , 2008, TOMCCAP.

[35]  Yi-Ping Phoebe Chen,et al.  A scalable and extensible segment-event-object-based sports video retrieval system , 2008, TOMCCAP.

[36]  Amanda Hall,et al.  Early hearing loss and language abilities in children with Down syndrome. , 2014, International journal of language & communication disorders.

[37]  Joanna Light,et al.  Using visible speech to train perception and production of speech for individuals with hearing loss. , 2004, Journal of speech, language, and hearing research : JSLHR.

[38]  Siti Salwah Salim,et al.  A systematic review of strategies and computer-based intervention (CBI) for reading comprehension of children with autism , 2013 .

[39]  Ronald A. Cole,et al.  My science tutor: A conversational multimedia virtual tutor for elementary school science , 2011, TSLP.

[40]  Paula C. Stacey,et al.  Effectiveness of computer-based auditory training for adult users of cochlear implants , 2010, International journal of audiology.

[41]  Sarel van Vuuren,et al.  A virtual speech therapist for individuals with parkinson's disease , 2006 .

[42]  Helene Lidström,et al.  Benefits of the use of ICT in school activities by students with motor, speech, visual, and hearing impairment: A literature review , 2014, Scandinavian journal of occupational therapy.

[43]  C. Thompson,et al.  Sentactics®: Computer-automated treatment of underlying forms , 2010, Aphasiology.

[44]  N. Bhattacharyya,et al.  The prevalence of voice problems among adults in the United States , 2014, The Laryngoscope.

[45]  D. Moher,et al.  CONSORT 2010 Statement: updated guidelines for reporting parallel group randomised trials , 2010, Journal of clinical epidemiology.

[46]  Olle Bälter,et al.  Designing the user interface of the computer-based speech training system ARTUR based on early user tests , 2006, Behav. Inf. Technol..

[47]  C. Geula,et al.  Asymmetry and heterogeneity of Alzheimer's and frontotemporal pathology in primary progressive aphasia. , 2014, Brain : a journal of neurology.

[48]  Sarel van Vuuren,et al.  A Virtual Therapist for Speech and Language Therapy , 2014, IVA.

[49]  J. Ioannidis,et al.  The PRISMA Statement for Reporting Systematic Reviews and Meta-Analyses of Studies That Evaluate Health Care Interventions: Explanation and Elaboration , 2009, Annals of Internal Medicine [serial online].

[50]  Sharynne McLeod,et al.  "When he's around his brothers … he's not so quiet": the private and public worlds of school-aged children with speech sound disorder. , 2013, Journal of communication disorders.

[51]  Pekka Leskinen,et al.  Combining biogas LCA reviews with stakeholder interviews to analyse life cycle impacts at a practical level , 2014 .