A Tool for Training Speech Imitation Accuracy

Dysarthria is a neurological motor speech disorder that commonly results in reduced intelligibility. Communication partners can learn to better understand the speech of someone with dysarthria through perceptual training. Vocal imitation of the degraded speech during perceptual training has been shown to elevate this learning. A tool that provides the learner with real-time feedback regarding the accuracy of their imitation attempts during training may further enhance this learning. We describe a training tool that compares dysarthric speech productions with the imitation attempts of healthy subjects, using a two-level dynamic warp that accounts for both spectral and temporal degradation. Feature vectors derived from both the spectrogram and LPC are examined.

[1]  Stephanie A Borrie,et al.  Effects of Lexical and Somatosensory Feedback on Long-Term Improvements in Intelligibility of Dysarthric Speech. , 2017, Journal of speech, language, and hearing research : JSLHR.

[2]  Stephanie A Borrie,et al.  Familiarisation conditions and the mechanisms that underlie improved recognition of dysarthric speech , 2012, Language and cognitive processes.

[3]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[4]  Stephanie A Borrie,et al.  The Role of Somatosensory Information in Speech Perception: Imitation Improves Recognition of Disordered Speech. , 2015, Journal of speech, language, and hearing research : JSLHR.

[5]  Stephanie A. Borrie,et al.  The role of linguistic and indexical information in improved recognition of dysarthric speech. , 2011, The Journal of the Acoustical Society of America.

[6]  J. W. Black,et al.  Multiple-choice intelligibility tests. , 1957, The Journal of speech and hearing disorders.

[7]  Kari Pulli,et al.  Style translation for human motion , 2005, SIGGRAPH 2005.

[8]  Stephanie A Borrie,et al.  Use of Crowdsourcing to Assess the Ecological Validity of Perceptual-Training Paradigms in Dysarthria. , 2016, American journal of speech-language pathology.

[9]  Laurent Itti,et al.  shapeDTW: Shape Dynamic Time Warping , 2016, Pattern Recognit..

[10]  Georgios Evangelidis,et al.  Continuous Action Recognition Based on Sequence Alignment , 2014, International Journal of Computer Vision.