Bayesian changepoint detection for the automatic assessment of fluency and articulatory disorders

The accurate changepoint detection of different signal segments is a frequent challenge in a wide range of applications. With regard to speech utterances, the changepoints are related to significant spectral changes, mostly represented by the borders between two phonemes. The main aim of this study is to design a novel Bayesian autoregressive changepoint detector (BACD) and test its feasibility in the evaluation of fluency and articulatory disorders. The originality of the proposed method consists in its normalizing of a posteriori probability using Bayesian evidence and designing a recursive algorithm for reliable practice. For further evaluation of the BACD, we used data from (a) 118 people with various severity of stuttering to assess the extent of speech disfluency using a short reading passage, and (b) 24 patients with early Parkinson's disease and 22 healthy speakers for evaluation of articulation accuracy using fast syllable repetition. Subsequently, we designed two measures for each type of disorder. While speech disfluency has been related to greater distances between spectral changes, inaccurate dysarthric articulation has instead been associated with lower spectral changes. These findings have been confirmed by statistically significant differences, which were achieved in separating several degrees of disfluency and distinguishing healthy from parkinsonian speakers. In addition, a significant correlation was found between the automatic assessment of speech fluency and the judgment of human experts. In conclusion, the method proposed provides a cost-effective, easily applicable and freely available evaluation of speech disorders, as well as other areas requiring reliable techniques for changepoint detection. In a more modest scope, BACD may be used in diagnosis of disease severity, monitoring treatment, and support for therapist evaluation.

[1]  Marek Wisniewski,et al.  Automatic Detection of Disorders in a Continuous Speech with the Hidden Markov Models Approach , 2008, Computer Recognition Systems 2.

[2]  Ulrich Appel,et al.  Adaptive sequential segmentation of piecewise stationary time series , 1983, Inf. Sci..

[3]  Chung-Hsien Wu,et al.  Automatic assessment of articulation disorders using confident unit-based model adaptation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Sazali Yaacob,et al.  Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques , 2012, Journal of Medical Systems.

[5]  Neha Singh,et al.  Advances in the treatment of Parkinson's disease , 2007, Progress in Neurobiology.

[6]  P. Bergl Accuracy of Divergence Measures for Detection of Abrupt Changes , 2008 .

[7]  Edward G. Conture,et al.  Stuttering: Its Nature, Diagnosis and Treatment , 2000 .

[8]  G. Riley A stuttering severity instrument for children and adults. , 1972, The Journal of speech and hearing disorders.

[9]  Elmar Nöth,et al.  PEAKS - A system for the automatic evaluation of voice and speech disorders , 2009, Speech Commun..

[10]  Jean-Marc Boite,et al.  SPEAKER TRACKING IN BROADCAST AUDIO MATERIAL IN THE FRAMEWORK OF THE THISL PROJECT , 1999 .

[11]  Elizabeth C. Botha,et al.  An acoustic distance measure for automatic cross-language phoneme mapping , 2001 .

[12]  Hervé Bourlard,et al.  Robust speaker change detection , 2004, IEEE Signal Processing Letters.

[13]  E. Růžička,et al.  Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson's disease. , 2011, The Journal of the Acoustical Society of America.

[14]  Meysam Asgari,et al.  Predicting severity of Parkinson's disease from speech , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[15]  Miguel Angel Ferrer-Ballester,et al.  Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  E. Yairi,et al.  Early childhood stuttering I: persistency and recovery rates. , 1999, Journal of speech, language, and hearing research : JSLHR.

[17]  Robert Lund,et al.  A Review and Comparison of Changepoint Detection Techniques for Climate Data , 2007 .

[18]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[19]  Roman Cmejla,et al.  Acoustic assessment of voice and speech disorders in Parkinson's disease through quick vocal test , 2011, Movement disorders : official journal of the Movement Disorder Society.

[20]  K OrJ Numerical Bayesian methods applied to signal processing , 1996 .

[21]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[22]  J. van Borsel,et al.  Delayed auditory feedback in the treatment of stuttering: clients as consumers. , 2003, International journal of language & communication disorders.

[23]  Jonas Beskow,et al.  Wavesurfer - an open source speech tool , 2000, INTERSPEECH.

[24]  Pavel Sovka,et al.  Estimation of Boundaries between Speech Units Using Bayesian Changepoint Detectors , 2001, TSD.

[25]  P. Chu,et al.  Bayesian Change-Point Analysis of Tropical Cyclone Activity: The Central North Pacific Case* , 2004 .

[26]  Jean-Pierre Martens,et al.  Objective intelligibility assessment of pathological speakers , 2008, INTERSPEECH.

[27]  R. Ruben Redefining the Survival of the Fittest: Communication Disorders in the 21st Century , 2000, The Laryngoscope.

[28]  Oktay Üreten,et al.  Bayesian detection of radio transmitter turn-on transients , 1999, NSIP.

[29]  Pavel Sovka,et al.  Recursive Bayesian autoregressive changepoint detector for sequential signal segmentation , 2004, 2004 12th European Signal Processing Conference.

[30]  J. Dartigues,et al.  Prevalence of parkinsonism and Parkinson's disease in Europe: the EUROPARKINSON Collaborative Study. European Community Concerted Action on the Epidemiology of Parkinson's disease. , 1997, Journal of neurology, neurosurgery, and psychiatry.

[31]  Elmar Nöth,et al.  Automatic stuttering recognition using hidden Markov models , 2000, INTERSPEECH.

[32]  A. Goberman,et al.  Characteristics of speech disfluency in Parkinson disease , 2010, Journal of Neurolinguistics.

[33]  Michèle Basseville,et al.  Detection of abrupt changes: theory and application , 1993 .

[34]  Max A. Little,et al.  Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson's disease symptom severity , 2011, Journal of The Royal Society Interface.

[35]  A. Aronson,et al.  Clusters of deviant speech dimensions in the dysarthrias. , 1969, Journal of speech and hearing research.

[36]  Elmar Nöth,et al.  Automatic intelligibility assessment of speakers after laryngeal cancer by means of acoustic modeling. , 2012, Journal of voice : official journal of the Voice Foundation.

[37]  Heung Wong,et al.  Change-point analysis of hydrological time series using grey relational method , 2006 .

[38]  Raymond D. Kent,et al.  Parametric quantitative acoustic analysis of conversation produced by speakers with dysarthria and healthy speakers. , 2006, Journal of speech, language, and hearing research : JSLHR.

[39]  Etienne Barnard,et al.  Phone clustering using the Bhattacharyya distance , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[40]  P. Snyder,et al.  Acoustic characteristics of Parkinsonian speech: a potential biomarker of early disease progression and treatment , 2004, Journal of Neurolinguistics.

[41]  A. Prochazka,et al.  Wavelet transform use for feature extraction and EEG signal segments classification , 2008, 2008 3rd International Symposium on Communications, Control and Signal Processing.

[42]  Adam P Vogel,et al.  Measures of spectral change and their application to habitual, slow, and clear speaking modes , 2011, International journal of speech-language pathology.

[43]  A. Aronson,et al.  Differential diagnostic patterns of dysarthria. , 1969, Journal of speech and hearing research.

[44]  Raymond D. Kent,et al.  Toward phonetic intelligibility testing in dysarthria. , 1989, The Journal of speech and hearing disorders.

[45]  Douglas M. Hawkins,et al.  Statistical Process Control for Shifts in Mean or Variance Using a Changepoint Formulation , 2005, Technometrics.

[46]  P. Dejonckere,et al.  A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques , 2001, European Archives of Oto-Rhino-Laryngology.

[47]  Penny Jennett,et al.  The effect of information technology on the physician workforce and health care in isolated communities: The Canadian picture , 1999, Journal of telemedicine and telecare.

[48]  O. Hornykiewicz Biochemical aspects of Parkinson's disease , 1998, Neurology.

[49]  Raymond D. Kent,et al.  Acoustic studies of dysarthric speech: methods, progress, and potential. , 1999, Journal of communication disorders.

[50]  Hermann Ackermann,et al.  The Temporal Control of Repetitive Articulatory Movements in Parkinson's Disease , 1997, Brain and Language.

[51]  I. Shafran,et al.  Extracting cues from speech for predicting severity of Parkinson'S disease , 2010, 2010 IEEE International Workshop on Machine Learning for Signal Processing.

[52]  J. Scargle,et al.  Change-point detection in astronomical data by using a hierarchical model and a bayesian sampling approach , 2005, IEEE/SP 13th Workshop on Statistical Signal Processing, 2005.

[53]  L. Hartelius,et al.  Speech and swallowing symptoms associated with Parkinson's disease and multiple sclerosis: a survey. , 1994, Folia phoniatrica et logopaedica : official organ of the International Association of Logopedics and Phoniatrics.

[54]  Ann Packman,et al.  The Lidcombe Behavioral Data Language of stuttering. , 2003, Journal of speech, language, and hearing research : JSLHR.

[55]  Jennifer L. Spielman,et al.  Formant centralization ratio: a proposal for a new acoustic measure of dysarthric speech. , 2010, Journal of speech, language, and hearing research : JSLHR.

[56]  R. Cmejla,et al.  Improved detection of boundaries of phonemes in speech databases , 2007 .

[57]  B. Western,et al.  A Bayesian Change Point Model for Historical Time Series Analysis , 2004, Political Analysis.

[58]  M. Hirano,et al.  Clinical Examination of Voice , 1981 .

[59]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors , 2004, IEEE Transactions on Biomedical Engineering.

[60]  C. Newman,et al.  The Voice Handicap Index (VHI)Development and Validation , 1997 .