Segment-dependent dynamics in predicting parkinson's disease

Early, accurate detection of Parkinson’s disease may aid in possible intervention and rehabilitation. Thus, simple noninvasive biomarkers are desired for determining severity. In this study, a novel set of acoustic speech biomarkers are introduced and fused with conventional features for predicting clinical assessment of Parkinson’s disease. We introduce acoustic biomarkers reflecting the segment dependence of changes in speech production components, motivated by disturbances in underlying neural motor, articulatory, and prosodic brain centers of speech. Such changes occur at phonetic and larger time scales, including multi-scale perturbations in formant frequency and pitch trajectories, in phoneme durations and their frequency of occurrence, and in temporal waveform structure. We also introduce articulatory features based on a neural computational model of speech production, the Directions into Velocities of Articulators (DIVA) model. The database used is from the Interspeech 2015 Computational Paralinguistic Challenge. By fusing conventional and novel speech features, we obtain Spearman correlations between predicted scores and clinical assessments of r = 0.63 on the training set (four-fold cross validation), r = 0.70 on a held-out development set, and r = 0.97 on a heldout test set.

[1]  K. Tjaden,et al.  Perceptual measures of speech from individuals with Parkinson's disease and multiple sclerosis: intelligibility and beyond. , 2012, Journal of speech, language, and hearing research : JSLHR.

[2]  E. Růžička,et al.  Imprecise vowel articulation as a potential early marker of Parkinson's disease: effect of speaking task. , 2013, The Journal of the Acoustical Society of America.

[3]  S G HOBERMAN Speech techniques in aphasia and parkinsonism. , 1958, Journal - Michigan State Medical Society.

[4]  J. Logemann,et al.  Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of Parkinson patients. , 1978, The Journal of speech and hearing disorders.

[5]  S. Skodda,et al.  Speech rate and rhythm in Parkinson's disease , 2008, Movement disorders : official journal of the Movement Disorder Society.

[6]  G. Canter SPEECH CHARACTERISTICS OF PATIENTS WITH PARKINSON'S DISEASE. 3. ARTICULATION, DIADOCHOKINESIS, AND OVER-ALL SPEECH ADEQUACY. , 1965, The Journal of speech and hearing disorders.

[7]  Björn W. Schuller,et al.  Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.

[8]  Thomas Sporer,et al.  PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[9]  G. Canter SPEECH CHARACTERISTICS OF PATIENTS WITH PARKINSON'S DISEASE. II. PHYSIOLOGICAL SUPPORT FOR SPEECH. , 1965, The Journal of speech and hearing disorders.

[10]  Hoberman Sg,et al.  Speech techniques in aphasia and parkinsonism. , 1958 .

[11]  Thomas F. Quatieri,et al.  Prediction of cognitive performance in an animal fluency task based on rate and articulatory markers , 2014, INTERSPEECH.

[12]  Thomas F. Quatieri,et al.  Noise reduction based on spectral change , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[13]  S. Skodda Aspects of speech rate and regularity in Parkinson's disease , 2011, Journal of the Neurological Sciences.

[14]  Thomas F. Quatieri,et al.  Vocal and Facial Biomarkers of Depression based on Motor Incoordination and Timing , 2014, AVEC '14.

[15]  Thomas F. Quatieri,et al.  Articulatory dynamics and coordination in classifying cognitive change with preclinical mTBI , 2014, INTERSPEECH.

[16]  P. Wolfe,et al.  Kalman-based autoregressive moving average modeling and inference for formant and antiformant tracking a ) , 2011 .

[17]  Timothy J. Hazen,et al.  A comparison of query-by-example methods for spoken term detection , 2009, INTERSPEECH.

[18]  James R. Williamson,et al.  Seizure prediction using EEG spatiotemporal correlation structure , 2012, Epilepsy & Behavior.

[19]  J. Logemann,et al.  Vocal Tract Control in Parkinson's Disease , 1981 .

[20]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[21]  Jesús Francisco Vargas-Bonilla,et al.  New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease , 2014, LREC.

[22]  Thomas F. Quatieri,et al.  Phonologically-based biomarkers for major depressive disorder , 2011, EURASIP J. Adv. Signal Process..

[23]  Yannis Stylianou,et al.  Unsupervised Acoustic Analyses of Normal and Lombard Speech, with Spectral Envelope Transformation to Improve Intelligibility , 2012, INTERSPEECH.

[24]  G. Canter SPEECH CHARACTERISTICS OF PATIENTS WITH PARKINSON'S DISEASE: I. INTENSITY, PITCH, AND DURATION. , 1963, The Journal of speech and hearing disorders.

[25]  Thomas F. Quatieri,et al.  Vocal biomarkers of depression based on motor incoordination , 2013, AVEC@ACM Multimedia.

[26]  Satrajit S. Ghosh,et al.  Neural modeling and imaging of the cortical interactions underlying syllable production , 2006, Brain and Language.

[27]  Robert B. Dunn,et al.  Speech enhancement based on auditory spectral change , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[28]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[29]  J. Jankovic,et al.  Movement Disorder Society‐sponsored revision of the Unified Parkinson's Disease Rating Scale (MDS‐UPDRS): Scale presentation and clinimetric testing results , 2008, Movement disorders : official journal of the Movement Disorder Society.

[30]  Elmar Nöth,et al.  The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition , 2015, INTERSPEECH.