Telediagnosis of Parkinson’s Disease Using Measurements of Dysphonia

Parkinson’s disease (PD) is a neurological illness which impairs motor skills, speech, and other functions such as mood, behavior, thinking, and sensation. It causes vocal impairment for approximately 90% of the patients. As the symptoms of PD occur gradually and mostly targeting the elderly people for whom physical visits to the clinic are inconvenient and costly, telemonitoring of the disease using measurements of dysphonia (vocal features) has a vital role in its early diagnosis. Such dysphonia features extracted from the voice come in variety and most of them are interrelated. The purpose of this study is twofold: (1) to select a minimal subset of features with maximal joint relevance to the PD-score, a binary score indicating whether or not the sample belongs to a person with PD; and (2) to build a predictive model with minimal bias (i.e. to maximize the generalization of the predictions so as to perform well with unseen test examples). For these tasks, we apply the mutual information measure with the permutation test for assessing the relevance and the statistical significance of the relations between the features and the PD-score, rank the features according to the maximum-relevance-minimum-redundancy (mRMR) criterion, use a Support Vector Machine (SVM) for building a classification model and test it with a more suitable cross-validation scheme that we called leave-one-individual-out that fits with the dataset in hand better than the conventional bootstrapping or leave-one-out validation methods.

[1]  C. Brayne,et al.  A systematic review of depression and mental illness preceding Parkinson's disease , 2006, Acta neurologica Scandinavica.

[2]  Penny Jennett,et al.  The effect of information technology on the physician workforce and health care in isolated communities: The Canadian picture , 1999, Journal of telemedicine and telecare.

[3]  Miguel Figueroa,et al.  Competitive learning with floating-gate circuits , 2002, IEEE Trans. Neural Networks.

[4]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[5]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[6]  Chris H. Q. Ding,et al.  Minimum redundancy feature selection from microarray gene expression data , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[7]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007 .

[8]  Markus Neuhäuser,et al.  Permutation Tests , 2011, International Encyclopedia of Statistical Science.

[9]  Regina Y. Liu Bootstrap Procedures under some Non-I.I.D. Models , 1988 .

[10]  Chong-Ho Choi,et al.  Input Feature Selection by Mutual Information Based on Parzen Window , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Francisco Azuaje,et al.  Genomic data sampling and its effect on classification performance assessment , 2003, BMC Bioinformatics.

[12]  Changbao Wu,et al.  Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[13]  Annette M. Molinaro,et al.  Prediction error estimation: a comparison of resampling methods , 2005, Bioinform..

[14]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[15]  J. Jankovic Parkinson’s disease: clinical features and diagnosis , 2008, Journal of Neurology, Neurosurgery, and Psychiatry.

[16]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[17]  Juha Reunanen,et al.  Overfitting in Making Comparisons Between Variable Selection Methods , 2003, J. Mach. Learn. Res..

[18]  R Iansek,et al.  Speech impairment in a large sample of patients with Parkinson's disease. , 1999, Behavioural neurology.

[19]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[20]  Jack J. Jiang,et al.  Phonatory impairment in Parkinson's disease: evidence from nonlinear dynamic analysis and perturbation analysis. , 2007, Journal of voice : official journal of the Voice Foundation.

[21]  D. Huse,et al.  Burden of illness in Parkinson's disease , 2005, Movement disorders : official journal of the Movement Disorder Society.

[22]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors , 2004, IEEE Transactions on Biomedical Engineering.