An efficient diagnosis system for detection of Parkinson's disease using fuzzy k-nearest neighbor approach

In this paper, we present an effective and efficient diagnosis system using fuzzy k-nearest neighbor (FKNN) for Parkinson's disease (PD) diagnosis. The proposed FKNN-based system is compared with the support vector machines (SVM) based approaches. In order to further improve the diagnosis accuracy for detection of PD, the principle component analysis was employed to construct the most discriminative new feature sets on which the optimal FKNN model was constructed. The effectiveness of the proposed system has been rigorously estimated on a PD data set in terms of classification accuracy, sensitivity, specificity and the area under the receiver operating characteristic (ROC) curve (AUC). Experimental results have demonstrated that the FKNN-based system greatly outperforms SVM-based approaches and other methods in the literature. The best classification accuracy (96.07%) obtained by the FKNN-based system using a 10-fold cross validation method can ensure a reliable diagnostic model for detection of PD. Promisingly, the proposed system might serve as a new candidate of powerful tools for diagnosing PD with excellent performance.

[1]  Jack J. Jiang,et al.  Phonatory impairment in Parkinson's disease: evidence from nonlinear dynamic analysis and perturbation analysis. , 2007, Journal of voice : official journal of the Voice Foundation.

[2]  Olcay Kursun,et al.  Telediagnosis of Parkinson’s Disease Using Measurements of Dysphonia , 2010, Journal of Medical Systems.

[3]  Neha Singh,et al.  Advances in the treatment of Parkinson's disease , 2007, Progress in Neurobiology.

[4]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[5]  Dayou Liu,et al.  A Computer Aided Diagnosis System for Thyroid Disease Using Extreme Learning Machine , 2012, Journal of Medical Systems.

[6]  Theodoros Damoulas,et al.  Multiclass Relevance Vector Machines: Sparsity and Accuracy , 2010, IEEE Transactions on Neural Networks.

[7]  Gang Wang,et al.  An Adaptive Fuzzy k-Nearest Neighbor Method Based on Parallel Particle Swarm Optimization for Bankruptcy Prediction , 2011, PAKDD.

[8]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[9]  David G. Stork,et al.  Pattern Classification , 1973 .

[10]  Dayou Liu,et al.  Design of an Enhanced Fuzzy k-nearest Neighbor Classifier Based Computer Aided Diagnostic System for Thyroid Disease , 2012, Journal of Medical Systems.

[11]  Lindsay I. Smith,et al.  A tutorial on Principal Components Analysis , 2002 .

[12]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[13]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[14]  Babak Shahbaba,et al.  Nonlinear Models Using Dirichlet Process Mixtures , 2007, J. Mach. Learn. Res..

[15]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[16]  T. Warren Liao,et al.  Two manufacturing applications of the fuzzy K-NN algorithm , 1997, Fuzzy Sets Syst..

[17]  R. Iansek,et al.  Speech impairment in a large sample of patients with Parkinson's disease. , 1998, Behavioural neurology.

[18]  Ying Huang,et al.  Prediction of protein subcellular locations using fuzzy k-NN method , 2004, Bioinform..

[19]  Dayou Liu,et al.  A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis , 2011, Expert Syst. Appl..

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Paul Scheunders,et al.  Genetic feature selection combined with composite fuzzy nearest neighbor classifiers for hyperspectral satellite imagery , 2002, Pattern Recognit. Lett..

[22]  Arif Gülten,et al.  Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms , 2011, Comput. Methods Programs Biomed..

[23]  Gang Wang,et al.  Support Vector Machine Based Diagnostic System for Breast Cancer Using Swarm Intelligence , 2012, Journal of Medical Systems.

[24]  Tom Fawcett,et al.  ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[25]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[26]  Raouf N. Gorgui-Naguib,et al.  A fuzzy logic based-method for prognostic decision making in breast and prostate cancers , 2003, IEEE Transactions on Information Technology in Biomedicine.

[27]  Gang Wang,et al.  A novel bankruptcy prediction model based on an adaptive fuzzy k-nearest neighbor method , 2011, Knowl. Based Syst..

[28]  João Paulo Papa,et al.  Improving Parkinson's disease identification through evolutionary-based feature selection , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[29]  T. Warren Liao,et al.  Medical data mining by fuzzy modeling with selected features , 2008, Artif. Intell. Medicine.

[30]  Resul Das,et al.  A comparison of multiple classification methods for diagnosis of Parkinson disease , 2010, Expert Syst. Appl..

[31]  Gang Wang,et al.  A new hybrid method based on local fisher discriminant analysis and support vector machines for hepatitis disease diagnosis , 2011, Expert Syst. Appl..

[32]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[33]  Freddie Åström,et al.  A parallel neural network approach to prediction of Parkinson's Disease , 2011, Expert Syst. Appl..

[34]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[35]  Pasi Luukka,et al.  Feature selection using fuzzy entropy measures with similarity classifier , 2011, Expert Syst. Appl..

[36]  Nawwaf N. Kharma,et al.  Advances in Detecting Parkinson's Disease , 2010, ICMB.

[37]  Cemal Köse,et al.  A Statistical Segmentation Method for Measuring Age-Related Macular Degeneration in Retinal Fundus Images , 2010, Journal of Medical Systems.

[38]  Der-Chiang Li,et al.  A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets , 2011, Artif. Intell. Medicine.

[39]  Seung-Yeon Kim,et al.  Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method , 2005, Bioinform..