Impact of Patients’ Gender on Parkinson’s disease using Classification Algorithms

In this paper the accuracy of two machine learning algorithms including SVM and Bayesian Network are investigated as two important algorithms in diagnosis of Parkinson’s disease. We use Parkinson's disease data in the University of California, Irvine (UCI). In order to optimize the SVM algorithm, different kernel functions and C parameters have been used and our results show that SVM with C parameter (C-SVM) with average of 99.18% accuracy with Polynomial Kernel function in testing step, has better performance compared to the other Kernel functions such as RBF and Sigmoid as well as Bayesian Network algorithm. It is also shown that ten important factors in SVM algorithm are Jitter (Abs), Subject #, RPDE, PPE, Age, NHR, Shimmer APQ 11, NHR, Total-UPDRS, Shimmer (dB) and Shimmer. We also prove that the accuracy of our proposed C-SVM and RBF approaches is in direct proportion to the value of C parameter such that with increasing the amount of C, accuracy in both Kernel functions is increased. But unlike Polynomial and RBF, Sigmoid has an inverse relation with the amount of C. Indeed, by using these methods, we can find the most effective factors common in both genders (male and female). To the best of our knowledge there is no study on Parkinson's disease for identifying the most effective factors which are common in both genders.

[1]  Moloud Abdar,et al.  Performance analysis of classification algorithms on early detection of liver disease , 2017, Expert Syst. Appl..

[2]  Thomas H. Morris,et al.  Developing a Hybrid Intrusion Detection System Using Data Mining for Power Systems , 2015, IEEE Transactions on Smart Grid.

[3]  M. Cevdet Ince,et al.  An expert system for detection of breast cancer based on association rules and neural network , 2009, Expert Syst. Appl..

[4]  Lorene M Nelson,et al.  Incidence of Parkinson's disease: variation by age, gender, and race/ethnicity. , 2003, American journal of epidemiology.

[5]  Tawseef Ayoub Shaikh A Prototype of Parkinson’s and Primary Tumor Diseases Prediction Using Data Mining Techniques , 2014 .

[6]  Nada Lavrac,et al.  Selected techniques for data mining in medicine , 1999, Artif. Intell. Medicine.

[7]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[8]  David J. Spiegelhalter,et al.  Probabilistic Networks and Expert Systems - Exact Computational Methods for Bayesian Networks , 1999, Information Science and Statistics.

[9]  Salvatore J. Stolfo,et al.  Data Mining Approaches for Intrusion Detection , 1998, USENIX Security Symposium.

[10]  K. Hamberg,et al.  Gender differences in Parkinson's disease: A clinical perspective , 2017, Acta neurologica Scandinavica.

[11]  A. Hofman,et al.  Prevalence of Parkinson's disease in Europe: A collaborative study of population-based cohorts. Neurologic Diseases in the Elderly Research Group. , 2000, Neurology.

[12]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[13]  P. James An Essay on the Shaking Palsy , 1817, The Medico-Chirurgical Journal and Review.

[14]  Bhavani M. Thuraisingham Data mining for security applications , 2004, ICMLA.

[15]  M Suman,et al.  Direct Marketing with the Application of Data Mining , 2012 .

[16]  Judy Kay,et al.  Clustering and Sequential Pattern Mining of Online Collaborative Learning Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[17]  Richard J. Kryscio,et al.  Sugihara causality analysis of scalp EEG for detection of early Alzheimer's disease , 2014, NeuroImage: Clinical.

[18]  Charles X. Ling,et al.  Data Mining for Direct Marketing: Problems and Solutions , 1998, KDD.

[19]  K. Marder,et al.  A population-based investigation of Parkinson's disease with and without dementia. Relationship to age and gender. , 1992, Archives of neurology.

[20]  Moloud Abdar,et al.  Using Decision Trees in Data Mining for Predicting Factors Influencing of Heart Disease , 2015 .

[21]  S. Shoorabi Sani A case study for application of fuzzy inference and data mining in structural health monitoring , 2018 .

[22]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[23]  Jack Chin Pang Cheng,et al.  A data-driven study of important climate factors on the achievement of LEED-EB credits , 2015 .

[24]  Ruo-Ping Han,et al.  Disease prediction with different types of neural network classifiers , 2016, Telematics Informatics.

[25]  Resul Das,et al.  A comparison of multiple classification methods for diagnosis of Parkinson disease , 2010, Expert Syst. Appl..

[26]  Paolo Barone,et al.  The relevance of gender in Parkinson’s disease: a review , 2017, Journal of Neurology.

[27]  Kristin P. Bennett,et al.  Support vector machines: hype or hallelujah? , 2000, SKDD.

[28]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[29]  Adel Sabry Eesa,et al.  A novel feature-selection approach based on the cuttlefish optimization algorithm for intrusion detection systems , 2015, Expert Syst. Appl..