Prediction of Parkinson's disease using data mining methods: A comparative analysis of tree, statistical and support vector machine classifiers

Predicting Parkinson's disease (PD) at an early age has been a challenging task for researchers because the symptoms of the disease typically appear in middle and late middle age. Many symptoms can indicate Parkinson's disease. This paper focuses on the speech articulation difficulties of people affected by PD and formulates a predictive model using three data mining methods. The three methods are drawn from three different families of classifiers: a tree classifier, a statistical classifier, and a support vector machine classifier. The performance of the three classifiers is measured with three performance metrics: accuracy, sensitivity, and specificity. The main aim of the paper is to determine which model identifies people affected by PD most accurately.
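
The three metrics named above can be illustrated with a minimal sketch. It uses the standard confusion-matrix definitions, with PD treated as the positive class; the function names and the toy labels are hypothetical and are not taken from the paper's data or results.

```python
# Minimal sketch of accuracy, sensitivity, and specificity for a binary
# classifier. Assumption: label 1 means "PD affected", label 0 means "healthy".
# The labels below are made-up toy values, not results from the paper.

def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for the given positive label."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def evaluate(y_true, y_pred, positive=1):
    """Return accuracy, sensitivity, and specificity as a dict of floats."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + tn + fn
    return {
        # Fraction of all subjects classified correctly.
        "accuracy": (tp + tn) / total if total else float("nan"),
        # Fraction of PD subjects correctly identified (true positive rate).
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Fraction of healthy subjects correctly identified (true negative rate).
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


if __name__ == "__main__":
    y_true = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]  # hypothetical ground truth
    y_pred = [1, 1, 1, 1, 1, 0, 0, 0, 0, 1]  # hypothetical classifier output
    for name, value in evaluate(y_true, y_pred).items():
        print(f"{name}: {value:.3f}")
```

Reporting all three values matters when the classes are imbalanced: a classifier can reach high accuracy by favoring the majority class while its sensitivity or specificity stays poor.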
