Voice-Based Detection of Parkinson's Disease through Ensemble Machine Learning Approach: A Performance Study

INTRODUCTION: Parkinson's disease (PD) occurs due to the deficiency of dopamine that regulates various activities of the human body. Researchers have identified that voice is an underlying symptom of PD. Recently, Machine learning (ML) has helped in solving problems of computer vision, natural language processing, speech recognition etc. OBJECTIVES: This paper aims to analyse the effect of feature type selection i.e. MFCC and TQWT on the efficiency of voice based PD detection system along with the use an ensemble learning based classifier for this task. METHODS: Hence, in this work, various machine learning models, including Logistic Regression, Naive Bayes, KNN, Random Forest, Decision Tree, SVM, MLP, and XGBoost, have been employed and explored for PD detection purpose. The task of Feature selection was also done using minimum-Redundancy and Maximum-Relevance (mRMR) and Recursive Feature Elimination (RFE) techniques. RESULTS: The results of the XGBoost with mRMR feature selection, outperformed all other models with a high accuracy of 95.39% and precision, recall and F1-score of 0.95 each, when both MFCC and TQWT features were selected. CONCLUSION: The results obtained strongly support the use of XGBoost model for the voice sample based detection of PD along with mRMR feature selection technique.

[1]  C. Warren Olanow,et al.  The scientific and clinical basis for the treatment of Parkinson disease (2009) , 2009, Neurology.

[2]  P. Dejonckere,et al.  A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques , 2001, European Archives of Oto-Rhino-Laryngology.

[3]  Jack J. Jiang,et al.  Phonatory impairment in Parkinson's disease: evidence from nonlinear dynamic analysis and perturbation analysis. , 2007, Journal of voice : official journal of the Voice Foundation.

[4]  Olcay Kursun,et al.  Telediagnosis of Parkinson’s Disease Using Measurements of Dysphonia , 2010, Journal of Medical Systems.

[5]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Ismail Saritas,et al.  Classification of Parkinson disease data with artificial neural networks , 2019, IOP Conference Series: Materials Science and Engineering.

[7]  Neha Singh,et al.  Advances in the treatment of Parkinson's disease , 2007, Progress in Neurobiology.

[8]  Mahinder Pal Singh Bhatia,et al.  SVM classification to distinguish Parkinson disease patients , 2010, A2CWiC '10.

[9]  Aysegul Gunduz,et al.  A comparative analysis of speech signal processing algorithms for Parkinson's disease classification and the use of the tunable Q-factor wavelet transform , 2019, Appl. Soft Comput..

[10]  L. Ramig,et al.  The Parkinson larynx: tremor and videostroboscopic findings. , 1996, Journal of voice : official journal of the Voice Foundation.

[11]  Ronald J. Baken,et al.  Clinical measurement of speech and voice , 1987 .

[12]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[13]  J. Langston,et al.  Parkinson's disease: current and future challenges. , 2002, Neurotoxicology.

[14]  A. Hofman,et al.  Prevalence of Parkinson's disease in Europe: A collaborative study of population-based cohorts. Neurologic Diseases in the Elderly Research Group. , 2000, Neurology.

[15]  A. Benba,et al.  Voiceprints analysis using MFCC and SVM for detecting patients with Parkinson's disease , 2015, 2015 International Conference on Electrical and Information Technologies (ICEIT).

[16]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[17]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[18]  J. Logemann,et al.  Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of Parkinson patients. , 1978, The Journal of speech and hearing disorders.

[19]  Lorene M Nelson,et al.  Incidence of Parkinson's disease: variation by age, gender, and race/ethnicity. , 2003, American journal of epidemiology.

[20]  R. Iansek,et al.  Speech impairment in a large sample of patients with Parkinson's disease. , 1998, Behavioural neurology.

[21]  A. Lang,et al.  Parkinson's disease. First of two parts. , 1998, The New England journal of medicine.

[22]  David Zhang,et al.  Feature selection and analysis on correlated gas sensor data with recursive feature elimination , 2015 .

[23]  Dimitrios I. Fotiadis,et al.  Assessment of Tremor Activity in the Parkinson’s Disease Using a Set of Wearable Sensors , 2012, IEEE Transactions on Information Technology in Biomedicine.

[24]  D. Huse,et al.  Burden of illness in Parkinson's disease , 2005, Movement disorders : official journal of the Movement Disorder Society.

[25]  Vibhakar Pathak,et al.  Parkinson Disease Prediction Using Machine Learning Algorithm , 2018, Advances in Intelligent Systems and Computing.

[26]  Jennifer L. Spielman,et al.  Effects of intensive voice treatment (the Lee Silverman Voice Treatment [LSVT]) on vowel articulation in dysarthric individuals with idiopathic Parkinson disease: acoustic and perceptual findings. , 2007, Journal of speech, language, and hearing research : JSLHR.

[27]  Fikret S. Gürgen,et al.  Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings , 2013, IEEE Journal of Biomedical and Health Informatics.

[28]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .