Development of Hepatitis Disease Detection System by Exploiting Sparsity in Linear Support Vector Machine to Improve Strength of AdaBoost Ensemble Model

Hepatitis disease is a deadliest disease. The management and diagnosis of hepatitis disease is expensive and requires high level of human expertise which poses challenges for the health care system in underdeveloped and developing countries. Hence, development of automated methods for accurate prediction of hepatitis disease is inevitable. In this paper, we develop a diagnostic system which hybridizes a linear support vector machine (SVM) model with adaptive boosting (AdaBoost) model. We exploit sparsity in linear SVM that is caused by regularization. The sparse - regularized SVM is capable of eliminating redundant or irrelevant features from feature space. After filtering features through the sparse linear SVM, the output of the SVM is applied to the AdaBoost ensemble model which is used for classification purposes. Two types of numerical experiments are performed on the clinical features of hepatitis disease collected from UCI machine learning repository. In the first experiment, only conventional AdaBoost model is used, while in the second experiment, a feature vector is applied to the sparse linear SVM before its application to the AdaBoost model. Simulation results demonstrate that the strength of a conventional AdaBoost model is enhanced by 6.39% by the proposed method, and its time complexity is also reduced. In addition, the proposed method shows better performance than many previously developed methods for hepatitis disease prediction.

[1]  Hui Gao,et al.  Adaptive Wavelet Based MRI Brain Image De-noising , 2020, Frontiers in Neuroscience.

[2]  Yilmaz Kaya,et al.  A hybrid decision support system based on rough set and extreme learning machine for diagnosis of hepatitis disease , 2013, Appl. Soft Comput..

[3]  Esin Dogantekin,et al.  A new intelligent hepatitis diagnosis system: PCA-LSSVM , 2011, Expert Syst. Appl..

[4]  Mehrbakhsh Nilashi,et al.  A predictive method for hepatitis disease diagnosis using ensembles of neuro-fuzzy technique. , 2019, Journal of infection and public health.

[5]  Kemal Polat,et al.  An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease , 2007, Digit. Signal Process..

[6]  Gang Wang,et al.  A new hybrid method based on local fisher discriminant analysis and support vector machines for hepatitis disease diagnosis , 2011, Expert Syst. Appl..

[7]  Moloud Abdar,et al.  Performance analysis of classification algorithms on early detection of liver disease , 2017, Expert Syst. Appl..

[8]  Pasi Luukka,et al.  Similarity classifier using similarities based on modified probabilistic equivalence relations , 2009, Knowl. Based Syst..

[9]  Ce Zhu,et al.  Automated Detection of Parkinson’s Disease Based on Multiple Types of Sustained Phonations Using Linear Discriminant Analysis and Genetically Optimized Neural Network , 2019, IEEE Journal of Translational Engineering in Health and Medicine.

[10]  Noorbakhsh Amiri Golilarz,et al.  Reliable Parkinson’s Disease Detection by Analyzing Handwritten Drawings: Construction of an Unbiased Cascaded Learning System Based on Feature Selection and Adaptive Boosting Model , 2019, IEEE Access.

[11]  Syed Ahmad Chan Bukhari,et al.  A statistically rigorous deep neural network approach to predict mortality in trauma patients admitted to the intensive care unit , 2020, The journal of trauma and acute care surgery.

[12]  Babak Mohammadzadeh Asl,et al.  Automated diagnosis of coronary artery disease (CAD) patients using optimized SVM , 2017, Comput. Methods Programs Biomed..

[13]  Osman Altay,et al.  Prediction of wear loss quantities of ferro-alloy coating using different machine learning algorithms , 2019, Friction.

[14]  Xiaolu Tian,et al.  Using Machine Learning Algorithms to Predict Hepatitis B Surface Antigen Seroclearance , 2019, Comput. Math. Methods Medicine.

[15]  Adeeb Noor,et al.  A Feature-Driven Decision Support System for Heart Failure Prediction Based on χ2 Statistical Model and Gaussian Naive Bayes , 2019, Comput. Math. Methods Medicine.

[16]  Andrew K. C. Wong,et al.  Classification of Imbalanced Data: a Review , 2009, Int. J. Pattern Recognit. Artif. Intell..

[17]  Gunasekaran Manogaran,et al.  RETRACTED ARTICLE: Hybrid Recommendation System for Heart Disease Diagnosis based on Multiple Kernel Learning with Adaptive Neuro-Fuzzy Inference System , 2017, Multimedia Tools and Applications.

[18]  Kemal Polat,et al.  Hepatitis disease diagnosis using a new hybrid system based on feature selection (FS) and artificial immune recognition system with fuzzy resource allocation , 2006, Digit. Signal Process..

[19]  Dan Keun Sung,et al.  Solar Power Prediction Based on Satellite Images and Support Vector Machine , 2016, IEEE Transactions on Sustainable Energy.

[20]  Ce Zhu,et al.  Early diagnosis of Parkinson's disease from multiple voice recordings by simultaneous sample and feature selection , 2019, Expert Syst. Appl..

[21]  B. Rehermann,et al.  Immunology of hepatitis B virus and hepatitis C virus infection , 2005, Nature Reviews Immunology.

[22]  Seyed Amir Naghibi,et al.  Application of Support Vector Machine, Random Forest, and Genetic Algorithm Optimized Random Forest Models in Groundwater Potential Mapping , 2017, Water Resources Management.

[23]  Richard Weber,et al.  Feature selection for Support Vector Machines via Mixed Integer Linear Programming , 2014, Inf. Sci..

[24]  Syed Ahmad Chan Bukhari,et al.  LDA–GA–SVM: improved hepatocellular carcinoma prediction through dimensionality reduction and genetically optimized support vector machine , 2020, Neural Computing and Applications.

[25]  C. Chu,et al.  Hepatitis B virus infection , 2009, The Lancet.

[26]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[27]  D. Vigneswari,et al.  Hepatitis-Infectious Disease Prediction using Classification Algorithms , 2019, Research Journal of Pharmacy and Technology.

[28]  Shafqat Ullah Khan,et al.  Correction of array failure using grey wolf optimizer hybridized with an interior point algorithm , 2018, Frontiers of Information Technology & Electronic Engineering.

[29]  Kourosh Mozafari,et al.  Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA) , 2012, Comput. Methods Programs Biomed..

[30]  Syed Ahmad Chan Bukhari,et al.  An Approach Based on Mutually Informed Neural Networks to Optimize the Generalization Capabilities of Decision Support Systems Developed for Heart Failure Prediction , 2020 .

[31]  Craig S. Miller,et al.  Cardiac ScoreCard: A diagnostic multivariate index assay system for predicting a spectrum of cardiovascular disease , 2016, Expert Syst. Appl..

[32]  U. Rajendra Acharya,et al.  Automated detection of coronary artery disease using different durations of ECG segments with convolutional neural network , 2017, Knowl. Based Syst..

[33]  Adeeb Noor,et al.  An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure , 2019, IEEE Access.

[34]  Esin Dogantekin,et al.  Automatic hepatitis diagnosis system based on Linear Discriminant Analysis and Adaptive Network based on Fuzzy Inference System , 2009, Expert Syst. Appl..

[35]  Sagheer Abbas,et al.  Automated Diagnosis of Hepatitis B Using Multilayer Mamdani Fuzzy Inference System , 2019, Journal of healthcare engineering.

[36]  Yanbin Yuan,et al.  Wind power prediction using hybrid autoregressive fractionally integrated moving average and least square support vector machine , 2017 .

[37]  Kemal Polat,et al.  Prediction of hepatitis disease based on principal component analysis and artificial immune recognition system , 2007, Appl. Math. Comput..