A predictive method for hepatitis disease diagnosis using ensembles of neuro-fuzzy technique.

BACKGROUND Hepatitis is an inflammation of the liver, most commonly caused by a viral infection. Supervised data mining techniques have been successful in hepatitis disease diagnosis through a set of datasets. Many methods have been developed by the aids of data mining techniques for hepatitis disease diagnosis. The majority of these methods are developed by single learning techniques. In addition, these methods do not support the ensemble learning of the data. Combining the outputs of several predictors can result in improved accuracy in classification problems. This study aims to propose an accurate method for the hepatitis disease diagnosis by taking the advantages of ensemble learning. METHODS We use Non-linear Iterative Partial Least Squares to perform the data dimensionality reduction, Self-Organizing Map technique for clustering task and ensembles of Neuro-Fuzzy Inference System for predicting the hepatitis disease. We also use decision trees for the selection of most important features in the experimental dataset. We test our method on a real-world dataset and present our results in comparison with the latest results of previous studies. RESULTS The results of our analyses on the dataset demonstrated that our method performance is superior to the Neural Network, ANFIS, K-Nearest Neighbors and Support Vector Machine. CONCLUSIONS The method has potential to be used as an intelligent learning system for hepatitis disease diagnosis in the healthcare.

[1]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[2]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[3]  Gang Wang,et al.  A new hybrid method based on local fisher discriminant analysis and support vector machines for hepatitis disease diagnosis , 2011, Expert Syst. Appl..

[4]  Mehrbakhsh Nilashi,et al.  A hybrid intelligent system for the prediction of Parkinson's Disease progression using machine learning techniques , 2017 .

[5]  Hanan H Balkhy,et al.  Improvement of the low knowledge, attitude and practice of hepatitis B virus infection among Saudi national guard personnel after educational intervention , 2012, BMC Research Notes.

[6]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[7]  Mehrbakhsh Nilashi,et al.  Accuracy Improvement for Predicting Parkinson’s Disease Progression , 2016, Scientific Reports.

[8]  Aiman El-Saed,et al.  Magnitude and causes of loss to follow-up among patients with viral hepatitis at a tertiary care hospital in Saudi Arabia. , 2017, Journal of infection and public health.

[9]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Ponnuthurai N. Suganthan,et al.  Random Forests with ensemble of feature spaces , 2014, Pattern Recognit..

[11]  Yilmaz Kaya,et al.  A hybrid decision support system based on rough set and extreme learning machine for diagnosis of hepatitis disease , 2013, Appl. Soft Comput..

[12]  Z. Memish,et al.  Epidemiologic shift in the prevalence of Hepatitis A virus in Saudi Arabia: a case for routine Hepatitis A vaccination. , 2006, Vaccine.

[13]  Kevin W. Bowyer,et al.  Combination of Multiple Classifiers Using Local Accuracy Estimates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Mehrbakhsh Nilashi,et al.  Multi-criteria collaborative filtering with high accuracy using higher order singular value decomposition and Neuro-Fuzzy system , 2014, Knowl. Based Syst..

[15]  M. Götte,et al.  Resistance Patterns Associated with HCV NS5A Inhibitors Provide Limited Insight into Drug Binding , 2014, Viruses.

[16]  Tong Heng Lee,et al.  Evolutionary computing for knowledge discovery in medical diagnosis , 2003, Artif. Intell. Medicine.

[17]  Esin Dogantekin,et al.  A new intelligent hepatitis diagnosis system: PCA-LSSVM , 2011, Expert Syst. Appl..

[18]  B. Bell,et al.  The contributions of hepatitis B virus and hepatitis C virus infections to cirrhosis and primary liver cancer worldwide. , 2006, Journal of hepatology.

[19]  Kindie Biredagn Nahato,et al.  Hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets , 2016 .

[20]  Smaranda Belciug,et al.  Evolutionary strategy to develop learning-based decision systems. Application to breast cancer and liver fibrosis stadialization , 2014, J. Biomed. Informatics.

[21]  W. Sheng,et al.  Natural history of hepatitis D viral superinfection: significance of viremia detected by polymerase chain reaction. , 1995, Gastroenterology.

[22]  Imran Khan,et al.  Feature extraction through parallel Probabilistic Principal Component Analysis for heart disease diagnosis , 2017 .

[23]  Kemal Polat,et al.  Medical decision support system based on artificial immune recognition immune system (AIRS), fuzzy weighted pre-processing and feature selection , 2007, Expert Syst. Appl..

[24]  M. Tahar Kechadi,et al.  Tools for Statistical Analysis with Missing Data: Application to a Large Medical Database , 2005, MIE.

[25]  Erkki Oja,et al.  Engineering applications of the self-organizing map , 1996, Proc. IEEE.

[26]  Qingsheng Zhu,et al.  Spectral clustering with density sensitive similarity function , 2011, Knowl. Based Syst..

[27]  Z. Memish,et al.  Hepatitis B virus among Saudi National Guard personnel: seroprevalence and risk of exposure. , 2013, Journal of infection and public health.

[28]  W. M. Lee,et al.  Hepatitis B virus infection. , 1997, The New England journal of medicine.

[29]  D. Lavanchy,et al.  Hepatitis B virus epidemiology, disease burden, treatment, and current and emerging prevention and control measures , 2004, Journal of viral hepatitis.

[30]  Harry Vennema,et al.  Chronic hepatitis E virus infection in liver transplant recipients , 2008, Liver transplantation : official publication of the American Association for the Study of Liver Diseases and the International Liver Transplantation Society.

[31]  Yuri Levin,et al.  THE GEOMETRY OF LINEAR SEPARABILITY IN DATA SETS , 2006 .

[32]  Kourosh Mozafari,et al.  Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA) , 2012, Comput. Methods Programs Biomed..

[33]  H. Wold Causal flows with latent variables: Partings of the ways in the light of NIPALS modelling , 1974 .

[34]  Igor Kononenko,et al.  Machine learning for medical diagnosis: history, state of the art and perspective , 2001, Artif. Intell. Medicine.

[35]  Selma Ayse Özel,et al.  A hybrid approach of differential evolution and artificial bee colony for feature selection , 2016, Expert Syst. Appl..

[36]  Kemal Polat,et al.  Hepatitis disease diagnosis using a new hybrid system based on feature selection (FS) and artificial immune recognition system with fuzzy resource allocation , 2006, Digit. Signal Process..

[37]  Colin W Shepard,et al.  Global epidemiology of hepatitis C virus infection. , 2005, The Lancet. Infectious diseases.