ECO-AMLP: A Decision Support System using an Enhanced Class Outlier with Automatic Multilayer Perceptron for Diabetes Prediction

With advanced data analytical techniques, efforts for more accurate decision support systems for disease prediction are on rise. Surveys by World Health Organization (WHO) indicate a great increase in number of diabetic patients and related deaths each year. Early diagnosis of diabetes is a major concern among researchers and practitioners. The paper presents an application of \textit{Automatic Multilayer Perceptron }which\textit{ }is combined with an outlier detection method \textit{Enhanced Class Outlier Detection using distance based algorithm }to create a prediction framework named as Enhanced Class Outlier with Automatic Multi layer Perceptron (ECO-AMLP). A series of experiments are performed on publicly available Pima Indian Diabetes Dataset to compare ECO-AMLP with other individual classifiers as well as ensemble based methods. The outlier technique used in our framework gave better results as compared to other pre-processing and classification techniques. Finally, the results are compared with other state-of-the-art methods reported in literature for diabetes prediction on PIDD and achieved accuracy of 88.7\% bests all other reported studies.

[1]  Nabil M. Hewahi,et al.  A comparative Study of Outlier Mining and Class Outlier Mining , 2009 .

[2]  Lawrence O. Hall,et al.  Predicting Juvenile Diabetes from Clinical Test Results , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[3]  K Kasikumar,et al.  Applications of Data Mining Techniques in Healthcare and Prediction of Heart Attacks , 2018, International Journal of Data Mining Techniques and Applications.

[4]  Mira Kania Sabariah,et al.  Early detection of type II Diabetes Mellitus with random forest and classification and regression tree (CART) , 2014 .

[5]  C. Floyd,et al.  Prediction of breast cancer malignancy using an artificial neural network , 1994, Cancer.

[6]  N. A. Nnamoko,et al.  Meta-classification model for diabetes onset forecast: A proof of concept , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[7]  G. M. Nasira,et al.  A New Approach for Diagnosis of Diabetes and Prediction of Cancer Using ANFIS , 2014, 2014 World Congress on Computing and Communication Technologies.

[8]  Khawar Khurshid,et al.  An expert system for diabetes prediction using auto tuned multi-layer perceptron , 2017, 2017 Intelligent Systems Conference (IntelliSys).

[9]  Imran Siddiqi,et al.  Improving handwriting based gender classification using ensemble classifiers , 2017, Expert Syst. Appl..

[10]  Shweta Kharya,et al.  Using data mining techniques for diagnosis and prognosis of cancer disease , 2012, ArXiv.

[11]  Jin Park,et al.  A sequential neural network model for diabetes prediction , 2001, Artif. Intell. Medicine.

[12]  Illhoi Yoo,et al.  Data Mining in Healthcare and Biomedicine: A Survey of the Literature , 2012, Journal of Medical Systems.

[13]  Senlin Luo,et al.  Rule Extraction From Support Vector Machines Using Ensemble Learning Approach: An Application for Diagnosis of Diabetes , 2015, IEEE Journal of Biomedical and Health Informatics.

[14]  M A Musen,et al.  Sequential versus standard neural networks for pattern recognition: An example using the domain of coronary heart disease , 1997, Comput. Biol. Medicine.

[15]  M Anbarasi,et al.  ENHANCED PREDICTION OF HEART DISEASE WITH FEATURE SUBSET SELECTION USING GENETIC ALGORITHM , 2010 .

[16]  Nesma Settouti,et al.  Recognition of diabetes disease using a new hybrid learning algorithm for NEFCLASS , 2013, 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA).

[17]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[18]  C. Lursinsap,et al.  Critical support vector machine without kernel function , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[19]  Praveen Kumar,et al.  Diagnosis of Diabetes Mellitus based on Risk Factors , 2010 .

[20]  Zhilbert Tafa,et al.  An intelligent system for diabetes prediction , 2015, 2015 4th Mediterranean Conference on Embedded Computing (MECO).

[21]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[22]  G. Sathyadevi,et al.  Application of CART algorithm in hepatitis disease diagnosis , 2011, 2011 International Conference on Recent Trends in Information Technology (ICRTIT).

[23]  A. A. Shafie,et al.  Application of modeling techniques to diabetes diagnosis , 2010, 2010 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES).

[24]  Nabil M. Hewahi,et al.  Class Outliers Mining: Distance-Based Approach , 2007 .

[25]  D. G. Bounds,et al.  A multilayer perceptron network for the diagnosis of low back pain , 1988, IEEE 1988 International Conference on Neural Networks.

[26]  Murali S. Shanker,et al.  Using Neural Networks To Predict the Onset of Diabetes Mellitus , 1996, J. Chem. Inf. Comput. Sci..

[27]  Amir Akramin Shafie,et al.  A novel signal diagnosis technique using pseudo complex-valued autoregressive technique , 2011, Expert Syst. Appl..

[28]  W. Wettayaprasit,et al.  Linguistic Knowledge Extraction from Neural Networks Using Maximum Weight and Frequency Data Representation , 2006, 2006 IEEE Conference on Cybernetics and Intelligent Systems.

[29]  Sungyoung Lee,et al.  Prediction of Diabetes Mellitus Based on Boosting Ensemble Modeling , 2014, UCAmI.

[30]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[31]  Vaibhav Bajpai,et al.  Application of Artificial Neural Network (ANN) technique for Diagnosing Diabetes Mellitus , 2008, 2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems.

[32]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[33]  Chang-Shing Lee,et al.  Ontology-based Fuzzy Inference Agent for Diabetes Classification , 2007, NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society.

[34]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[35]  Fevzullah Temurtas,et al.  A comparative study on diabetes disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[36]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.

[37]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[38]  Bayu Adhi Tama,et al.  An Early Detection Method of Type-2 Diabetes Mellitus in Public Hospital , 2011 .

[39]  S. Anitha,et al.  Application of a radial basis function neural network for diagnosis of diabetes mellitus , 2006 .

[40]  J. Hertzberg,et al.  Artificial Neural Network-Based Method of Screening Heart Murmurs in Children , 2001, Circulation.

[41]  T. Yıldırım,et al.  MEDICAL DIAGNOSIS ON PIMA INDIAN DIABETES USING GENERAL REGRESSION NEURAL NETWORKS , 2003 .

[42]  Lin Li Diagnosis of Diabetes Using a Weight-Adjusted Voting Approach , 2014, 2014 IEEE International Conference on Bioinformatics and Bioengineering.

[43]  S. Jaafar,et al.  Diabetes mellitus forecast using artificial neural network (ANN) , 2005, 2005 Asian Conference on Sensors and the International Conference on New Techniques in Pharmaceutical and Biomedical Research.

[44]  Somula Ramasubbareddy,et al.  Classification of Heart Disease Using Support Vector Machine , 2019, Journal of Computational and Theoretical Nanoscience.

[45]  Mohsen Beheshti,et al.  Diabetes Data Analysis and Prediction Model Discovery Using RapidMiner , 2008, 2008 Second International Conference on Future Generation Communication and Networking.

[46]  Yang Guo,et al.  Using Bayes Network for Prediction of Type-2 diabetes , 2012, 2012 International Conference for Internet Technology and Secured Transactions.

[47]  Kemal Polat,et al.  A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine , 2008, Expert Syst. Appl..

[48]  Richa Sharma,et al.  Diabetes mellitus prediction system evaluation using C4.5 rules and partial tree , 2015, 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions).

[49]  Bekir Karlik,et al.  Diagnosing diabetes using neural networks on small mobile devices , 2012, Expert Syst. Appl..

[50]  Saudi Arabia,et al.  Decision Tree Discovery for the Diagnosis of Type II Diabetes , 2011 .

[51]  Thomas M. Breuel,et al.  Scanning Neural Network for Text Line Recognition , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[52]  Mehreen Ahmed,et al.  MCS: Multiple classifier system to predict the churners in the telecom industry , 2017, 2017 Intelligent Systems Conference (IntelliSys).

[53]  Sudipto Saha,et al.  Prediction of continuous B‐cell epitopes in an antigen using recurrent neural network , 2006, Proteins.