Hybrid models based on genetic algorithm and deep learning algorithms for nutritional Anemia disease classification

Abstract Deep learning algorithms are an important part of disease prediction and diagnosis by analyzing health data. If not diagnosed and treated early, symptoms of nutritional anemia can be seen as a common laboratory finding of dyspnea, dizziness, lack of concentration, pale skin color, and life-threatening diseases. In the literature, several data mining techniques have been used for the prediction of nutritional anemia, especially, for the iron deficiency anemia. However, each algorithm does not perform well for every data, and therefore new techniques need to be developed. It is because the characteristics of each dataset are different and their dataset sizes, that is, the number of records and the number of parameters are different. In this study, we propose two hybrid models using genetic algorithm (GA) and deep learning algorithms of Stacked Autoencoder (SAE) and Convolutional Neural Network (CNN) for the prediction of HGB-anemia, nutritional anemia, (iron deficiency anemia, B12 deficiency anemia, and folate deficiency anemia), and patients without anemia. In the proposed GA-SAE and GA-CNN models, the hyperparameters of SAE and CNN algorithms are optimized using GA since it is not easy to determine suitable values of deep learning algorithms. Accuracy, F-score, precision, and sensitivity criteria were used to evaluate the prediction and classification performances of the proposed algorithms. As a result of the experimental evaluations using the dataset, the performance of the proposed GA-CNN algorithm whose layers trained separately and sequentially was found to be better than the performance of the studies proposed in the literature, by a 98.50% accuracy.

[1]  Jianzhong Wu,et al.  Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images , 2016, IEEE Transactions on Medical Imaging.

[2]  Nathalie Japkowicz,et al.  Multi-label Classification of Anemia Patients , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[3]  Dervis Karaboga,et al.  Performance analysis of ABCMiner algorithm with different objective functions , 2013, 2013 21st Signal Processing and Communications Applications Conference (SIU).

[4]  Petro Liashchynskyi,et al.  Grid Search, Random Search, Genetic Algorithm: A Big Comparison for NAS , 2019, ArXiv.

[6]  Qingjin Peng,et al.  The Tabu_Genetic Algorithm: A Novel Method for Hyper-Parameter Optimization of Learning Algorithms , 2019, Electronics.

[7]  İclal ÖZCAN,et al.  Developing Recommendation System Using Genetic Algorithm Based Alternative Least Squares , 2018, 2018 International Conference on Artificial Intelligence and Data Processing (IDAP).

[8]  U. Rajendra Acharya,et al.  Deep learning for healthcare applications based on physiological signals: A review , 2018, Comput. Methods Programs Biomed..

[9]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[10]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[11]  R. Unger,et al.  Application of machine learning algorithms for clinical predictive modeling: a data-mining approach in SCT , 2014, Bone Marrow Transplantation.

[12]  Zhiyong Chen,et al.  Fast Automatic Optimisation of CNN Architectures for Image Classification Using Genetic Algorithm , 2019, 2019 IEEE Congress on Evolutionary Computation (CEC).

[13]  H. Cohen,et al.  Anemia and Hemoglobin Levels in Older Persons: Relationship with Age, Gender, and Health Status , 1992, Journal of the American Geriatrics Society.

[14]  R. Stoltzfus,et al.  Defining iron-deficiency anemia in public health terms: a time for reflection. , 2001, The Journal of nutrition.

[15]  Yanhui Guo,et al.  White blood cells identification system based on convolutional deep neural learning networks , 2017, Comput. Methods Programs Biomed..

[16]  Shuiping Gou,et al.  Classification of PolSAR Images Using Multilayer Autoencoders and a Self-Paced Learning Approach , 2018, Remote. Sens..

[17]  Oguz Kaynar,et al.  Sentiment Analizinde Öznitelik Düşürme Yöntemlerinin Oto Kodlayıcılı Derin Öğrenme Makinaları ile Karşılaştırılması , 2017 .

[18]  T. Tuncer,et al.  Discrimination of β-thalassemia and iron deficiency anemia through extreme learning machine and regularized extreme learning machine based decision support system. , 2020, Medical hypotheses.

[19]  Meghana Nagori,et al.  Classification of Anemia Using Data Mining Techniques , 2011, SEMCCO.

[20]  Eyad H. Elshami,et al.  AUTOMATED DIAGNOSIS OF THALASSEMIA BASED ON DATAMINING CLASSIFIERS , 2012 .

[21]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[22]  Onur Cömert,et al.  Classification and diagnosis of cervical cancer with stacked autoencoder and softmax classification , 2019, Expert Systems with Applications.

[23]  Yang Liu,et al.  A method for multi-class sentiment classification based on an improved one-vs-one (OVO) strategy and the support vector machine (SVM) algorithm , 2017, Inf. Sci..

[24]  Tanveer J. Siddiqui,et al.  Machine Learning Algorithms for Anemia Disease Prediction , 2018, Lecture Notes in Electrical Engineering.

[25]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[26]  Jian Sun,et al.  Convolutional neural networks at constrained time cost , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[28]  M. Abdullah S. Al-Asmari Anemia types prediction based on data mining classification algorithms , 2016 .

[29]  Gülbin Özçelikay,et al.  THE SITUATION OF CURRICULUMS OF FACULTY OF PHARMACIES IN TURKEY , 2016 .

[30]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[31]  H. F. Ahmad,et al.  Assessing the relative importance of social determinants of health in malaria and anemia classification based on machine learning techniques , 2019, Informatics for health & social care.

[32]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[33]  L. Parthiban,et al.  Intelligent Heart Disease Prediction System Using CANFIS and Genetic Algorithm , 2007 .

[34]  Ashutosh Kumar Singh,et al.  Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: a systematic analysis for the Global Burden of Disease Study 2015 , 2016, The Lancet.

[35]  M. Hasani,et al.  Automated Diagnosis of Iron Deficiency Anemia and Thalassemia by Data Mining Techniques , 2017 .

[36]  Shahriar Gharibzadeh,et al.  Artificial Intelligence Models for Predicting Iron Deficiency Anemia and Iron Serum Level Based on Accessible Laboratory Data , 2012, Journal of Medical Systems.

[37]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[38]  George E. Andrews,et al.  A four parameter generalization of Göllnitz's (big) partition theorem , 2000, Unusual Applications of Number Theory.

[39]  Leandro Nunes de Castro,et al.  Fundamentals of Natural Computing - Basic Concepts, Algorithms, and Applications , 2006, Chapman and Hall / CRC computer and information science series.

[40]  Yan Zhang,et al.  Deep neural network for halftone image classification based on sparse auto-encoder , 2016, Eng. Appl. Artif. Intell..

[41]  Mostafa Langarizadeh,et al.  Designing an expert system for differential diagnosis of β-Thalassemia minor and Iron-Deficiency anemia using neural network , 2016 .

[42]  Dervis Karaboga,et al.  CoABCMiner: An Algorithm for Cooperative Rule Classification System Based on Artificial Bee Colony , 2016, Int. J. Artif. Intell. Tools.

[43]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[44]  Ashutosh Kumar Singh,et al.  Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015 , 2016, Lancet.

[45]  I. Obeid,et al.  Automated identification of abnormal adult EEGs , 2015, 2015 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[46]  Duangmanee Sanmun,et al.  The Observation Report of Red Blood Cell Morphology in Thailand Teenager by Using Data Mining Technique , 2014, Advances in hematology.

[47]  Atefeh Farzindar,et al.  Anemic Status Prediction using Multilayer Perceptron Neural Network Model , 2017, GCAI.

[48]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[49]  Luca Maria Gambardella,et al.  Deep, Big, Simple Neural Nets for Handwritten Digit Recognition , 2010, Neural Computation.

[50]  Stefan Wermter,et al.  An analysis of Convolutional Long Short-Term Memory Recurrent Neural Networks for gesture recognition , 2017, Neurocomputing.

[51]  Alaa M. El-Halees,et al.  Blood Tumor Prediction Using Data Mining Techniques , 2017 .

[52]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[53]  Enayetur Raheem,et al.  Machine Learning Algorithms To Predict The Childhood Anemia In Bangladesh , 2021 .

[54]  Mayank Sharma,et al.  White Blood Cell Classification Using Convolutional Neural Network , 2019, Advances in Intelligent Systems and Computing.

[55]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[56]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[57]  U. Rajendra Acharya,et al.  A deep convolutional neural network model to classify heartbeats , 2017, Comput. Biol. Medicine.

[58]  B. Blanc Nutritional anemias. Report of a WHO Scientific Group , 1968 .

[59]  Devendra K. Tayal,et al.  Using classification techniques for statistical analysis of Anemia , 2019, Artif. Intell. Medicine.

[60]  Sengul Dogan,et al.  Iron-Deficiency Anemia Detection From Hematology Parameters By Using Decision Trees , 2008 .