Antlion re-sampling based deep neural network model for classification of imbalanced multimodal stroke dataset

Stroke is listed as one of the leading causes of death and serious disability, affecting millions of lives across the world, with a high likelihood of becoming an epidemic in the next few decades. Timely detection and prompt decision making play a major role in reducing the chances of brain death, paralysis and other resultant outcomes. Machine learning algorithms have been a popular choice for the diagnosis, analysis and prediction of this disease, but there are issues with data quality because the data are collected from cross-institutional resources. The present study focuses on improving the quality of stroke data through a rigorous pre-processing technique. It uses a multimodal stroke dataset from the publicly available Kaggle repository. The missing values in this dataset are replaced with attribute means, and the LabelEncoder technique is applied to achieve homogeneity. However, the dataset was observed to be imbalanced, which means that results obtained on it would be biased and might not represent the actual accuracy. To overcome this imbalance, a resampling technique was used. In oversampling, some data points in the minority class are replicated to increase its cardinality and rebalance the dataset. The transformed and oversampled data are further normalized using the StandardScaler technique. The antlion optimization (ALO) algorithm is applied to the deep neural network (DNN) model to select optimal hyperparameters with minimal time consumption. The proposed model consumed only 38.13% of the training time, which was also a positive aspect. The experimental results demonstrated the superiority of the proposed model.
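The pre-processing sequence described in the abstract (mean imputation, label encoding, oversampling by replication, then standardization) can be sketched as follows. This is a minimal illustration in plain Python, not the authors' implementation: the helper functions are hypothetical stand-ins for the behaviour of scikit-learn's LabelEncoder and StandardScaler, the oversampling is simple random replication, and the toy columns (`bmi`, `gender`, `stroke`) and their values are invented for the example.

```python
import random
from collections import Counter
from statistics import mean, pstdev


def impute_mean(column):
    """Replace missing entries (None) with the mean of the observed values."""
    observed = [v for v in column if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in column]


def label_encode(column):
    """Map each category to an integer index, assigned in sorted order."""
    classes = sorted(set(column))
    index = {c: i for i, c in enumerate(classes)}
    return [index[v] for v in column]


def random_oversample(rows, labels, seed=0):
    """Replicate randomly chosen minority rows until every class
    has as many rows as the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def standardize(column):
    """Rescale a column to zero mean and unit (population) variance."""
    mu, sigma = mean(column), pstdev(column)
    return [(v - mu) / sigma if sigma else 0.0 for v in column]


# Toy data (assumed for illustration only): one numeric attribute with a
# missing value, one categorical attribute, and an imbalanced target.
bmi = [22.0, None, 31.0, 27.0, 25.0, 24.0]
gender = ["Male", "Female", "Female", "Male", "Female", "Male"]
stroke = [0, 0, 0, 0, 1, 0]

bmi_filled = impute_mean(bmi)            # step 1: mean imputation
gender_codes = label_encode(gender)      # step 2: label encoding
rows = list(zip(bmi_filled, gender_codes))
rows_bal, stroke_bal = random_oversample(rows, stroke)  # step 3: oversampling
bmi_scaled = standardize([r[0] for r in rows_bal])      # step 4: normalization

print(Counter(stroke_bal))
```

In this sketch the scaling statistics are computed after oversampling, following the order given in the abstract. Oversampling here is applied to the whole toy dataset; in a real evaluation it would normally be restricted to the training split so that replicated rows do not leak into the test data.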
