A Hadoop‐big data analytic model to predict and classify chronic kidney diseases using improved fractional rough fuzzy K‐means clustering and extreme gradient boost rat swarm optimizer

In this article, a Hadoop‐big data based chronic kidney disease prediction and classification using improved fractional rough fuzzy K‐means (IF‐RFKM) clustering and XG boost rat swarm optimizer is proposed. Here, IF‐RFKM clustering method is contemplated for the disease prediction. This disease is classified using XG boost classifier for classifying the stages of chronic kidney diseases as normal and abnormal. Moreover, the rat swarm optimization (RSO) algorithm is proposed for optimizing the parameters of the XG boost classifier. Initially, the data is randomly generated from CKD dataset. The simulation is carried out in python language. From the simulation, the proposed method attains higher accuracy 99.57%, 98.28%, and 97.35%, higher recall 98.23%, 88.34%, and 78.96% and lower execution time 92.15%, 90.25%, and 92.48% compared with existing methods, like chronic kidney disease detection and classification by recursive feature elimination using decision tree (CCKD‐RFE‐DT). An efficient chronic kidney disease classification and clustering using logistic regression (CCKD‐LR) and efficient chronic kidney disease classification utilizing multi‐kernel support vector machine with (CCKD‐MKC‐SVM).

[1]  Francis H. Shajin,et al.  Leveraging a Dynamic Differential Annealed Optimization and Recalling Enhanced Recurrent Neural Network for Maximum Power Point Tracking in Wind Energy Conversion System , 2022, Technology and Economics of Smart Grids and Sustainable Energy.

[2]  Francis H. Shajin,et al.  FPGA Realization of a Reversible Data Hiding Scheme for 5G MIMO-OFDM System by Chaotic Key Generation-Based Paillier Cryptography Along with LDPC and Its Side Channel Estimation Using Machine Learning Technique , 2021, J. Circuits Syst. Comput..

[3]  P. Rajesh,et al.  An Efficient VLSI Architecture for Fast Motion Estimation Exploiting Zero Motion Prejudgment Technique and a New Quadrant-Based Search Algorithm in HEVC , 2021, Circuits, Systems, and Signal Processing.

[4]  Basab Chakraborty,et al.  A meta-analytic approach for determining the success factors for energy conservation , 2021 .

[5]  Francis H. Shajin,et al.  An Efficient Estimation Model for Induction Motor Using BMO-RBFNN Technique , 2021, Process Integration and Optimization for Sustainability.

[6]  Manik Sharma,et al.  A comprehensive review and analysis of supervised-learning and soft computing techniques for stress diagnosis in humans , 2021, Comput. Biol. Medicine.

[7]  Qinglong Zhan,et al.  Visualization Analysis of Artificial Intelligence Technology in Higher Education Based on SSCI and SCI Journals from 2009 to 2019 , 2021, Int. J. Emerg. Technol. Learn..

[8]  Avijit Kumar Chaudhuri,et al.  A novel enhanced decision tree model for detecting chronic kidney disease , 2021, Netw. Model. Anal. Health Informatics Bioinform..

[9]  P. Pradeepa,et al.  An efficient self-tuning spectral clustering algorithm for chronic kidney disease prediction , 2020 .

[10]  Aditya Khamparia,et al.  KDSAE: Chronic kidney disease classification with multimedia data learning using deep stacked autoencoder network , 2019, Multimedia Tools and Applications.

[11]  Atulya K. Nagar,et al.  A novel algorithm for global optimization: Rat Swarm Optimizer , 2020, Journal of Ambient Intelligence and Humanized Computing.

[12]  Dong Yue,et al.  Interval Type-2 Fuzzy Local Enhancement Based Rough K-Means Clustering Considering Imbalanced Clusters , 2020, IEEE Transactions on Fuzzy Systems.

[13]  Mohammadreza Nemati,et al.  Machine-Learning Approaches in COVID-19 Survival Analysis and Discharge-Time Likelihood Prediction Using Clinical Data , 2020, Patterns.

[14]  Adebayo Olusola Adetunmbi,et al.  Development of an ensemble approach to chronic kidney disease diagnosis , 2020 .

[15]  Nesreen Alsharman,et al.  Machine Learning and Answer Set Program Rules towards Traffic Light Management , 2020, International Journal of Advanced Trends in Computer Science and Engineering.

[16]  Saurabh Singh,et al.  SMO-DNN: Spider Monkey Optimization and Deep Neural Network Hybrid Classifier Model for Intrusion Detection , 2020, Electronics.

[17]  Zihao Wang,et al.  A review of data mining technologies in building energy systems: Load prediction, pattern identification, fault detection and diagnosis , 2020 .

[18]  Ching-Hsien Hsu,et al.  Hybrid method for mining rules based on enhanced Apriori algorithm with sequential minimal optimization in healthcare industry , 2020, Neural Computing and Applications.

[19]  T. Wong,et al.  Logistic regression was as good as machine learning for predicting major chronic diseases. , 2020, Journal of clinical epidemiology.

[20]  Theyazn H. H. Aldhyani,et al.  Soft Clustering for Enhancing the Diagnosis of Chronic Diseases over Machine Learning Algorithms , 2020, Journal of healthcare engineering.

[21]  Eswaran Perumal,et al.  Efficient classification of chronic kidney disease by using multi-kernel support vector machine and fruit fly optimization algorithm , 2020, Int. J. Imaging Syst. Technol..

[22]  Mohsen Guizani,et al.  Machine learning in the Internet of Things: Designed techniques for smart cities , 2019, Future Gener. Comput. Syst..

[23]  Maninder Singh,et al.  Big data analytics for healthcare industry: impact, applications, and tools , 2019, Big Data Min. Anal..

[24]  Selahaddin Batuhan Akben,et al.  Early Stage Chronic Kidney Disease Diagnosis by Applying Data Mining Methods to Urinalysis, Blood Analysis and Disease History , 2018, IRBM.

[25]  Gunter Saake,et al.  Analyzing data quality issues in research information systems via data profiling , 2018, Int. J. Inf. Manag..

[26]  Kyung E. Rhee,et al.  Provider views on childhood obesity management in primary care settings: a mixed methods analysis , 2018, BMC Health Services Research.

[27]  M. Javaid,et al.  Internet of Things (IoT) enabled healthcare helps to take the challenges of COVID-19 Pandemic. , 2021, Journal of oral biology and craniofacial research.

[28]  Aditi Sanjay Bhosale,et al.  Study of Medicine Dispensing Machine and Health Monitoring Devices , 2021 .

[29]  C. Anna Palagan,et al.  Kidney disease detection and segmentation using artificial neural network and multi-kernel k-means clustering for ultrasound images , 2020 .

[30]  Cedric Baudoin,et al.  Towards the Deployment of Machine Learning Solutions in Network Traffic Classification: A Systematic Survey , 2019, IEEE Communications Surveys & Tutorials.

[31]  U. Sairam,et al.  HEART DISEASE PREDICTION USING DATA MINING , 2018 .