Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique

Nowadays, health prediction has become essential in modern life. Big data analysis plays a crucial role in predicting future health status and offers superior health outcomes to people. Heart disease is a prevalent cause of death around the world. Much research is under way on predictive analytics using machine learning techniques to support better decision making. Big data analysis creates opportunities to predict future health status from health parameters and to provide the best outcomes. We used a Big Data Predictive Analytics Model for Disease Prediction using the Naive Bayes Technique (BPA-NB). It provides probabilistic classification based on Bayes' theorem with independence assumptions between the features. The Naive Bayes approach is suitable for huge data sets, especially big data. The Naive Bayes model is trained on heart disease data taken from the UCI Machine Learning Repository and then makes predictions on the test data to predict the classification. The results reveal that the proposed BPA-NB scheme predicts the disease rate with an accuracy of about 97.12%. The proposed BPA-NB scheme uses Hadoop-Spark as the big data computing tool to obtain significant insight into healthcare data. The experiments predict the future health condition of different patients. The scheme uses the training dataset to estimate the health parameters necessary for classification. The results show that early disease detection helps to determine the future health of patients.
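
The Naive Bayes classification described above can be illustrated with a minimal sketch in plain Python. It is not the BPA-NB implementation, which runs on Hadoop-Spark. It assumes Gaussian likelihoods for numeric features (the abstract does not state which Naive Bayes variant is used), and the function names `fit_gaussian_nb` and `predict`, as well as the toy records, are hypothetical and are not taken from the UCI heart disease data.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y):
    """Estimate the class prior and per-feature (mean, variance) for each class."""
    rows_by_class = defaultdict(list)
    for row, label in zip(X, y):
        rows_by_class[label].append(row)

    model = {}
    for label, rows in rows_by_class.items():
        stats = []
        for column in zip(*rows):  # iterate over features
            mean = sum(column) / len(column)
            # Small constant avoids division by zero for constant features.
            var = sum((v - mean) ** 2 for v in column) / len(column) + 1e-9
            stats.append((mean, var))
        model[label] = (len(rows) / len(X), stats)
    return model


def predict(model, row):
    """Return the class with the highest log posterior.

    Independence assumption: the per-feature log likelihoods are simply summed.
    """
    best_label, best_score = None, -math.inf
    for label, (prior, stats) in model.items():
        score = math.log(prior)
        for value, (mean, var) in zip(row, stats):
            score += -0.5 * math.log(2 * math.pi * var) - (value - mean) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy records: [age, resting blood pressure]; 1 = disease, 0 = no disease.
X_train = [[63, 150], [67, 160], [58, 145], [41, 120], [37, 118], [45, 125]]
y_train = [1, 1, 1, 0, 0, 0]

model = fit_gaussian_nb(X_train, y_train)
print(predict(model, [60, 152]))  # 1
print(predict(model, [40, 119]))  # 0
```

Training is a single pass that computes per-class counts, means, and variances, which is one reason the approach tends to suit large data sets: such statistics can be accumulated in parallel across data partitions.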
