A Systematic Review of Techniques and Sources of Big Data in the Healthcare Sector

The main objective of this paper is to present a review of existing researches in the literature, referring to Big Data sources and techniques in health sector and to identify which of these techniques are the most used in the prediction of chronic diseases. Academic databases and systems such as IEEE Xplore, Scopus, PubMed and Science Direct were searched, considering the date of publication from 2006 until the present time. Several search criteria were established as ‘techniques’ OR ‘sources’ AND ‘Big Data’ AND ‘medicine’ OR ‘health’, ‘techniques’ AND ‘Big Data’ AND ‘chronic diseases’, etc. Selecting the paper considered of interest regarding the description of the techniques and sources of Big Data in healthcare. It found a total of 110 articles on techniques and sources of Big Data on health from which only 32 have been identified as relevant work. Many of the articles show the platforms of Big Data, sources, databases used and identify the techniques most used in the prediction of chronic diseases. From the review of the analyzed research articles, it can be noticed that the sources and techniques of Big Data used in the health sector represent a relevant factor in terms of effectiveness, since it allows the application of predictive analysis techniques in tasks such as: identification of patients at risk of reentry or prevention of hospital or chronic diseases infections, obtaining predictive models of quality.

[1]  H. Ellis,et al.  Current Therapeutic Advances Targeting EGFR and EGFRvIII in Glioblastoma , 2015, Front. Oncol..

[2]  KhanSamiya,et al.  A survey on scholarly data , 2017 .

[3]  Sean D. Young,et al.  A "big data" approach to HIV epidemiology and prevention. , 2015, Preventive medicine.

[4]  Ms. Ishtake " Intelligent Heart Disease Prediction System Using Data Mining Techniques " , .

[5]  David S. Goodsell,et al.  The RCSB Protein Data Bank: redesigned web site and web services , 2010, Nucleic Acids Res..

[6]  José Manuel Martínez Sesmero,et al.  "Big Data"; aplicación y utilidad para el sistema sanitario , 2015 .

[7]  Sa'diyah Noor Novita Alfisahrin,et al.  Data Mining Techniques for Optimization of Liver Disease Classification , 2013, 2013 International Conference on Advanced Computer Science Applications and Technologies.

[8]  Abhay Bansal,et al.  Chronic Kidney Disease analysis using data mining classification techniques , 2016, 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence).

[9]  Wendy J. Ungar,et al.  National Database for Autism Research (NDAR): Big Data Opportunities for Health Services Research and Health Technology Assessment , 2016, PharmacoEconomics.

[10]  Fabrício F. Costa Big data in biomedicine. , 2014, Drug discovery today.

[11]  Priyanka Sharma,et al.  Big data for better health planning , 2014, 2014 International Conference on Advances in Engineering & Technology Research (ICAETR - 2014).

[12]  Anwar Haque,et al.  Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms , 2015, J. Comput. Sci..

[13]  M. Durairaj,et al.  An empirical study on applying data mining techniques for the analysis and prediction of heart disease , 2013, 2013 International Conference on Information Communication and Embedded Systems (ICICES).

[14]  Nidheesh Melethadathil,et al.  Classification and clustering for neuroinformatics: Assessing the efficacy on reverse-mapped NeuroNLP data using standard ML techniques , 2015, 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[15]  Suresh Chalasani,et al.  Predictive analytics on Electronic Health Records (EHRs) using Hadoop and Hive , 2015, 2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[16]  Amit Kumar,et al.  Cloud computing for improved healthcare: Techniques, potential and challenges , 2013, 2013 E-Health and Bioengineering Conference (EHB).

[17]  T. Pramananda Perumal,et al.  A Predictive Approach for Diabetes Mellitus Disease through Data Mining Technologies , 2014, 2014 World Congress on Computing and Communication Technologies.

[18]  Mansaf Alam,et al.  A survey on scholarly data: From big data perspective , 2017, Inf. Process. Manag..

[19]  Shaila H. Koppad,et al.  Application of big data analytics in healthcare system to predict COPD , 2016, 2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT).

[20]  David Meyre,et al.  From big data analysis to personalized medicine for all: challenges and opportunities , 2015, BMC Medical Genomics.

[21]  Naveen Garg,et al.  Challenges and Techniques for Testing of Big Data , 2016 .

[22]  Leo Anthony Celi,et al.  Preparing a New Generation of Clinicians for the Era of Big Data. , 2015, Harvard medical student review.

[23]  João Falcão e Cunha,et al.  Health Twitter Big Bata Management with Hadoop Framework , 2015 .

[24]  Yelena Yesha,et al.  SQL-like big data environments: Case study in clinical trial analytics , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[25]  Athanasios V. Vasilakos,et al.  Big data: From beginning to future , 2016, Int. J. Inf. Manag..

[26]  Kayvan Najarian,et al.  Big Data Analytics in Healthcare , 2015, BioMed research international.

[27]  Timothy N. Showalter,et al.  Big Data and Comparative Effectiveness Research in Radiation Oncology: Synergy and Accelerated Discovery , 2015, Front. Oncol..

[28]  Witold Pedrycz,et al.  An overview on the roles of fuzzy set techniques in big data processing: Trends, challenges and opportunities , 2017, Knowl. Based Syst..

[29]  Raghunath Nambiar,et al.  A look at challenges and opportunities of Big Data analytics in healthcare , 2013, 2013 IEEE International Conference on Big Data.

[30]  Carmen C. Y. Poon,et al.  Big Data for Health , 2015, IEEE Journal of Biomedical and Health Informatics.

[31]  Maamoun Ahmed,et al.  Data Mining and Fusion Techniques for WSNs as a Source of the Big Data , 2015 .

[32]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[33]  Marylyn D. Ritchie,et al.  A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data , 2012, J. Am. Medical Informatics Assoc..

[34]  Glòria Pérez,et al.  Peligros del uso de los big data en la investigación en salud pública y en epidemiología , 2016 .

[35]  S. Lavanya,et al.  Predictive Methodology for Diabetic Data Analysis in Big Data , 2015 .

[36]  Tao Huang,et al.  Promises and Challenges of Big Data Computing in Health Sciences , 2015, Big Data Res..

[37]  Malcolm Keech,et al.  Emerging Technologies for Health Data Analytics Research: A Conceptual Architecture , 2015, 2015 26th International Workshop on Database and Expert Systems Applications (DEXA).

[38]  Roy D. Sleator,et al.  'Big data', Hadoop and cloud computing in genomics , 2013, J. Biomed. Informatics.

[39]  Fei Jiang,et al.  Big data issues in smart grid – A review , 2017 .

[40]  Ivan Merelli,et al.  Managing, Analysing, and Integrating Big Data in Medical Bioinformatics: Open Problems and Future Perspectives , 2014, BioMed research international.

[41]  David S. Wishart,et al.  HMDB 3.0—The Human Metabolome Database in 2013 , 2012, Nucleic Acids Res..

[42]  Ahmed Patel,et al.  Empirical rapid and accurate prediction model for data mining tasks in cloud computing environments , 2014, 2014 International Congress on Technology, Communication and Knowledge (ICTCK).