A review of the literature on big data analytics in healthcare

Abstract Big data analytics (BDA) is of paramount importance in healthcare aspects such as patient diagnostics, fast epidemic recognition, and improvement of patient management. The objective of this profiling study is (a) to provide an overview of the BDA publication dynamics in the healthcare domain and (b) to discuss this scientific field through related examples. A sampling literature review has been conducted. A total of 804 papers have been identified and content analysis has been performed to mine knowledge in the domain for the years 2000–2016. The findings show that co-authors’ backgrounds are from the subject areas of medicine and computer sciences. Most articles are experimental in nature and use modeling and machine learning techniques to exploit clinical data, for health monitoring and prediction purposes. Many articles are relevant to the medical specialties of neurology/neurosurgery/neuropsychiatry, medical oncology, and cardiology. Well-cited papers investigate the identification and management of high-risk/cost patients, the use of big data, Hadoop and cloud computing in genomics, and the development of mobile applications for disease management. Important is also the research about improving disease prediction by investigating patients' medical results using advanced analysis (such as segmentation and predictive modelling, machine learning, visualisation, etc.).

[1]  Mary J Wills,et al.  Decisions Through Data: Analytics in Healthcare , 2014, Journal of healthcare management / American College of Healthcare Executives.

[2]  Julio César Hernández Castro,et al.  Detecting discussion communities on vaccination in twitter , 2017, Future Gener. Comput. Syst..

[3]  Deepak S. Turaga Introduction to the Interfaces Special Issue: Applications of Analytics and Operations Research in Big Data Analysis , 2018, Interfaces.

[4]  Hao Wang,et al.  A Clinical Decision Support Framework for Heterogeneous Data Sources , 2018, IEEE Journal of Biomedical and Health Informatics.

[5]  Shinyi Wu,et al.  An agent-based simulation model to study accountable care organizations , 2014, Health Care Management Science.

[6]  H. Krumholz Big data and new knowledge in medicine: the thinking, training, and tools needed for a learning health system. , 2014, Health affairs.

[7]  Hiroshi Mitsumaki,et al.  MODULAR ANALYTICS: A New Approach to Automation in the Clinical Laboratory , 2005, Journal of automated methods & management in chemistry.

[8]  Dhiya Al-Jumeily,et al.  Machine learning approaches to the application of disease modifying therapy for sickle cell using classification models , 2017, Neurocomputing.

[9]  Natasa Przulj,et al.  Integrative methods for analyzing big data in precision medicine , 2016, Proteomics.

[10]  Jeffrey S. Barrett,et al.  Bmc Medical Informatics and Decision Making Integration of Modeling and Simulation into Hospital-based Decision Support Systems Guiding Pediatric Pharmacotherapy , 2008 .

[11]  Mark F. Thouin,et al.  The effect of information technology investment on firm-level performance in the health care industry , 2008, Health care management review.

[12]  Mihaela IVAN,et al.  Healthcare Industry Improvement with Business Intelligence , 2015 .

[13]  Cheol Lee,et al.  Epidemiology of complex regional pain syndrome in Korea: An electronic population health data study , 2018, PloS one.

[14]  R. Cameron Craddock,et al.  Clinical applications of the functional connectome , 2013, NeuroImage.

[15]  Meredith A Barrett,et al.  Big Data and Disease Prevention: From Quantified Self to Quantified Communities , 2013, Big Data.

[16]  Darcy A. Davis,et al.  Bringing Big Data to Personalized Healthcare: A Patient-Centered Framework , 2013, Journal of General Internal Medicine.

[17]  Paul Nagy,et al.  Business intelligence for the radiologist: making your data work for you. , 2014, Journal of the American College of Radiology : JACR.

[18]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[19]  K. Marsolo,et al.  Applications of Business Analytics in Healthcare. , 2014, Business horizons.

[20]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[21]  Samee U. Khan,et al.  Personalized healthcare cloud services for disease risk assessment and wellness management using social media , 2016, Pervasive and Mobile Computing.

[22]  Régis Beuscart,et al.  Toward a Literature-Driven Definition of Big Data in Healthcare , 2015, BioMed research international.

[23]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[24]  Shahriar Akter,et al.  How ‘Big Data’ Can Make Big Impact: Findings from a Systematic Review and a Longitudinal Case Study , 2015 .

[25]  Ann Christine Catlin,et al.  Comparative analytics of infusion pump data across multiple hospital systems. , 2015, American journal of health-system pharmacy : AJHP : official journal of the American Society of Health-System Pharmacists.

[26]  Mary E. Helander,et al.  Supply Chain Scenario Modeler: A Holistic Executive Decision Support Solution , 2014, Interfaces.

[27]  J. Broughman,et al.  Using big data for quality assessment in oncology. , 2016, Journal of comparative effectiveness research.

[28]  Phung Anh Nguyen,et al.  Cancer-disease associations: A visualization and animation through medical big data , 2016, Comput. Methods Programs Biomed..

[29]  Andrew Stranieri,et al.  Addressing the Complexities of Big Data Analytics in Healthcare: The Diabetes Screening Case , 2015, Australas. J. Inf. Syst..

[30]  Luciano Floridi,et al.  The Ethics of Big Data: Current and Foreseeable Issues in Biomedical Contexts , 2015, Science and Engineering Ethics.

[31]  Vasa Curcin,et al.  Model-driven approach to data collection and reporting for quality improvement , 2014, J. Biomed. Informatics.

[32]  Lemuria Carter,et al.  literature review of RFID-enabled healthcare applications and issues amuel , 2013 .

[33]  Nick Wilson,et al.  Updated New Zealand health system cost estimates from health events by sex, age and proximity to death: further improvements in the age of 'big data'. , 2015, The New Zealand medical journal.

[34]  C. Quinn,et al.  WellDoc mobile diabetes management randomized controlled trial: change in clinical and behavioral outcomes and patient and physician satisfaction. , 2008, Diabetes technology & therapeutics.

[35]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[36]  Geoff Royston,et al.  Operational Research for the Real World: big questions from a small island , 2013, J. Oper. Res. Soc..

[37]  Roy D. Sleator,et al.  'Big data', Hadoop and cloud computing in genomics , 2013, J. Biomed. Informatics.

[38]  Dursun Delen,et al.  Analysis of cancer data: a data mining approach , 2009, Expert Syst. J. Knowl. Eng..

[39]  Min Chen,et al.  Disease Prediction by Machine Learning Over Big Data From Healthcare Communities , 2017, IEEE Access.

[40]  Bernhard Preim,et al.  Interactive Visual Analysis of Heterogeneous Cohort-Study Data , 2014, IEEE Computer Graphics and Applications.

[41]  Anandhi S. Bharadwaj,et al.  A Resource-Based Perspective on Information Technology Capability and Firm Performance: An Empirical Investigation , 2000, MIS Q..

[42]  Michael W. Weiner,et al.  Crowdsourced estimation of cognitive decline and resilience in Alzheimer's disease , 2016, Alzheimer's & Dementia.

[43]  Courtney Corley,et al.  Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance , 2010, Comput. Methods Programs Biomed..

[44]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[45]  Kai Yang,et al.  An analytics approach to designing patient centered medical homes , 2014, Health care management science.

[46]  Vasant Dhar,et al.  Editorial - Big Data, Data Science, and Analytics: The Opportunity and Challenge for IS Research , 2014, Inf. Syst. Res..

[47]  Igor Jurisica,et al.  Knowledge Discovery and Data Mining in Biomedical Informatics: State-of-the-Art and Future Challenges , 2014 .

[48]  Erik W. Kuiler From Big Data to Knowledge: An Ontological Approach to Big Data Analytics , 2014 .

[49]  A. Gunasekaran,et al.  Big data analytics in logistics and supply chain management: Certain investigations for research and applications , 2016 .

[50]  Shahriar Akter,et al.  Big data analytics in E-commerce: a systematic review and agenda for future research , 2016, Electronic Markets.

[51]  David Baum An intelligent patient focus. Cambridge Memorial Hospital is increasing efficiency and improving patient care with a new emergency room tracking board and business-intelligence system. , 2010, Health management technology.

[52]  Christopher G. Chute,et al.  The Enterprise Data Trust at Mayo Clinic: a semantically integrated warehouse of biomedical data , 2010, J. Am. Medical Informatics Assoc..

[53]  Andreas Holzinger,et al.  Knowledge discovery of drug data on the example of adverse reaction prediction , 2014, BMC Bioinformatics.

[54]  Carmen C. Y. Poon,et al.  Big Data for Health , 2015, IEEE Journal of Biomedical and Health Informatics.

[55]  M. Hilario,et al.  Processing and classification of protein mass spectra. , 2006, Mass spectrometry reviews.

[56]  Jinjun Chen,et al.  A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization Using MapReduce on Cloud , 2014, IEEE Transactions on Parallel and Distributed Systems.

[57]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[58]  Indranil R. Bardhan,et al.  Predictive Analytics for Readmission of Patients with Congestive Heart Failure , 2015, Inf. Syst. Res..

[59]  David L. Olson,et al.  Business Analytics for Supply Chain: a Dynamic-Capabilities Framework , 2013, Int. J. Inf. Technol. Decis. Mak..

[60]  Ludo Waltman,et al.  Constructing bibliometric networks: A comparison between full and fractional counting , 2016, J. Informetrics.

[61]  Luca Berdondini,et al.  Microelectronics, bioinformatics and neurocomputation for massive neuronal recordings in brain circuits with large scale multielectrode array probes , 2015, Brain Research Bulletin.

[62]  Mustafa Y. Sir,et al.  Nurse-patient assignment models considering patient acuity metrics and nurses' perceived workload , 2015, J. Biomed. Informatics.

[63]  Ivo D. Dinov,et al.  Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data , 2016, GigaScience.

[64]  Alison Kenner,et al.  Asthma on the move: how mobile apps remediate risk for disease management , 2016 .

[65]  T M Dugan,et al.  Machine Learning Techniques for Prediction of Early Childhood Obesity. , 2015, Applied clinical informatics.

[66]  Min-Hooi Chuah,et al.  A review of business intelligence and its maturity models , 2011 .

[67]  Mahmoud Al-Ayyoub,et al.  Cloud support for large scale e-healthcare systems , 2016, Ann. des Télécommunications.

[68]  Allan D. Kirk,et al.  Lessons of War: Turning Data Into Decisions , 2015, EBioMedicine.

[69]  Jeff Appelbaum,et al.  Cardiac catheterization laboratory inpatient forecast tool: a prospective evaluation , 2016, J. Am. Medical Informatics Assoc..

[70]  David Meyre,et al.  From big data analysis to personalized medicine for all: challenges and opportunities , 2015, BMC Medical Genomics.

[71]  Debora Viana Thompson,et al.  Defeating feature fatigue. , 2006, Harvard business review.

[72]  Tiffany J. Chen,et al.  Cytobank: providing an analytics platform for community cytometry data analysis and collaboration. , 2014, Current topics in microbiology and immunology.

[73]  Vivian West,et al.  Innovative information visualization of electronic health record data: a systematic review , 2014, J. Am. Medical Informatics Assoc..

[74]  Vikas Kumar,et al.  Understanding variations in pediatric asthma care processes in the emergency department using visual analytics , 2015, J. Am. Medical Informatics Assoc..

[75]  Ronald K. Klimberg,et al.  Benchmarking Academic Programs in Business Analytics , 2014, Interfaces.

[76]  S. Fawcett,et al.  Data Science, Predictive Analytics, and Big Data: A Revolution that Will Transform Supply Chain Design and Management , 2013 .

[77]  Mireille E Schnitzer,et al.  Methods for the assessment of selection bias in drug safety during pregnancy studies using electronic medical data , 2018, Pharmacology research & perspectives.

[78]  T. Murdoch,et al.  The inevitable application of big data to health care. , 2013, JAMA.

[79]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[80]  Yan Zhang,et al.  Quality of health information for consumers on the web: A systematic review of indicators, criteria, tools, and evaluation results , 2015, J. Assoc. Inf. Sci. Technol..

[81]  Samee Ullah Khan,et al.  Future Generation Computer Systems ( ) – Future Generation Computer Systems a Cloud Based Health Insurance Plan Recommendation System: a User Centered Approach , 2022 .

[82]  Landon Fridman Detwiler,et al.  Visualization and analytics tools for infectious disease epidemiology: A systematic review , 2014, J. Biomed. Informatics.

[83]  Lian Duan,et al.  Big data analytics and business analytics , 2015 .

[84]  Marcelo Fantinato,et al.  A study on the author collaboration network in big data* , 2017, Inf. Syst. Frontiers.

[85]  D. Bates,et al.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients. , 2014, Health affairs.

[86]  Eren Demir,et al.  A Decision Support Tool for Predicting Patients at Risk of Readmission: A Comparison of Classification Trees, Logistic Regression, Generalized Additive Models, and Multivariate Adaptive Regression Splines , 2014, Decis. Sci..

[87]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[88]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[89]  A. Abernethy,et al.  Importance of health information technology, electronic health records, and continuously aggregating data to comparative effectiveness research and learning health care. , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[90]  Navonil Mustafee,et al.  Applications of simulation within the healthcare context , 2010, J. Oper. Res. Soc..

[91]  Kensaku Kawamoto,et al.  A three-step approach for the derivation and validation of high-performing predictive models using an operational dataset: congestive heart failure readmission case study , 2014, BMC Medical Informatics and Decision Making.

[92]  Le Zhang,et al.  Relational Network for Knowledge Discovery through Heterogeneous Biomedical and Clinical Features , 2016, Scientific Reports.

[93]  Benjamin T. Hazen,et al.  Back in business: operations research in support of big data analytics for operations and supply chain management , 2016, Annals of Operations Research.