Big data analytics in health sector: Theoretical framework, techniques and prospects

Abstract Clinicians, healthcare providers-suppliers, policy makers and patients are experiencing exciting opportunities in light of new information deriving from the analysis of big data sets, a capability that has emerged in the last decades. Due to the rapid increase of publications in the healthcare industry, we have conducted a structured review regarding healthcare big data analytics. With reference to the resource-based view theory we focus on how big data resources are utilised to create organization values/capabilities, and through content analysis of the selected publications we discuss: the classification of big data types related to healthcare, the associate analysis techniques, the created value for stakeholders, the platforms and tools for handling big health data and future aspects in the field. We present a number of pragmatic examples to show how the advances in healthcare were made possible. We believe that the findings of this review are stimulating and provide valuable information to practitioners, policy makers and researchers while presenting them with certain paths for future research.

[1]  Li Wang,et al.  NeuroPigPen: A Scalable Toolkit for Processing Electrophysiological Signal Data in Neuroscience Applications Using Apache Pig , 2016, Front. Neuroinform..

[2]  A. Gunasekaran,et al.  Green supply chain management: theoretical framework and further research directions , 2017 .

[3]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[4]  Stephen Relyea,et al.  An Analytics Approach to Designing Combination Chemotherapy Regimens for Cancer , 2016, Manag. Sci..

[5]  Navonil Mustafee,et al.  A game-based approach towards facilitating decision making for perishable products: An example of blood supply chain , 2014, Expert Syst. Appl..

[6]  Dursun Delen,et al.  Analysis of cancer data: a data mining approach , 2009, Expert Syst. J. Knowl. Eng..

[7]  Min Chen,et al.  Disease Prediction by Machine Learning Over Big Data From Healthcare Communities , 2017, IEEE Access.

[8]  Martin Schmitz,et al.  Scalable Predictive Analysis in Critically Ill Patients Using a Visual Open Data Analysis Platform , 2016, PloS one.

[9]  Yogesh Kumar Dwivedi,et al.  Artificial intelligence for decision making in the era of Big Data - evolution, challenges and research agenda , 2019, Int. J. Inf. Manag..

[10]  Bernhard Preim,et al.  Interactive Visual Analysis of Heterogeneous Cohort-Study Data , 2014, IEEE Computer Graphics and Applications.

[11]  Viju Raghupathi,et al.  An Overview of Health Analytics , 2013 .

[12]  Arpan Kumar Kar,et al.  Big data with cognitive computing: A review for the future , 2018, Int. J. Inf. Manag..

[13]  Muhammad Shiraz,et al.  Big Data: Survey, Technologies, Opportunities, and Challenges , 2014, TheScientificWorldJournal.

[14]  Tiffany J. Chen,et al.  Cytobank: providing an analytics platform for community cytometry data analysis and collaboration. , 2014, Current topics in microbiology and immunology.

[15]  Ilona Buchem,et al.  Wearable Enhanced Learning for Healthy Ageing : Conceptual Framework and Architecture of the “ Fitness MOOC ” , 2015 .

[16]  Tanja Grubljesic,et al.  The role of compatibility in predicting business intelligence and analytics use intentions , 2018, Int. J. Inf. Manag..

[17]  Yike Guo,et al.  Wiki-health : from quantified self to self-understanding , 2014 .

[18]  Victor Chang,et al.  A review and future direction of agile, business intelligence, analytics and data science , 2016, Int. J. Inf. Manag..

[19]  Georgia D Tourassi,et al.  Predicting diagnostic error in radiology via eye-tracking and image analytics: preliminary investigation in mammography. , 2013, Medical physics.

[20]  Yang Yu,et al.  FVGWAS: Fast voxelwise genome wide association analysis of large-scale imaging genetic data , 2015, NeuroImage.

[21]  Vicente García-Díaz,et al.  Machine learning classification analysis for a hypertensive population as a function of several risk factors , 2018, Expert Syst. Appl..

[22]  Álvaro Rocha,et al.  A health data analytics maturity model for hospitals information systems , 2019, Int. J. Inf. Manag..

[23]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[24]  S. Fawcett,et al.  Data Science, Predictive Analytics, and Big Data: A Revolution that Will Transform Supply Chain Design and Management , 2013 .

[25]  Vitalii Doban,et al.  Big data, advanced analytics and the future of comparative effectiveness research. , 2014, Journal of comparative effectiveness research.

[26]  Andy Haas,et al.  A Real-Time Magnetoencephalography Brain-Computer Interface Using Interactive 3D Visualization and the Hadoop Ecosystem , 2015, Brain sciences.

[27]  Peter Groves,et al.  The 'big data' revolution in healthcare: Accelerating value and innovation , 2016 .

[28]  John Whang,et al.  Actionable data analytics in oncology: are we there yet? , 2014, Journal of oncology practice.

[29]  Vikas Kumar,et al.  Understanding variations in pediatric asthma care processes in the emergency department using visual analytics , 2015, J. Am. Medical Informatics Assoc..

[30]  Athanasios V. Vasilakos,et al.  Big data: From beginning to future , 2016, Int. J. Inf. Manag..

[31]  Sungyong Lee,et al.  The Mining Minds digital health and wellness framework , 2016, Biomedical engineering online.

[32]  J. Barney Firm Resources and Sustained Competitive Advantage , 1991 .

[33]  Hong Zheng,et al.  Massive Medical Images Retrieval System Based on Hadoop , 2014, J. Multim..

[34]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[35]  Yichuan Wang,et al.  An integrated big data analytics-enabled transformation model: Application to health care , 2018, Inf. Manag..

[36]  Jake Luo,et al.  Big Data Application in Biomedical Research and Health Care: A Literature Review , 2016, Biomedical informatics insights.

[37]  Doron Lancet,et al.  GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data , 2016, Omics : a journal of integrative biology.

[38]  Ben-Ari FuchsShani,et al.  GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data , 2016 .

[39]  Ofir Turel,et al.  Increasing firm agility through the use of data analytics: The role of fit , 2017, Decis. Support Syst..

[40]  Régis Beuscart,et al.  Toward a Literature-Driven Definition of Big Data in Healthcare , 2015, BioMed research international.

[41]  Benjamin T. Hazen,et al.  Big data and predictive analytics for supply chain and organizational performance , 2017 .

[42]  Mohammad-Reza Siadat,et al.  Unstructured medical image query using big data - An epilepsy case study , 2016, J. Biomed. Informatics.

[43]  Lian Duan,et al.  Big data analytics and business analytics , 2015 .

[44]  Angappa Gunasekaran,et al.  Empirical investigation of data analytics capability and organizational flexibility as complements to supply chain resilience , 2019, Int. J. Prod. Res..

[45]  D. Jacofsky,et al.  The myths of 'big data' in health care. , 2017, The bone & joint journal.

[46]  Dhiya Al-Jumeily,et al.  Machine learning approaches to the application of disease modifying therapy for sickle cell using classification models , 2017, Neurocomputing.

[47]  William M. Wells,et al.  A Feature-Based Approach to Big Data Analysis of Medical Images , 2015, IPMI.

[48]  Kayvan Najarian,et al.  Big Data Analytics in Healthcare , 2015, BioMed research international.

[49]  Jeff Appelbaum,et al.  Cardiac catheterization laboratory inpatient forecast tool: a prospective evaluation , 2016, J. Am. Medical Informatics Assoc..

[50]  Feras Batarseh,et al.  Assessing the Quality of Service Using Big Data Analytics: With Application to Healthcare , 2016, Big Data Res..

[51]  David Meyre,et al.  From big data analysis to personalized medicine for all: challenges and opportunities , 2015, BMC Medical Genomics.

[52]  Matthew J. Liberatore,et al.  The analytic hierarchy process in medical and health care decision making: A literature review , 2008, Eur. J. Oper. Res..

[53]  K. Marsolo,et al.  Applications of Business Analytics in Healthcare. , 2014, Business horizons.

[54]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[55]  Shahriar Akter,et al.  How ‘Big Data’ Can Make Big Impact: Findings from a Systematic Review and a Longitudinal Case Study , 2015 .

[56]  Steven Woolf,et al.  Holistics 3.0 for Health , 2014, ISPRS Int. J. Geo Inf..

[57]  Kipp W. Johnson,et al.  Machine learning in cardiovascular medicine: are we there yet? , 2018, Heart.

[58]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[59]  David Calabrese,et al.  Pharmacosynchrony: Road Map to Transformation in Pharmacy Benefi t Management , 2014 .

[60]  Morgan Swink,et al.  An Investigation of Visibility and Flexibility as Complements to Supply Chain Analytics: An Organizational Information Processing Theory Perspective , 2018 .

[61]  Paul P. Maglio,et al.  From data to value: A nine-factor framework for data-based value creation in information-intensive services , 2018, Int. J. Inf. Manag..

[62]  Shahriar Akter,et al.  Big data analytics and firm performance: Effects of dynamic capabilities , 2017 .

[63]  J. Broughman,et al.  Using big data for quality assessment in oncology. , 2016, Journal of comparative effectiveness research.

[64]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[65]  Andrew Stranieri,et al.  Addressing the Complexities of Big Data Analytics in Healthcare: The Diabetes Screening Case , 2015, Australas. J. Inf. Syst..

[66]  Jeffrey Soar,et al.  Cloud computing-enabled healthcare opportunities, issues, and applications: A systematic review , 2018, Int. J. Inf. Manag..

[67]  Cyril R. H. Foropon,et al.  Big data analytics and organizational culture as complements to swift trust and collaborative performance in the humanitarian supply chain , 2019, International Journal of Production Economics.

[68]  Ivo D. Dinov,et al.  Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data , 2016, GigaScience.

[69]  Yao Zhang,et al.  Uses of information and communication technologies in HIV self-management: A systematic review of global literature , 2017, Int. J. Inf. Manag..

[70]  F. Cabitza,et al.  Unintended Consequences of Machine Learning in Medicine , 2017, JAMA.

[71]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[72]  Sumeet Dua,et al.  Introduction to Machine Learning in Healthcare Informatics , 2014, Machine Learning in Healthcare Informatics.

[73]  Charbel José Chiappetta Jabbour,et al.  Management theory and big data literature: From a review to a research agenda , 2018, Int. J. Inf. Manag..

[74]  Karin M. Verspoor,et al.  Biomedical Text Mining: State-of-the-Art, Open Problems and Future Challenges , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[75]  A T McCray,et al.  Biomedical ontologies. , 2005, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[76]  Yacine Rezgui,et al.  Progress in ambient assisted systems for independent living by the elderly , 2016, SpringerPlus.

[77]  Samee Ullah Khan,et al.  Future Generation Computer Systems ( ) – Future Generation Computer Systems a Cloud Based Health Insurance Plan Recommendation System: a User Centered Approach , 2022 .

[78]  Surendra Sarnikar,et al.  A framework for developing a domain specific business intelligence maturity model: Application to healthcare , 2015, Int. J. Inf. Manag..

[79]  A. Mackey,et al.  An In Vitro Cynomolgus Vascular Surrogate System for Preclinical Drug Assessment and Human Translation , 2015, Arteriosclerosis, thrombosis, and vascular biology.

[80]  Stanislav Mamonov,et al.  The strategic value of data resources in emergent industries , 2018, Int. J. Inf. Manag..

[81]  Lemuria Carter,et al.  literature review of RFID-enabled healthcare applications and issues amuel , 2013 .

[82]  Helmut Krcmar,et al.  Big Data , 2014, Wirtschaftsinf..

[83]  Alfonso Farruggia,et al.  A text based indexing system for mammographic image retrieval and classification , 2014, Future Gener. Comput. Syst..

[84]  Courtney Corley,et al.  Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance , 2010, Comput. Methods Programs Biomed..

[85]  Kai Yang,et al.  An analytics approach to designing patient centered medical homes , 2014, Health care management science.

[86]  Valerio Persico,et al.  Big Data for Health , 2019, Encyclopedia of Big Data Technologies.

[87]  Emmanouel Garoufallou,et al.  The Effectiveness of Big Data in Health Care: A Systematic Review , 2014, MTSR.

[88]  Ying Wah Teh,et al.  Big data reduction framework for value creation in sustainable enterprises , 2016, Int. J. Inf. Manag..

[89]  D. Tranfield,et al.  Towards a Methodology for Developing Evidence-Informed Management Knowledge by Means of Systematic Review , 2003 .

[90]  Richard Vidgen,et al.  Management challenges in creating value from business analytics , 2017, Eur. J. Oper. Res..

[91]  Guo-Qiang Zhang,et al.  Biomedical Ontology Quality Assurance Using a Big Data Approach , 2016, ACM Trans. Knowl. Discov. Data.

[92]  Constantin Zopounidis,et al.  Editorial to the special issue “business analytics” , 2016 .

[93]  Kim,et al.  Big Data in Healthcare Hype and Hope , 2013 .

[94]  Viju Raghupathi,et al.  An Unstructured Information Management Architecture Approach to Text Analytics of Cancer Blogs , 2014, Int. J. Heal. Inf. Syst. Informatics.

[95]  James H. Oliver,et al.  A Multi-facetted Visual Analytics Tool for Exploratory Analysis of Human Brain and Function Datasets , 2016, Front. Neuroinform..