An overview of Big Data in Healthcare: multiple angle analyses

Big data have been in use since the 1990s, which usually include some complex data sets whose sizes are beyond the ability of commonly used software to handle within a reasonable period of time. In recent years, big data analytics by providing personalized medicine and regulation analysis, providing clinical risk intervention and forecast analysis, reducing waste and nursing patients with external and internal variability, standardization of medical terminology and patient registration, and fragmentation of the solution, help to improve health care. This paper provides an overview of the contents of big data healthcare. We summarize some kinds of medical big data, including the electronic health records, the medical image data, the healthcare system big data, the health Internet of Things and healthcare informatics, the remote medical monitoring big data, the biomedical big data, and other sources of big data. Furthermore, we discuss some methods for handling different kinds of medical big data. Additionally, we analyze the privacy of medical big data and summarize some methods and technologies to protect privacy. Aiming at some special cases, we list some other analyses and methods for them. Most importantly, we discuss the potential challenges and future research directions related to big data healthcare.

[1]  Atle Bjørnerud,et al.  Towards an Integrated Semantic Framework for Neurological Multidimensional Data Analysis , 2015, IWINAC.

[2]  Xiangji Huang,et al.  Cloud computing and its decision-making for medical and health informatization in the context of big data , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[3]  E. Siegel,et al.  Artificial Intelligence in Medicine and Cardiac Imaging: Harnessing Big Data and Advanced Computing to Provide Personalized Medical Diagnosis and Treatment , 2013, Current Cardiology Reports.

[4]  G. Lissovoy Big data meets the electronic medical record: a commentary on "identifying patients at increased risk for unplanned readmission". , 2013 .

[5]  P. Scuffham,et al.  The Internet of Things: Impact and Implications for Health Care Delivery , 2020, Journal of medical Internet research.

[6]  Anita Grigoriadis,et al.  Big Data: the challenge for small research groups in the era of cancer genomics , 2015, British Journal of Cancer.

[7]  Jianguo Zhang,et al.  Big data issues in medical imaging informatics , 2015, Medical Imaging.

[8]  V. K. Bairagi,et al.  Automated region-based hybrid compression for digital imaging and communications in medicine magnetic resonance imaging images for telemedicine applications , 2012 .

[9]  Belchin Kostov,et al.  Google-driven search for big data in autoimmune geoepidemiology: analysis of 394,827 patients with systemic autoimmune diseases. , 2015, Autoimmunity reviews.

[10]  Dong Li,et al.  Intelligent medical heterogeneous big data set balanced clustering using deep learning , 2020, Pattern Recognit. Lett..

[11]  Meredith A Barrett,et al.  Big Data and Disease Prevention: From Quantified Self to Quantified Communities , 2013, Big Data.

[12]  Subha Madhavan,et al.  G-DOC Plus – an integrative bioinformatics platform for precision medicine , 2016, BMC Bioinformatics.

[13]  Chunxiao Xing,et al.  Chronic Knowledge Retrieval and Smart Health Services Based on Big Data , 2015, ICSH.

[14]  Rengarajan Amirtharajan,et al.  Neural-assisted image-dependent encryption scheme for medical image cloud storage , 2020, Neural Computing and Applications.

[15]  Enrique Herrera-Viedma,et al.  Restoring incomplete PUMLPRs for evaluating the management way of online public opinion , 2020, Inf. Sci..

[16]  Heather Lea Moulaison,et al.  Electronic Health Records Data and Metadata: Challenges for Big Data in the United States , 2013, Big Data.

[17]  Ivan Merelli,et al.  Managing, Analysing, and Integrating Big Data in Medical Bioinformatics: Open Problems and Future Perspectives , 2014, BioMed research international.

[18]  M. Clowse,et al.  The Impact of the COVID-19 Pandemic and Telemedicine Implementation on Practice Patterns and Electronic Health Record Utilization in an Academic Rheumatology Practice , 2021, Journal of clinical rheumatology : practical reports on rheumatic & musculoskeletal diseases.

[19]  David Soriano-Valdez,et al.  The basics of data, big data, and machine learning in clinical practice , 2020, Clinical Rheumatology.

[20]  D. Alghazzawi,et al.  A Reinforcement Learning-Based Framework for Crowdsourcing in Massive Health Care Internet of Things , 2021, Big Data.

[21]  Stephan Velsko,et al.  A Conceptual Architecture for National Biosurveillance: Moving Beyond Situational Awareness to Enable Digital Detection of Emerging Threats. , 2016, Health security.

[22]  Majid Sarrafzadeh,et al.  Improving biomedical signal search results in big data case-based reasoning environments , 2016, Pervasive Mob. Comput..

[23]  Valerio Persico,et al.  Big Data for Health , 2019, Encyclopedia of Big Data Technologies.

[24]  A. Terzic,et al.  Big Data Transforms Discovery–Utilization Therapeutics Continuum , 2016, Clinical pharmacology and therapeutics.

[25]  Uttam Ghosh,et al.  Effective task scheduling algorithm with deep learning for Internet of Health Things (IoHT) in sustainable smart cities , 2021 .

[26]  Takashi Washio,et al.  Data Mining as a Powerful Tool for Creating Novel Drugs in Cardiovascular Medicine: The Importance of a “Back-and-Forth Loop” Between Clinical Data and Basic Research , 2015, Cardiovascular Drugs and Therapy.

[27]  Atul J Butte,et al.  Collaborative Biomedicine in the Age of Big Data: The Case of Cancer , 2014, Journal of medical Internet research.

[28]  Andrew J. Plassard,et al.  Revealing latent value of clinically acquired CTs of traumatic brain injury through multi-atlas segmentation in a retrospective study of 1,003 with external cross-validation , 2015, Medical Imaging.

[29]  Abdusalam Abdulla Shaltooki,et al.  Healthcare big data processing mechanisms: The role of cloud computing , 2019, Int. J. Inf. Manag..

[30]  Gary E Marchant,et al.  Big data in the era of precision medicine: big promise or big liability? , 2016, Personalized medicine.

[31]  A. Sonnenberg,et al.  Big data in gastroenterology research , 2014, Nature Reviews Gastroenterology &Hepatology.

[32]  SugumaranVijayan,et al.  A multi-agent system to support evidence based medicine and clinical decision making via data sharing and data privacy , 2016 .

[33]  Jason H. Moore,et al.  Embedding covariate adjustments in tree-based automated machine learning for biomedical big data analyses , 2020, BMC Bioinformatics.

[34]  Zeshui Xu,et al.  Towards felicitous decision making: An overview on challenges and trends of Big Data , 2016, Inf. Sci..

[35]  A. A. Bykov,et al.  Fluorescent Nanoscale Structures for Selective Medical Diagnostics , 2016 .

[36]  Yehia Taher,et al.  In Search of Big Medical Data Integration Solutions - A Comprehensive Survey , 2019, IEEE Access.

[37]  Srinivasan Suresh,et al.  Big Data and Predictive Analytics: Applications in the Care of Children. , 2016, Pediatric clinics of North America.

[38]  M. Hilbert,et al.  Big Data for Development: A Review of Promises and Challenges , 2016 .

[39]  Xiaoming Li,et al.  Comorbidity patterns among people living with HIV: a hierarchical clustering approach through integrated electronic health records data in South Carolina , 2020, AIDS care.

[40]  Xiaohui Hu,et al.  Discovering New Analytical Methods for Large Volume Medical and Online Data Processing , 2014, HIS.

[41]  Aboul Ella Hassanien,et al.  Dimensionality reduction of medical big data using neural-fuzzy classifier , 2014, Soft Computing.

[42]  Svetha Venkatesh,et al.  Using Twitter to learn about the autism community , 2015, Social Network Analysis and Mining.

[43]  A. Patkar,et al.  Precision medicine for psychopharmacology: a general introduction , 2016, Expert review of neurotherapeutics.

[44]  Daqiang Zhang,et al.  Health Internet of Things: Recent Applications and Outlook , 2015 .

[45]  G. Meyfroidt,et al.  Informatics in neurocritical care: new ideas for Big Data , 2016, Current opinion in critical care.

[46]  Chonghui Guo,et al.  Big Data Analytics in Healthcare: Data-Driven Methods for Typical Treatment Pattern Mining , 2019, Journal of Systems Science and Systems Engineering.

[47]  Hasso Plattner,et al.  The Medical Knowledge Cockpit: Real-time analysis of big medical data enabling precision medicine , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[48]  Muhammad Saleem,et al.  Big linked cancer data: Integrating linked TCGA and PubMed , 2014, J. Web Semant..

[49]  Nader Karimi,et al.  Toward practical guideline for design of image compression algorithms for biomedical applications , 2016, Expert Syst. Appl..

[50]  RadhaKanta Mahapatra,et al.  Business data mining - a machine learning perspective , 2001, Inf. Manag..

[51]  Shusen Yang,et al.  The prediction of asymptomatic carotid atherosclerosis with electronic health records: a comparative study of six machine learning models , 2021, BMC Medical Informatics and Decision Making.

[52]  Ching-Hsien Hsu,et al.  Analysis of Electronic Health Records Based on Deep Learning with Natural Language Processing , 2021 .

[53]  M. Srivathsan,et al.  Health Monitoring System by Prognotive Computing Using Big Data Analytics , 2015 .

[54]  Koldo Gojenola,et al.  Computer aided classification of diagnostic terms in spanish , 2015, Expert Syst. Appl..

[55]  Andy Podgurski,et al.  The Use and Misuse of Biomedical Data: Is Bigger Really Better? , 2013, American Journal of Law & Medicine.

[56]  Robert K. Eastlack,et al.  Artificial intelligence clustering of adult spinal deformity sagittal plane morphology predicts surgical characteristics, alignment, and outcomes , 2021, European Spine Journal.

[57]  Matthias Eikermann,et al.  Codifying healthcare – big data and the issue of misclassification , 2015, BMC Anesthesiology.

[58]  Yonggwan Won,et al.  Application of big data analysis with decision tree for the foot disorder , 2015, Cluster Computing.

[59]  Rob Law,et al.  Jose Maria Cavanillas, Edward Curry, and Wolfgang Wahlster (editors): new horizons for a data-driven economy: a roadmap for usage and exploitation of big data in Europe , 2016, Information Technology & Tourism.

[60]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[61]  O Jones,et al.  Accessing primary care Big Data: the development of a software algorithm to explore the rich content of consultation records , 2015, BMJ Open.

[62]  P. Dhavachelvan,et al.  Big Data and Hadoop-a Study in Security Perspective , 2015 .

[64]  Jinjun Chen,et al.  HireSome-II: Towards Privacy-Aware Cross-Cloud Service Composition for Big Data Applications , 2015, IEEE Transactions on Parallel and Distributed Systems.

[65]  Mohamed Eddabbah,et al.  AWS and IoT for real-time remote medical monitoring , 2019 .

[66]  Xin Huang,et al.  Hadoop-Based Medical Image Storage and Access Method for Examination Series , 2021, Mathematical Problems in Engineering.

[67]  Wolfgang Wahlster,et al.  New Horizons for a Data-Driven Economy , 2016, Springer International Publishing.

[68]  Franco Milicchio,et al.  Visual programming for next-generation sequencing data analytics , 2016, BioData Mining.

[69]  Fabrício F. Costa Big data in biomedicine. , 2014, Drug discovery today.

[70]  Angel Ramos-Miguel,et al.  Use of data mining to predict significant factors and benefits of bilateral cochlear implantation , 2015, European Archives of Oto-Rhino-Laryngology.

[71]  Ashutosh Kumar Singh,et al.  Preserving Privacy of Patients Based on Re-identification Risk , 2015 .

[72]  Jian-yu Lu,et al.  Establishing a University Library-Based Health Information Literacy Service Model in the Age of Big Data , 2016 .

[73]  Fei Wang,et al.  Adaptive semi-supervised recursive tree partitioning: The ART towards large scale patient indexing in personalized healthcare , 2015, J. Biomed. Informatics.

[74]  Vojtech Huser,et al.  Impending Challenges for the Use of Big Data. , 2016, International journal of radiation oncology, biology, physics.

[75]  De-gan Zhang,et al.  New Medical Image Fusion Approach with Coding Based on SCD in Wireless Sensor Network , 2015 .

[76]  Ken Sakamura,et al.  Fine-grained access control to medical records in digital healthcare enterprises , 2015, 2015 International Symposium on Networks, Computers and Communications (ISNCC).

[77]  Clive R Taylor,et al.  Practicing Pathology in the Era of Big Data and Personalized Medicine , 2014, Applied immunohistochemistry & molecular morphology : AIMM.

[78]  Dazhe Zhao,et al.  SP-MIOV: A novel framework of shadow proxy based medical image online visualization in computing and storage resource restrained environments , 2020, Future Gener. Comput. Syst..

[79]  Natasa Przulj,et al.  Integrative methods for analyzing big data in precision medicine , 2016, Proteomics.

[80]  R. Wang,et al.  The current status and challenges of establishment and utilization of medical big data in China , 2015 .

[81]  Dima Alhadidi,et al.  Secure and Private Management of Healthcare Databases for Data Mining , 2015, 2015 IEEE 28th International Symposium on Computer-Based Medical Systems.

[82]  Zhongheng Zhang,et al.  Big data and clinical research: focusing on the area of critical care medicine in mainland China. , 2014, Quantitative imaging in medicine and surgery.

[83]  David Windridge,et al.  Challenges in designing an online healthcare platform for personalised patient analytics , 2014, 2014 IEEE Symposium on Computational Intelligence in Big Data (CIBD).

[84]  Tarek A. El-Ghazawi,et al.  Predicting the severity of motor neuron disease progression using electronic health record data with a cloud computing Big Data approach , 2014, 2014 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology.

[85]  Jianping Fan,et al.  Hierarchical Classification of Large-Scale Patient Records for Automatic Treatment Stratification , 2015, IEEE Journal of Biomedical and Health Informatics.

[86]  Galia Angelova,et al.  Text Mining and Big Data Analytics for Retrospective Analysis of Clinical Texts from Outpatient Care , 2015 .

[87]  Kayvan Najarian,et al.  Big Data Analytics in Healthcare , 2015, BioMed research international.

[88]  Shichao Zhang,et al.  Efficient kNN classification algorithm for big data , 2016, Neurocomputing.

[89]  Pingyi Fan,et al.  A lossless compression method for multi-component medical images based on big data mining , 2021, Scientific reports.

[90]  M. Mostert,et al.  Big Data in medical research and EU data protection law: challenges to the consent or anonymise approach , 2016, European Journal of Human Genetics.

[91]  Alexey Tsymbal,et al.  Towards cloud-based image-integrated similarity search in big data , 2014, IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI).

[92]  K. Teng,et al.  Personalized healthcare in the era of value-based healthcare. , 2013, Personalized medicine.

[93]  Chang Liu,et al.  A cloud-based framework for Home-diagnosis service over big medical data , 2015, J. Syst. Softw..

[94]  Ying Chen,et al.  IBM Watson: How Cognitive Computing Can Be Applied to Big Data Challenges in Life Sciences Research. , 2016, Clinical therapeutics.

[95]  Carolyn McGregor Wearable monitors on babies: Big data saving little people , 2013, 2013 IEEE International Symposium on Technology and Society (ISTAS): Social Implications of Wearable Computing and Augmediated Reality in Everyday Life.

[96]  Daniel F. Keefe,et al.  Grand Challenge: Applying Regulatory Science and Big Data to Improve Medical Device Innovation , 2013, IEEE Transactions on Biomedical Engineering.

[97]  Khalil Drira,et al.  A Semantic Big Data Platform for Integrating Heterogeneous Wearable Data in Healthcare , 2015, Journal of Medical Systems.

[98]  Le Zhang,et al.  Relational Network for Knowledge Discovery through Heterogeneous Biomedical and Clinical Features , 2016, Scientific Reports.

[99]  Zeshui Xu,et al.  A Bibliometrics analysis on big data research (2009–2018) , 2019, Journal of Data, Information and Management.

[100]  Yu Tian,et al.  Design and Implementation of Clinical Data Integration and Management System Based on Hadoop Platform , 2015, 2015 7th International Conference on Information Technology in Medicine and Education (ITME).

[101]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[102]  Dongsheng Zhao,et al.  A Fast Statistics and Analysis Solution of Medical Service Big Data , 2015, 2015 7th International Conference on Information Technology in Medicine and Education (ITME).

[103]  Keke Gai,et al.  Electronic Health Record Error Prevention Approach Using Ontology in Big Data , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[104]  Ying Liu,et al.  Transcriptome marker diagnostics using big data. , 2016, IET systems biology.

[105]  Si Liu,et al.  Novel fusion computing method for bio-medical image of WSN based on spherical coordinate , 2016 .

[106]  S. Dhivya,et al.  Cloud Based Fuzzy Healthcare System , 2015 .

[107]  Marek Ostaszewski,et al.  Integration and Visualization of Translational Medicine Data for Better Understanding of Human Diseases , 2016, Big Data.

[108]  J. Thorpe,et al.  Comparative effectiveness research and big data: balancing potential with legal and ethical considerations. , 2015, Journal of comparative effectiveness research.

[109]  Witold Pedrycz,et al.  An overview on the roles of fuzzy set techniques in big data processing: Trends, challenges and opportunities , 2017, Knowl. Based Syst..

[110]  Zhidong Cao,et al.  Comorbidity Analysis According to Sex and Age in Hypertension Patients in China , 2016, International journal of medical sciences.

[111]  Joel H. Saltz,et al.  Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies , 2015, BMC Bioinformatics.

[112]  Tughrul Arslan,et al.  Parallel Delay Multiply and Sum Algorithm for Microwave Medical Imaging Using Spark Big Data Framework , 2021, Algorithms.