Data Science in Healthcare: Benefits, Challenges and Opportunities

The advent of digital medical data has brought an exponential increase in information available for each patient, allowing for novel knowledge generation methods to emerge. Tapping into this data brings clinical research and clinical practice closer together, as data generated in ordinary clinical practice can be used towards rapid-learning healthcare systems, continuously improving and personalizing healthcare. In this context, the recent use of Data Science technologies for healthcare is providing mutual benefits to both patients and medical professionals, improving prevention and treatment for several kinds of diseases. However, the adoption and usage of Data Science solutions for healthcare still require social capacity, knowledge and higher acceptance. The goal of this chapter is to provide an overview of needs, opportunities, recommendations and challenges of using (Big) Data Science technologies in the healthcare sector. This contribution is based on a recent whitepaper (http://www.bdva.eu/sites/default/files/Big%20Data%20Technologies%20in%20Healthcare.pdf) provided by the Big Data Value Association (BDVA) (http://www.bdva.eu/), the private counterpart to the EC to implement the BDV PPP (Big Data Value PPP) programme, which focuses on the challenges and impact that (Big) Data Science may have on the entire healthcare chain.

[1]  Jan Fostier,et al.  Halvade: scalable sequence analysis with MapReduce , 2015, Bioinform..

[2]  Thierry Hamon,et al.  CLEF eHealth Evaluation Lab 2015 Task 1b: Clinical Named Entity Recognition , 2015, CLEF.

[3]  Diego Reforgiato Recupero,et al.  SuperNoder: a tool to discover over-represented modular structures in networks , 2018, BMC Bioinformatics.

[4]  A. Persidis,et al.  Drug repurposing and adverse event prediction using high‐throughput literature analysis , 2011, Wiley interdisciplinary reviews. Systems biology and medicine.

[5]  Forum on Microbial Threats Big Data and Analytics for Infectious Disease Research, Operations, and Policy: Proceedings of a Workshop , 2016 .

[6]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[7]  Mariana L. Neves,et al.  A survey on annotation tools for the biomedical literature , 2014, Briefings Bioinform..

[8]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[9]  John A. Quelch,et al.  Philips Healthcare: Marketing the HealthSuite Digital Platform , 2015 .

[10]  Diego Reforgiato Recupero,et al.  Deep Learning and Sentiment Analysis for Human-Robot Interaction , 2018, ESWC.

[11]  Peter Z. Yeh,et al.  Multiple Ontologies in Healthcare Information Technology: Motivations and Recommendation for Ontology Mapping and Alignment , 2011, ICBO.

[12]  Maria Kvist,et al.  Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: An annotation and machine learning study , 2014, J. Biomed. Informatics.

[13]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[14]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[15]  Magnus Rattray,et al.  Making sense of big data in health research: Towards an EU action plan , 2016, Genome Medicine.

[16]  Mike May,et al.  LIFE SCIENCE TECHNOLOGIES: Big biological impacts from big data , 2014 .

[17]  Hugo Y. K. Lam,et al.  Detecting and annotating genetic variations using the HugeSeq pipeline , 2012, Nature Biotechnology.

[18]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[19]  K. Bretonnel Cohen,et al.  Mining the pharmacogenomics literature - a survey of the state of the art , 2012, Briefings Bioinform..

[20]  Jan Fostier,et al.  elPrep: High-Performance Preparation of Sequence Alignment/Map Files for Variant Calling , 2015, PloS one.

[21]  Reinhard Busse,et al.  Tackling Chronic Disease in Europe: Strategies, Interventions and Challenges , 2010 .

[22]  Mauricio Santillana,et al.  Accurate estimation of influenza epidemics using Google search data via ARGO , 2015, Proceedings of the National Academy of Sciences.

[23]  Régis Beuscart,et al.  Toward a Literature-Driven Definition of Big Data in Healthcare , 2015, BioMed research international.

[24]  D. Jamison Investing in Health , 2006 .

[25]  Bo Luo,et al.  Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model , 2014, BMC Medical Informatics and Decision Making.

[26]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[27]  Hans Uszkoreit,et al.  Annotation of Entities and Relations in Spanish Radiology Reports , 2017, RANLP.

[28]  M. Porter,et al.  Redefining Health Care: Creating Value-based Competition on Results , 2006 .

[29]  D. Rebholz-Schuhmann,et al.  Text-mining solutions for biomedical research: enabling integrative biology , 2012, Nature Reviews Genetics.

[30]  Diego Reforgiato Recupero,et al.  Leveraging semantics for sentiment polarity detection in social media , 2019, Int. J. Mach. Learn. Cybern..

[31]  Diego Reforgiato Recupero,et al.  Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Document Clustering , 2017, SeWeBMeDA@ESWC.

[32]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[33]  Karin M. Verspoor,et al.  Biomedical Text Mining: State-of-the-Art, Open Problems and Future Challenges , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[34]  Guo-Qiang Zhang,et al.  RMS: a platform for managing cross-disciplinary and multi-institutional research project collaboration , 2014, BMC Medical Informatics and Decision Making.

[35]  D. Sculley,et al.  Hidden Technical Debt in Machine Learning Systems , 2015, NIPS.

[36]  Diego Reforgiato Recupero,et al.  A Recommender System of Medical Reports Leveraging Cognitive Computing and Frame Semantics , 2018, Machine Learning Paradigms.

[37]  Diego Reforgiato Recupero,et al.  Sentilo: Frame-Based Sentiment Analysis , 2014, Cognitive Computation.