Big Data Usage Patterns in the Health Care Domain: A Use Case Driven Approach Applied to the Assessment of Vaccination Benefits and Risks

BACKGROUND Generally benefits and risks of vaccines can be determined from studies carried out as part of regulatory compliance, followed by surveillance of routine data; however there are some rarer and more long term events that require new methods. Big data generated by increasingly affordable personalised computing, and from pervasive computing devices is rapidly growing and low cost, high volume, cloud computing makes the processing of these data inexpensive. OBJECTIVE To describe how big data and related analytical methods might be applied to assess the benefits and risks of vaccines. METHOD We reviewed the literature on the use of big data to improve health, applied to generic vaccine use cases, that illustrate benefits and risks of vaccination. We defined a use case as the interaction between a user and an information system to achieve a goal. We used flu vaccination and pre-school childhood immunisation as exemplars. RESULTS We reviewed three big data use cases relevant to assessing vaccine benefits and risks: (i) Big data processing using crowdsourcing, distributed big data processing, and predictive analytics, (ii) Data integration from heterogeneous big data sources, e.g. the increasing range of devices in the "internet of things", and (iii) Real-time monitoring for the direct monitoring of epidemics as well as vaccine effects via social media and other data sources. CONCLUSIONS Big data raises new ethical dilemmas, though its analysis methods can bring complementary real-time capabilities for monitoring epidemics and assessing vaccine benefit-risk balance.

[1]  S. de Lusignan,et al.  Developing a survey instrument to assess the readiness of primary care data, genetic and disease registries to conduct linked research: TRANSFoRm International Research Readiness (TIRRE) survey instrument. , 2013, Informatics in primary care.

[2]  Dylan B. George,et al.  Big Data Opportunities for Global Infectious Disease Surveillance , 2013, PLoS medicine.

[3]  D. Fleming,et al.  The representativeness of sentinel practice networks. , 2010, Journal of public health.

[4]  Sourav Bandyopadhyay,et al.  Bringing it all together: big data and HIV research. , 2013, AIDS.

[5]  Galit Shmueli,et al.  Early statistical detection of anthrax outbreaks by tracking over-the-counter medication sales , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[6]  D M Fleming,et al.  Estimating influenza vaccine effectiveness using routinely collected laboratory data , 2009, Journal of Epidemiology & Community Health.

[7]  S. de Lusignan,et al.  Accelerating the development of an information ecosystem in health care, by stimulating the growth of safe intermediate processing of health information (IPHI). , 2013, Informatics in primary care.

[8]  Murat M. Tanik,et al.  A self-updating road map of The Cancer Genome Atlas , 2013, Bioinform..

[9]  Che-Lun Hung,et al.  Open Reading Frame Phylogenetic Analysis on the Cloud , 2013, International journal of genomics.

[10]  K Denecke,et al.  How to Exploit Twitter for Public Health Monitoring? , 2013, Methods of Information in Medicine.

[11]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[12]  David Madigan,et al.  Multiple Self‐Controlled Case Series for Large‐Scale Longitudinal Observational Databases , 2013, Biometrics.

[13]  D M Fleming,et al.  An assessment of the effect of statin use on the incidence of acute respiratory infections in England during winters 1998–1999 to 2005–2006 , 2010, Epidemiology and Infection.

[14]  Wanting Huang,et al.  Telephone monitoring of adverse events during an MF59®-adjuvanted H5N1 influenza vaccination campaign in Taiwan , 2014, Human vaccines & immunotherapeutics.

[15]  T. Jefferson,et al.  Vaccines for preventing influenza in healthy children. , 2012, The Cochrane database of systematic reviews.

[16]  Kenji Mizuguchi,et al.  Sagace: A web-based search engine for biomedical databases in Japan , 2012, BMC Research Notes.

[17]  John M. Fonner,et al.  Leveraging the national cyberinfrastructure for biomedical research , 2013, J. Am. Medical Informatics Assoc..

[18]  G. Eysenbach Infodemiology and Infoveillance: Framework for an Emerging Set of Public Health Informatics Methods to Analyze Search, Communication and Publication Behavior on the Internet , 2009, Journal of medical Internet research.

[19]  D. Fleming,et al.  Vaccine effectiveness of 2011/12 trivalent seasonal influenza vaccine in preventing laboratory-confirmed influenza in primary care in the United Kingdom: evidence of waning intra-seasonal protection. , 2013, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[20]  A. Holton,et al.  Twitter as a source of vaccination information: content drivers and what they are saying. , 2013, American journal of infection control.

[21]  Michael M. Wagner,et al.  Telephone Triage: A Timely Data Source for Surveillance of Influenza-like Diseases , 2003, AMIA.

[22]  D. Burwen,et al.  Chart-confirmed guillain-barre syndrome after 2009 H1N1 influenza vaccination among the Medicare population, 2009-2010. , 2013, American journal of epidemiology.

[23]  T. Murdoch,et al.  The inevitable application of big data to health care. , 2013, JAMA.

[24]  D. Mohr,et al.  Behavioral intervention technologies: evidence review and recommendations for future research in mental health. , 2013, General hospital psychiatry.

[25]  R. Steinbrook Personally controlled online health data--the next big thing in medical care? , 2008, The New England journal of medicine.

[26]  Dean Leffingwell,et al.  Managing Software Requirements: A Use Case Approach , 2003 .

[27]  Chien-Hung Chen,et al.  Heart beats in the cloud: distributed analysis of electrophysiological 'Big Data' using cloud computing for epilepsy clinical research , 2014, J. Am. Medical Informatics Assoc..

[28]  Shuang-Hua Yang,et al.  How the internet of things technology enhances emergency response operations , 2013 .

[29]  Gina Neff,et al.  Why Big Data Won't Cure Us , 2013, Big Data.

[30]  Lynne Dailey,et al.  Research Paper: Timeliness of Data Sources Used for Influenza Surveillance , 2007, J. Am. Medical Informatics Assoc..

[31]  H Müller,et al.  Health information search to deal with the exploding amount of health information produced. , 2012, Methods of information in medicine.

[32]  Douglas E Green,et al.  Can big data lead us to big savings? , 2013, Radiographics : a review publication of the Radiological Society of North America, Inc.

[33]  Neil Bahroos,et al.  Leverage hadoop framework for large scale clinical informatics applications. , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[34]  H. Clothier,et al.  Active surveillance for adverse events following immunization , 2014, Expert review of vaccines.

[35]  Tao Zhang,et al.  International collaboration to assess the risk of Guillain Barré Syndrome following Influenza A (H1N1) 2009 monovalent vaccines. , 2013, Vaccine.

[36]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[37]  V Demicheli,et al.  Efficacy and effectiveness of influenza vaccines in elderly people: a systematic review , 2005, The Lancet.

[38]  P. Baghurst,et al.  Consumer reporting of adverse events following immunization (AEFI) , 2014, Human vaccines & immunotherapeutics.

[39]  Adam Barker,et al.  Undefined By Data: A Survey of Big Data Definitions , 2013, ArXiv.

[40]  Björn Regnell,et al.  A hierarchical use case model with graphical representation , 1996, Proceedings IEEE Symposium and Workshop on Engineering of Computer-Based Systems.

[41]  A. Darzi,et al.  Harnessing the cloud of patient experience: using social media to detect poor quality healthcare , 2013, BMJ quality & safety.

[42]  Arash Jalali,et al.  Leveraging Cloud Computing to Address Public Health Disparities: An Analysis of the SPHPS , 2012, Online journal of public health informatics.

[43]  Marcel Salathé,et al.  Assessing Vaccination Sentiments with Online Social Media: Implications for Infectious Disease Dynamics and Control , 2011, PLoS Comput. Biol..

[44]  Stefania Salmaso,et al.  Vaccine adverse event monitoring systems across the European Union countries: time for unifying efforts. , 2009, Vaccine.

[45]  ECDC in collaboration with the VAESCO consortium to develop a complementary tool for vaccine safety monitoring in Europe. , 2009, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[46]  Robert J. Taylor,et al.  Global Mortality Estimates for the 2009 Influenza Pandemic from the GLaMOR Project: A Modeling Study , 2013, PLoS medicine.

[47]  Paul Krause,et al.  Conducting requirements analyses for research using routinely collected health data: a model driven approach. , 2012, Studies in health technology and informatics.

[48]  D. Fleming,et al.  Lessons from 40 years' surveillance of influenza in England and Wales , 2007, Epidemiology and Infection.

[49]  Robert T. Chen,et al.  The role of the Vaccine Adverse Event Reporting system (VAERS) in monitoring vaccine safety. , 2004, Pediatric annals.

[50]  Robert M. Stephens,et al.  Knowledge and Theme Discovery across Very Large Biological Data Sets Using Distributed Queries: A Prototype Combining Unstructured and Structured Data , 2013, PloS one.

[51]  Steve Feng,et al.  Crowd-sourced BioGames: managing the big data problem for next-generation lab-on-a-chip platforms. , 2012, Lab on a chip.

[52]  Tom Jefferson,et al.  Vaccines for preventing influenza in the elderly. , 2010, The Cochrane database of systematic reviews.

[53]  Benyuan Liu,et al.  Predicting Flu Trends using Twitter data , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[54]  Cees T. A. M. de Laat,et al.  Addressing big data issues in Scientific Data Infrastructure , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[55]  G. Lissovoy Big data meets the electronic medical record: a commentary on "identifying patients at increased risk for unplanned readmission". , 2013 .

[56]  P. Palese,et al.  Universal influenza virus vaccines: need for clinical trials , 2013, Nature Immunology.

[57]  G. Nolan,et al.  Computational solutions to large-scale data management and analysis , 2010, Nature Reviews Genetics.

[58]  H. Sacks,et al.  The Efficacy of Influenza Vaccine in Elderly Persons , 1995, Annals of Internal Medicine.

[59]  A Lyon,et al.  Comparison of web-based biosecurity intelligence systems: BioCaster, EpiSPIDER and HealthMap. , 2012, Transboundary and emerging diseases.

[60]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[61]  Atanas Radenski,et al.  Speeding-up codon analysis on the cloud with local MapReduce aggregation , 2014, Inf. Sci..

[62]  Eleftherios Mylonakis,et al.  Google trends: a web-based tool for real-time surveillance of disease outbreaks. , 2009, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[63]  Bill Fox Using big data for big impact. Leveraging data and analytics provides the foundation for rethinking how to impact patient behavior. , 2011, Health management technology.

[64]  N H Shah,et al.  Translational Bioinformatics Embraces Big Data , 2012, Yearbook of Medical Informatics.

[65]  Michael Nguyen,et al.  Post-licensure rapid immunization safety monitoring program (PRISM) data characterization. , 2013, Vaccine.

[66]  Catherine Bartlett,et al.  Twitter and public health. , 2015, Journal of public health management and practice : JPHMP.

[67]  Mona Choi,et al.  Nursing Critical Patient Severity Classification System Predicts Outcomes in Patients Admitted to Surgical Intensive Care Units: Use of Data from Clinical Data Repository , 2013, MedInfo.

[68]  J. Narula,et al.  Are we up to speed?: from big data to rich insights in CV imaging for a hyperconnected world. , 2013, JACC. Cardiovascular imaging.

[69]  David J. Stone,et al.  "Big data" in the intensive care unit. Closing the data loop. , 2013, American journal of respiratory and critical care medicine.

[70]  S. Jacobsen,et al.  Agreement between medical record and parent report for evaluation of childhood febrile seizures. , 2013, Vaccine.

[71]  Andrew Page,et al.  Systematic review of reporting rates of adverse events following immunization: an international comparison of post-marketing surveillance programs with reference to China. , 2013, Vaccine.

[72]  Bill Fox Using big data for big impact. How predictive modeling can affect patient outcomes. , 2012, Health management technology.

[73]  Simon de Lusignan,et al.  The roles of policy and professionalism in the protection of processed clinical data: A literature review , 2007, Int. J. Medical Informatics.

[74]  Mark H. Ellisman,et al.  Data-intensive e-science frontier research , 2003, CACM.

[75]  M. Shigematsu,et al.  Experimental surveillance using data on sales of over-the-counter medications--Japan, November 2003-April 2004. , 2005, MMWR supplements.

[76]  P. Horby,et al.  Estimated global mortality associated with the first 12 months of 2009 pandemic influenza A H1N1 virus circulation: a modelling study. , 2012, The Lancet. Infectious diseases.

[77]  S. de Lusignan,et al.  The Evidence-base for Using Ontologies and Semantic Integration Methodologies to Support Integrated Chronic Disease Management in Primary and Ambulatory Care: Realist Review , 2013, Yearbook of Medical Informatics.