Big data in health care: using analytics to identify and manage high-risk and high-cost patients.

The US health care system is rapidly adopting electronic health records, which will dramatically increase the quantity of clinical data that are available electronically. Simultaneously, rapid progress has been made in clinical analytics (techniques for analyzing large quantities of data and gleaning new insights from that analysis), which is part of what is known as big data. As a result, there are unprecedented opportunities to use big data to reduce the costs of health care in the United States. We present six use cases (that is, key examples) where some of the clearest opportunities exist to reduce costs through the use of big data: high-cost patients, readmissions, triage, decompensation (when a patient's condition worsens), adverse events, and treatment optimization for diseases affecting multiple organ systems. We discuss the types of insights that are likely to emerge from clinical analytics, the types of data needed to obtain such insights, and the infrastructure (analytics, algorithms, registries, assessment scores, monitoring devices, and so forth) that organizations will need to perform the necessary analyses and to implement changes that will improve care while reducing costs. Our findings have policy implications for regulatory oversight, ways to address privacy concerns, and the support of research on analytics.


[33]  J. Frankovich,et al.  Evidence-based medicine in the EMR era. , 2011, The New England journal of medicine.

[34]  W. Carlo,et al.  Mortality reduction by heart rate characteristic monitoring in very low birth weight neonates: a randomized trial. , 2011, The Journal of pediatrics.

[35]  Robert P Kocher,et al.  Hospital readmissions and the Affordable Care Act: paying for coordinated quality care. , 2011, JAMA.

[36]  Gabriel J. Escobar,et al.  Estimating the Probability of Neonatal Early-Onset Infection on the Basis of Maternal Risk Factors , 2011, Pediatrics.

[37]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[38]  Lucila Ohno-Machado,et al.  To Share or Not To Share: That Is Not the Question , 2012, Science Translational Medicine.

[39]  L. Nelson Lessons from Medicare's Demonstration Projects on Disease Management and Care Coordination: Working Paper 2012-01 , 2012 .

[40]  D. Bates,et al.  Clinical prediction rule to identify high‐risk inpatients for adverse drug events: the JADE Study , 2012, Pharmacoepidemiology and drug safety.

[41]  S. Altman,et al.  The new era of payment reform, spending targets, and cost containment in Massachusetts: early lessons for the nation. , 2012, Health affairs.

[42]  Shuying Shen,et al.  Evaluating the state of the art in coreference resolution for electronic medical records , 2012, J. Am. Medical Informatics Assoc..

[43]  Xiaoqian Jiang,et al.  Doubly Optimized Calibrated Support Vector Machine (DOC-SVM): An Algorithm for Joint Optimization of Discrimination and Calibration , 2012, PloS one.

[44]  Jihoon Kim,et al.  A patient-driven adaptive prediction technique to improve personalized risk estimation for clinical decision support , 2012, J. Am. Medical Informatics Assoc..

[45]  2012 Annual Report President's Message—Health Care Reform: A Journey , 2012 .

[46]  D. Ose,et al.  Patterns of multimorbidity in primary care patients at high risk of future hospitalization. , 2012, Population health management.

[47]  Patricia Kipnis,et al.  Risk-adjusting Hospital Mortality Using a Comprehensive Electronic Record in an Integrated Health Care Delivery System , 2013, Medical care.

[48]  Francis Y. Lau,et al.  Measuring value for money: a scoping review on economic evaluation of health information systems , 2013, J. Am. Medical Informatics Assoc..

[49]  Anna Rumshisky,et al.  Temporal reasoning over clinical text: the state of the art , 2013, J. Am. Medical Informatics Assoc..

[50]  E. Eichenwald,et al.  Neonatal early-onset sepsis evaluations among well-appearing infants: projected impact of changes in CDC GBS guidelines , 2013, Journal of Perinatology.

[51]  M. Rothman,et al.  Placing clinical variables on a common linear scale of empirically based risk as a step towards construction of a general patient acuity score from the electronic health record: a modelling study , 2013, BMJ Open.

[52]  Suchi Saria,et al.  Developing Predictive Models Using Electronic Medical Records: Challenges and Pitfalls , 2013, AMIA.

[53]  L. Casalino,et al.  Can accountable care organizations improve population health?: should they try? , 2013, JAMA.

[54]  Michael J. Rothman,et al.  Development and validation of a continuous measure of patient condition using the Electronic Medical Record , 2013, J. Biomed. Informatics.

[55]  Robert L. Phillips,et al.  The Rise of Electronic Health Record Adoption Among Family Physicians , 2013, The Annals of Family Medicine.

[56]  Andrew T. Kaczynski,et al.  Comparison of traditional versus mobile app self-monitoring of physical activity and dietary intake among overweight adults participating in an mHealth weight loss program , 2013, J. Am. Medical Informatics Assoc..

[57]  N. Shah,et al.  Pharmacovigilance Using Clinical Notes , 2013, Clinical pharmacology and therapeutics.

[58]  Bhavani Shankar Kodali,et al.  Capnography outside the operating rooms. , 2013, Anesthesiology.

[59]  Nigam H. Shah,et al.  Practice-Based Evidence: Profiling the Safety of Cilostazol by Text-Mining of Clinical Notes , 2013, PloS one.

[60]  Keith Marsolo,et al.  An i2b2-based, generalizable, open source, self-scaling chronic disease registry , 2012, J. Am. Medical Informatics Assoc..

[61]  T. Murdoch,et al.  The inevitable application of big data to health care. , 2013, JAMA.

[62]  Steven N Goodman,et al.  An ethics framework for a learning health care system: a departure from traditional research ethics and clinical ethics. , 2013, The Hastings Center report.

[63]  Melissa Haendel,et al.  A sea of standards for omics data: sink or swim? , 2013, J. Am. Medical Informatics Assoc..

[64]  Gregory N. Connolly,et al.  Concern about security and privacy, and perceived control over collection and use of health information are related to withholding of health information from healthcare providers , 2014, J. Am. Medical Informatics Assoc..

[65]  David W Bates,et al.  Continuous monitoring in an inpatient medical-surgical unit: a controlled clinical trial. , 2014, The American journal of medicine.

[66]  Michael W. Kuzniewicz,et al.  Stratification of Risk of Early-Onset Sepsis in Newborns ≥34 Weeks’ Gestation , 2014, Pediatrics.

[67]  Michael J Rothman,et al.  Measuring the modified early warning score and the Rothman Index: Advantages of utilizing the electronic medical record in an early warning system , 2013, Journal of hospital medicine.