Combining Health Data Uses to Ignite Health System Learning

OBJECTIVES In this paper we aim to characterise the critical mass of linked data, methods and expertise required for health systems to adapt to the needs of the populations they serve (more recently known as learning health systems). The objectives are to: 1) identify opportunities to combine separate uses of common data sources in order to reduce duplication of data processing and improve information quality; and 2) identify challenges in scaling up the reuse of health data sufficiently to support health system learning.
METHODS The challenges and opportunities were identified through a series of e-health stakeholder consultations and workshops in Northern England from 2011 to 2014. Since 2013, the concepts presented here have been refined through feedback to collaborators, including patient/citizen representatives, in a regional health informatics research network (www.herc.ac.uk).
RESULTS Health systems typically have separate information pipelines for: 1) commissioning services; 2) auditing service performance; 3) managing finances; 4) monitoring public health; and 5) research. These pipelines share common data sources but usually duplicate data extraction, aggregation, cleaning/preparation and analytics. Suboptimal analyses may be performed because of a lack of expertise; that expertise may exist elsewhere in the health system but be fully committed to a different pipeline. Contextual knowledge that is essential for proper data analysis and interpretation may be needed in one pipeline but accessible only in another. The lack of capable health and care intelligence systems for populations can be attributed to a legacy of three flawed assumptions: 1) universality: the generalisability of evidence across populations; 2) time-invariance: the stability of evidence over time; and 3) reducibility: the reduction of evidence into specialised sub-systems that may be recombined.
CONCLUSIONS We conceptualise a population health and care intelligence system capable of supporting health system learning, and we put forward a set of maturity tests of progress toward such a system. A factor common to each test is data-action latency: a mature system spawns timely actions proportionate to the information that can be derived from the data, and in doing so creates meaningful measurement of system learning. We illustrate, using future scenarios, some major opportunities to improve health systems by exchanging conventional intelligence pipelines for networked critical masses of data, methods and expertise that minimise data-action latency and ignite system learning.
