Ethical Use of Electronic Health Record Data and Artificial Intelligence: Recommendations of the Primary Care Informatics Working Group of the International Medical Informatics Association

Summary Objective: To create practical recommendations for the curation of routinely collected health data and artificial intelligence (AI) in primary care with a focus on ensuring their ethical use. Methods: We defined data curation as the process of management of data throughout its lifecycle to ensure it can be used into the future. We used a literature review and Delphi exercises to capture insights from the Primary Care Informatics Working Group (PCIWG) of the International Medical Informatics Association (IMIA). Results: We created six recommendations: (1) Ensure consent and formal process to govern access and sharing throughout the data life cycle; (2) Sustainable data creation/collection requires trust and permission; (3) Pay attention to Extract-Transform-Load (ETL) processes as they may have unrecognised risks; (4) Integrate data governance and data quality management to support clinical practice in integrated care systems; (5) Recognise the need for new processes to address the ethical issues arising from AI in primary care; (6) Apply an ethical framework mapped to the data life cycle, including an assessment of data quality to achieve effective data curation. Conclusions: The ethical use of data needs to be integrated within the curation process, hence running throughout the data lifecycle. Current information systems may not fully detect the risks associated with ETL and AI; they need careful scrutiny. With distributed integrated care systems where data are often used remote from documentation, harmonised data quality assessment, management, and governance is important. These recommendations should help maintain trust and connectedness in contemporary information systems and planned developments.

[1]  R. McLean,et al.  The U.S. Health Care System Is Ill and Needs a Bold New Prescription , 2020, Annals of Internal Medicine.

[2]  Craig E. Kuziemsky,et al.  System Level Patient-Centered Data Sharing , 2019, 2019 IEEE/ACM 1st International Workshop on Software Engineering for Healthcare (SEH).

[3]  Enrico W. Coiera,et al.  The Price of Artificial Intelligence , 2019, Yearbook of Medical Informatics.

[4]  Ali Sunyaev,et al.  Availability and quality of mobile health app privacy policies , 2015, J. Am. Medical Informatics Assoc..

[5]  Liam Peyton,et al.  A configurable identity matching algorithm for community care management , 2020, J. Ambient Intell. Humaniz. Comput..

[6]  J. Winn Consumer protection in the age of the 'information economy' , 2006 .

[7]  S de Lusignan,et al.  Key Concepts to Assess the Readiness of Data for International Research: Data Quality, Lineage and Provenance, Extraction and Processing Errors, Traceability, and Curation , 2011, Yearbook of Medical Informatics.

[8]  David L Buckeridge,et al.  A population health perspective on artificial intelligence , 2019, Healthcare management forum.

[9]  Megan R. Mahoney,et al.  Ten Ways Artificial Intelligence Will Transform Primary Care , 2019, Journal of General Internal Medicine.

[10]  Harshana Liyanage,et al.  Artificial Intelligence in Primary Health Care: Perceptions, Issues, and Challenges , 2019, Yearbook of Medical Informatics.

[11]  Alena Buyx,et al.  Health Information Counselors: A New Profession for the Age of Big Data , 2019, Academic medicine : journal of the Association of American Medical Colleges.

[12]  Harshana Liyanage,et al.  An integrated organisation-wide data quality management and information governance framework: theoretical underpinnings. , 2014, Informatics in primary care.

[13]  Siaw-Teng Liaw,et al.  Ethical research or research ethics? , 2015, Australian family physician.

[14]  Philippe Ravaud,et al.  Blockchain technology for improving clinical research quality , 2017, Trials.

[15]  S de Lusignan,et al.  Building a Privacy, Ethics, and Data Access Framework for Real World Computerised Medical Record System Data: A Delphi Study , 2016, Yearbook of Medical Informatics.

[16]  F. Cate The Failure of Fair Information Practice Principles , 2006 .

[17]  Liam Peyton,et al.  Cloud‐based performance management of community care services , 2018, J. Softw. Evol. Process..

[18]  Andrew Hayen,et al.  Integrating electronic health record information to support integrated care: Practical application of ontologies to improve the accuracy of diabetes disease registers , 2014, J. Biomed. Informatics.

[19]  T. Y. Tham,et al.  Integrated health care systems in Asia: an urgent necessity , 2018, Clinical interventions in aging.

[20]  Dan J. Kim,et al.  A trust-based consumer decision-making model in electronic commerce: The role of trust, perceived risk, and their antecedents , 2019 .

[21]  Rickey E. Carter,et al.  Pragmatic considerations for fostering reproducible research in artificial intelligence , 2019, npj Digital Medicine.

[22]  S de Lusignan,et al.  Big Data Usage Patterns in the Health Care Domain: A Use Case Driven Approach Applied to the Assessment of Vaccination Benefits and Risks , 2014, Yearbook of Medical Informatics.

[23]  Brian W. Powers,et al.  Dissecting racial bias in an algorithm used to manage the health of populations , 2019, Science.

[24]  Abderrahim Beni Hssane,et al.  Big healthcare data: preserving security and privacy , 2018, Journal of Big Data.

[25]  Lucila Ohno-Machado Understanding and mitigating the digital divide in health care , 2017, J. Am. Medical Informatics Assoc..

[26]  H. Raghav Rao,et al.  A trust-based consumer decision-making model in electronic commerce: The role of trust, perceived risk, and their antecedents , 2008, Decis. Support Syst..

[27]  Simon de Lusignan,et al.  Real-world evidence studies into treatment adherence, thresholds for intervention and disparities in treatment in people with type 2 diabetes in the UK , 2016, BMJ Open.

[28]  Peter Beck,et al.  Cross-border flow of health information: is 'privacy by design' enough? Privacy performance assessment in EUBIROD. , 2013, European journal of public health.

[29]  Francesco Bonchi,et al.  Algorithmic Bias: From Discrimination Discovery to Fairness-aware Data Mining , 2016, KDD.

[30]  L. Coles,et al.  Computers and society. , 1972, Science.

[31]  Thu-Trang T. Hickman,et al.  Cranky comments: detecting clinical decision support malfunctions through free-text override reasons , 2018, J. Am. Medical Informatics Assoc..

[32]  Benjamin I. P. Rubinstein,et al.  Health Data in an Open World , 2017, ArXiv.

[33]  Christopher Pearce,et al.  Optimising the use of observational electronic health record data: Current issues, evolving opportunities, strategies and scope for collaboration. , 2016, Australian family physician.

[34]  Carole A. Goble,et al.  Data curation + process curation=data integration + science , 2008, Briefings Bioinform..

[35]  Lizzie Presser,et al.  Care.data and access to UK health records: patient privacy and public trust , 2015 .

[36]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[37]  Simon de Lusignan,et al.  The roles of policy and professionalism in the protection of processed clinical data: A literature review , 2007, Int. J. Medical Informatics.

[38]  Farah Magrabi,et al.  Artificial Intelligence in Clinical Decision Support: Challenges for Evaluating AI and Practical Implications , 2019, Yearbook of Medical Informatics.

[39]  Fabian Prasser,et al.  Privacy-enhancing ETL-processes for biomedical data , 2019, Int. J. Medical Informatics.