Exploring Physician Perspectives on Using Real-world Care Data for the Development of Artificial Intelligence–Based Technologies in Health Care: Qualitative Study

Background Development of artificial intelligence (AI)–based technologies in health care is proceeding rapidly. The sharing and release of real-world data are key practical issues surrounding the implementation of AI solutions into existing clinical practice. However, data derived from daily patient care are necessary for initial training, and continued data supply is needed for the ongoing training, validation, and improvement of AI-based solutions. Data may need to be shared across multiple institutions and settings for the widespread implementation and high-quality use of these solutions. To date, solutions have not been widely implemented in Germany to meet the challenge of providing a sufficient data volume for the development of AI-based technologies for research and third-party entities. The Protected Artificial Intelligence Innovation Environment for Patient-Oriented Digital Health Solutions (pAItient) project aims to meet this challenge by creating a large data pool that feeds on the donation of data derived from daily patient care. Prior to building this data pool, physician perspectives regarding data donation for AI-based solutions should be studied. Objective This study explores physician perspectives on providing and using real-world care data for the development of AI-based solutions in health care in Germany. Methods As a part of the requirements analysis preceding the pAItient project, this qualitative study explored physician perspectives and expectations regarding the use of data derived from daily patient care in AI-based solutions. Semistructured, guide-based, and problem-centered interviews were audiorecorded, deidentified, transcribed verbatim, and analyzed inductively in a thematically structured approach. Results Interviews (N=8) with a mean duration of 24 (SD 7.8) minutes were conducted with 6 general practitioners and 2 hospital-based physicians. The mean participant age was 54 (SD 14.1; range 30-74) years, with an average experience as a physician of 25 (SD 13.9; range 1-45) years. Self-rated affinity toward modern information technology varied from very high to low (5-point Likert scale: mean 3.75, SD 1.1). All participants reported they would support the development of AI-based solutions in research contexts by donating deidentified data derived from daily patient care if subsequent data use was made transparent to them and their patients and the benefits for patient care were clear. Contributing to care optimization and efficiency were cited as motivation for potential data donation. Concerns regarding workflow integration (time and effort), appropriate deidentification, and the involvement of third-party entities with economic interests were discussed. The donation of data in reference to psychosomatic treatment needs was viewed critically. Conclusions The interviewed physicians reported they would agree to use real-world care data to support the development of AI-based solutions with a clear benefit for daily patient care. Joint ventures with third-party entities were viewed critically and should focus on care optimization and patient benefits rather than financial interests.

[1]  M. Hennink,et al.  Sample sizes for saturation in qualitative research: A systematic review of empirical tests. , 2021, Social science & medicine.

[2]  E. Swart,et al.  Jetzt die Weichen stellen für ein leistungsfähiges Forschungsdatenzentrum Gesundheit , 2021, Das Gesundheitswesen.

[3]  A. Schuppert,et al.  Future Medical Artificial Intelligence Application Requirements and Expectations of Physicians in German University Hospitals: Web-Based Survey , 2020, Journal of medical Internet research.

[4]  S. Kardia,et al.  The public’s comfort with sharing health data with third-party commercial companies , 2020, Humanities & social sciences communications.

[5]  M. Krawczak,et al.  Secondary research use of personal medical data: attitudes from patient and population surveys in The Netherlands and Germany , 2020, European Journal of Human Genetics.

[6]  A. Middleton,et al.  Willingness to donate genomic and other medical data: results from Germany , 2020, European Journal of Human Genetics.

[7]  Mohamad Sawan,et al.  Artificial Intelligence in Healthcare: Review and Prediction Case Studies , 2020, Engineering.

[8]  Yuqi Guo,et al.  Artificial Intelligence in Health Care: Bibliometric Analysis , 2020, Journal of medical Internet research.

[9]  Holger Fröhlich,et al.  From hype to reality: data science enabling personalized medicine , 2018, BMC Medicine.

[10]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[11]  C. Critchley,et al.  The Impact of Commercialisation and Genetic Data Sharing Arrangements on Public Trust and the Intention to Participate in Biobank Research , 2015, Public Health Genomics.

[12]  P. Webster Sweden’s health data goldmine , 2014, Canadian Medical Association Journal.

[13]  Ritu Agarwal,et al.  The Digitization of Healthcare: Boundary Risks, Emotion, and Consumer Willingness to Disclose Personal Health Information , 2011, Inf. Syst. Res..

[14]  P. Sainsbury,et al.  Consolidated criteria for reporting qualitative research (COREQ): a 32-item checklist for interviews and focus groups. , 2007, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[15]  Hector Geffner,et al.  Introduction to Artificial Intelligence: Reasoning with Constraints , 2003 .