Evaluating ChatGPT as an Adjunct for Radiologic Decision-Making

BACKGROUND ChatGPT, a popular new large language model (LLM) built by OpenAI, has shown impressive performance in a number of specialized applications. Despite the rising popularity and performance of AI, studies evaluating the use of LLMs for clinical decision support are lacking.

PURPOSE To evaluate ChatGPT's capacity for clinical decision support in radiology by having it identify appropriate imaging services for two important clinical presentations: breast cancer screening and breast pain.

MATERIALS AND METHODS We compared ChatGPT's responses to the American College of Radiology (ACR) Appropriateness Criteria for breast pain and breast cancer screening. We used two prompt formats: an open-ended (OE) format, in which ChatGPT was asked to provide the single most appropriate imaging procedure, and a select-all-that-apply (SATA) format, in which ChatGPT was given a list of imaging modalities to assess. Scoring criteria evaluated whether the proposed imaging modalities were in accordance with ACR guidelines.

RESULTS For breast cancer screening prompts, ChatGPT achieved an average OE score of 1.83 (out of 2) and a SATA average percentage correct of 88.9%. For breast pain prompts, it achieved an average OE score of 1.125 (out of 2) and a SATA average percentage correct of 58.3%.

CONCLUSION Our results demonstrate the feasibility of using ChatGPT for radiologic decision-making, with the potential to improve clinical workflow and responsible use of radiology services.
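The two summary metrics reported above (an average OE score out of 2 and a SATA average percentage correct) can be illustrated with a minimal Python sketch. The abstract does not give the scoring rubric itself, so everything here is an assumption made for illustration: the function names, the treatment of each SATA option as a true/false appropriateness judgment compared against the ACR reference, and the example values, which are made up and are not study data.

```python
from statistics import mean


def mean_oe_score(scores):
    """Average open-ended (OE) score.

    Assumption: each prompt was graded on a 0-2 scale, as implied by
    the "out of 2" reporting in the abstract.
    """
    if not scores:
        raise ValueError("at least one score is required")
    if any(not 0 <= s <= 2 for s in scores):
        raise ValueError("OE scores are assumed to lie between 0 and 2")
    return mean(scores)


def sata_percent_correct(model_judgments, reference_judgments):
    """Percentage of listed imaging options judged in agreement with the reference.

    Assumption: each option is reduced to a boolean "appropriate or not"
    call, and a call is correct when it matches the reference guideline.
    """
    if len(model_judgments) != len(reference_judgments):
        raise ValueError("judgment lists must have the same length")
    if not reference_judgments:
        raise ValueError("at least one imaging option is required")
    matches = sum(m == r for m, r in zip(model_judgments, reference_judgments))
    return 100.0 * matches / len(reference_judgments)


# Hypothetical values for illustration only (not taken from the study).
example_oe_scores = [2, 2, 1, 2]
example_model = [True, False, True, True]
example_reference = [True, False, False, True]

print(mean_oe_score(example_oe_scores))
print(sata_percent_correct(example_model, example_reference))
```

Averaging per-prompt results in this way would produce figures of the same form as those in RESULTS, though the study's actual grading procedure may differ from this sketch.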
