Use of Machine Learning to Identify Follow-Up Recommendations in Radiology Reports

Purpose: Assess follow-up recommendations in radiology reports, develop and assess traditional machine learning (TML) and deep learning (DL) models in identifying follow-up, and benchmark them against a natural language processing (NLP) system. Methods: This HIPAA-compliant, IRB approved study, was performed at an academic medical center generating >500,000 radiology reports annually. 1,000 randomly-selected ultrasound, x-ray, computed tomography and magnetic resonance imaging reports generated in 2016 were manually reviewed and annotated for follow-up recommendations. Traditional machine learning (Support Vector Machines, Random Forest, Logistic Regression) and deep learning (Recurrent Neural Nets) algorithms were constructed and trained on 850 reports (training data), with subsequent optimization of model architectures and parameters. Precision, recall and F1-score were calculated on the remaining 150 reports (test data). A previously-developed and validated NLP system (iSCOUT) was also applied to the test data, with equivalent metrics calculated. Results: 12.7% of reports had follow-up recommendations. The TML algorithms achieved F1 scores of 0.75 (Random Forest), 0.83 (Logistic Regression), and 0.85 (Support Vector Machine) on the test data. DL Recurrent Neural Nets had an F1 score of 0.71; iSCOUT also had an F1 score of 0.71. Performance of both TML and DL methods by F1-scores appeared to plateau after 500–700 samples while training. Conclusion: TML and DL are feasible methods to identify follow-up recommendations. These methods have great potential for near real-time monitoring of follow up recommendations in radiology reports.

[1]  Fang Yu,et al.  Automated Classification of Radiology Reports to Facilitate Retrospective Study in Radiology , 2014, Journal of Digital Imaging.

[2]  D. Rueckert,et al.  Prediction of stroke thrombolysis outcome using CT brain machine learning , 2014, NeuroImage: Clinical.

[3]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[4]  Loes M. M. Braun,et al.  Natural Language Processing in Radiology: A Systematic Review. , 2016, Radiology.

[5]  Ramin Khorasani,et al.  Focal cystic pancreatic lesions: assessing variation in radiologists' management recommendations. , 2011, Radiology.

[6]  Antonio Lavecchia,et al.  Machine-learning approaches in drug discovery: methods and applications. , 2015, Drug discovery today.

[7]  Katherine P Andriole,et al.  Factors associated with radiologists' adherence to Fleischner Society guidelines for management of pulmonary nodules. , 2012, Journal of the American College of Radiology : JACR.

[8]  James A Brink,et al.  Managing incidental findings on abdominal CT: white paper of the ACR incidental findings committee. , 2010, Journal of the American College of Radiology : JACR.

[9]  A. Indrayan,et al.  A simple nomogram for sample size for estimating sensitivity and specificity of medical tests , 2010, Indian journal of ophthalmology.

[10]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[11]  Ramin Khorasani,et al.  Adoption of a closed loop communication tool to establish and execute a collaborative follow up plan for incidental pulmonary nodules , 2019, AJR. American journal of roentgenology.

[12]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[13]  Bruce J. Hillman,et al.  The uncritical use of high-tech medical imaging. , 2010, The New England journal of medicine.

[14]  Ross W. Filice,et al.  Implementing Machine Learning in Radiology Practice and Research. , 2017, AJR. American journal of roentgenology.

[15]  Ramin Khorasani,et al.  Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT) , 2012, Journal of Digital Imaging.

[16]  Ramin Khorasani,et al.  Focal Cystic Pancreatic Lesion Follow-up Recommendations After Publication of ACR White Paper on Managing Incidental Findings. , 2017, Journal of the American College of Radiology : JACR.

[17]  D. Miglioretti,et al.  Rising use of diagnostic medical imaging in a large integrated health system. , 2008, Health affairs.

[18]  Michael I. Jordan,et al.  Machine learning: Trends, perspectives, and prospects , 2015, Science.

[19]  Jun'ichi Tsujii,et al.  Named entity recognition of follow-up and time information in 20 000 radiology reports , 2012, J. Am. Medical Informatics Assoc..

[20]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[21]  Michael Pignone,et al.  How to decide whether a clinical practice guideline is trustworthy. , 2013, JAMA.

[22]  Ramin Khorasani,et al.  Multivariate Analysis of Radiologists' Usage of Phrases that Convey Diagnostic Certainty. , 2019, Academic radiology.

[23]  James H Thrall,et al.  Recommendations for additional imaging in radiology reports: multifactorial analysis of 5.9 million examinations. , 2009, Radiology.

[24]  Sayon Dutta,et al.  Automated detection using natural language processing of radiologists recommendations for additional imaging of incidental findings. , 2013, Annals of emergency medicine.

[25]  Hhs Centers for Medicare Medicaid Services Medicare Program; CY 2018 Updates to the Quality Payment Program; and Quality Payment Program: Extreme and Uncontrollable Circumstance Policy for the Transition Year. Final rule with comment period and interim final rule with comment period. , 2017 .

[26]  Chi Wan Koo,et al.  Addition of the Fleischner Society Guidelines to Chest CT Examination Interpretive Reports Improves Adherence to Recommended Follow-up Care for Incidental Pulmonary Nodules. , 2017, Academic radiology.

[27]  Aurélien Géron,et al.  Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems , 2017 .

[28]  Huan Huang,et al.  National trends in advanced outpatient diagnostic imaging utilization: an analysis of the medical expenditure panel survey, 2000-2009 , 2013, BMC Medical Imaging.