A DICOM Framework for Machine Learning and Processing Pipelines Against Real-time Radiology Images

Real-time execution of machine learning (ML) pipelines on radiology images is difficult due to limited computing resources in clinical environments, whereas running them in research clusters requires efficient data transfer capabilities. We developed Niffler, an open-source Digital Imaging and Communications in Medicine (DICOM) framework that enables ML and processing pipelines in research clusters by efficiently retrieving images from the hospitals’ PACS and extracting the metadata from the images. We deployed Niffler at our institution (Emory Healthcare, the largest healthcare network in the state of Georgia) and retrieved data from 715 scanners spanning 12 sites, up to 350 GB/day continuously in real-time as a DICOM data stream over the past 2 years. We also used Niffler to retrieve images bulk on-demand based on user-provided filters to facilitate several research projects. This paper presents the architecture and three such use cases of Niffler. First, we executed an IVC filter detection and segmentation pipeline on abdominal radiographs in real-time, which was able to classify 989 test images with an accuracy of 96.0%. Second, we applied the Niffler Metadata Extractor to understand the operational efficiency of individual MRI systems based on calculated metrics. We benchmarked the accuracy of the calculated exam time windows by comparing Niffler against the Clinical Data Warehouse (CDW). Niffler accurately identified the scanners’ examination timeframes and idling times, whereas CDW falsely depicted several exam overlaps due to human errors. Third, with metadata extracted from the images by Niffler, we identified scanners with misconfigured time and reconfigured five scanners. Our evaluations highlight how Niffler enables real-time ML and processing pipelines in a research cluster.

[1]  Synho Do,et al.  Medical Image Deep Learning with Hospital PACS Dataset , 2015, ArXiv.

[2]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  C. Parisot The DICOM standard , 1995, The International Journal of Cardiac Imaging.

[4]  Patricia Gauthier,et al.  Integrating feedback from a clinical data warehouse into practice organisation , 2006, Int. J. Medical Informatics.

[5]  R. Hutton,et al.  Applied cognitive task analysis (ACTA): a practitioner's toolkit for understanding cognitive task demands. , 1998, Ergonomics.

[6]  Daniel L. Rubin,et al.  The National Cancer Informatics Program (NCIP) Annotation and Image Markup (AIM) Foundation Model , 2014, Journal of Digital Imaging.

[7]  Krzysztof J. Geras,et al.  New Frontiers: An Update on Computer-Aided Diagnosis for Breast Imaging in the Age of Artificial Intelligence. , 2019, AJR. American journal of roentgenology.

[8]  S. Park,et al.  Design Characteristics of Studies Reporting the Performance of Artificial Intelligence Algorithms for Diagnostic Analysis of Medical Images: Results from Recently Published Papers , 2019, Korean journal of radiology.

[9]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[10]  Marie Taylor,et al.  Introduction to JavaScript Object Notation: a to-the-point guide to JSON , 2014 .

[11]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[12]  清也 稲邑,et al.  DICOM Structured Reporting構造化報告書 , 2001 .

[13]  V. Abedi,et al.  Artificial Intelligence Transforms the Future of Health Care. , 2019, The American journal of medicine.

[14]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[15]  D. Peck Digital Imaging and Communications in Medicine (DICOM): A Practical Introduction and Survival Guide , 2009, Journal of Nuclear Medicine.

[16]  Paul Nagy,et al.  Benefits of Using the DCM4CHE DICOM Archive , 2007, Journal of Digital Imaging.

[17]  Judy W. Gichoya,et al.  Just in Time Radiology Decision Support Using Real-time Data Feeds , 2019, Journal of Digital Imaging.

[18]  Tahsin Kurc,et al.  Twenty Years of Digital Pathology: An Overview of the Road Travelled, What is on the Horizon, and the Emergence of Vendor-Neutral Archives , 2018, Journal of pathology informatics.

[19]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[20]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[21]  Kamran Sartipi,et al.  HL7 FHIR: An Agile and RESTful approach to healthcare information exchange , 2013, Proceedings of the 26th IEEE International Symposium on Computer-Based Medical Systems.

[22]  Oleg S. Pianykh,et al.  Digital Imaging and Communications in Medicine : A Practical Introduction and Survival Guide , 2008 .

[23]  Josep Fernandez-Bayó IHE profiles applied to regional PACS. , 2011, European journal of radiology.

[24]  Marcus A. Badgeley,et al.  Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study , 2018, PLoS medicine.

[25]  Steve G. Langer A Flexible Database Architecture for Mining DICOM Objects: the DICOM Data Warehouse , 2011, Journal of Digital Imaging.

[26]  Carsten Felden,et al.  Real time analytics — State of the art: Potentials and limitations in the smart factory , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[27]  Dawood Khan,et al.  An online platform for interactive feedback in biomedical machine learning , 2020, Nature Machine Intelligence.

[28]  T. Leiner,et al.  Bringing AI to the clinic: blueprint for a vendor-neutral AI deployment infrastructure , 2021, Insights into Imaging.

[29]  Nikhil Ketkar,et al.  Introduction to PyTorch , 2021, Deep Learning with Python.

[30]  Edith Le Floch,et al.  Evaluating Workflow Management Systems: A Bioinformatics Use Case , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[31]  Kyle Banker,et al.  MongoDB in Action , 2011 .

[32]  Viswanath Venkatesh,et al.  Unified Theory of Acceptance and Use of Technology: A Synthesis and the Road Ahead , 2016, J. Assoc. Inf. Syst..

[33]  Suchi Saria,et al.  From development to deployment: dataset shift, causality, and shift-stable models in health AI. , 2019, Biostatistics.

[34]  J. Mongan,et al.  Automated detection of moderate and large pneumothorax on frontal chest X-rays using deep convolutional neural networks: A retrospective study , 2018, PLoS medicine.

[35]  Synho Do,et al.  How much data is needed to train a medical image deep learning system to achieve necessary high accuracy , 2015, 1511.06348.

[36]  Martín Ugarte,et al.  Foundations of JSON Schema , 2016, WWW.

[37]  O. Soremekun,et al.  Framework for analyzing wait times and other factors that impact patient satisfaction in the emergency department. , 2011, The Journal of emergency medicine.

[38]  B. K. Stewart,et al.  Picture archiving and communication systems (PACS) for radiological images: state of the art. , 1988, Critical reviews in diagnostic imaging.

[39]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[40]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Luciano M Prevedello,et al.  Machine Learning in Radiology: Applications Beyond Image Interpretation. , 2017, Journal of the American College of Radiology : JACR.