ENRICHing medical imaging training sets enables more efficient machine learning

OBJECTIVE Deep learning (DL) has been applied in proofs of concept across biomedical imaging, including across modalities and medical specialties. Labeled data are critical to training and testing DL models, but human expert labelers are limited. In addition, DL traditionally requires copious training data, which is computationally expensive to process and iterate over. Consequently, it is useful to prioritize using those images that are most likely to improve a model's performance, a practice known as instance selection. The challenge is determining how best to prioritize. It is natural to prefer straightforward, robust, quantitative metrics as the basis for prioritization for instance selection. However, in current practice, such metrics are not tailored to, and almost never used for, image datasets. MATERIALS AND METHODS To address this problem, we introduce ENRICH-Eliminate Noise and Redundancy for Imaging Challenges-a customizable method that prioritizes images based on how much diversity each image adds to the training set. RESULTS First, we show that medical datasets are special in that in general each image adds less diversity than in nonmedical datasets. Next, we demonstrate that ENRICH achieves nearly maximal performance on classification and segmentation tasks on several medical image datasets using only a fraction of the available images and without up-front data labeling. ENRICH outperforms random image selection, the negative control. Finally, we show that ENRICH can also be used to identify errors and outliers in imaging datasets. CONCLUSIONS ENRICH is a simple, computationally efficient method for prioritizing images for expert labeling and use in DL.

[1]  R. Arnaout,et al.  Repertoire-scale measures of antigen binding , 2022, bioRxiv.

[2]  A. Butte,et al.  Development and Validation of a Deep Learning Strategy for Automated View Classification of Pediatric Focused Assessment With Sonography for Trauma , 2021, Journal of ultrasound in medicine : official journal of the American Institute of Ultrasound in Medicine.

[3]  A. Moon‐Grady,et al.  An ensemble of neural networks provides expert-level prenatal detection of complex congenital heart disease , 2021, Nature Medicine.

[4]  T. Maloney,et al.  DeepLiverNet: a deep transfer learning model for classifying liver stiffness using clinical and T2-weighted magnetic resonance imaging data in children and young adults , 2020, Pediatric Radiology.

[5]  K. Brock,et al.  Automatic contouring system for cervical cancer using convolutional neural networks , 2020, Medical physics.

[6]  Elena A. Kaye,et al.  Accelerating Prostate Diffusion Weighted MRI using Guided Denoising Convolutional Neural Network: Retrospective Feasibility Study , 2020, Radiology. Artificial intelligence.

[7]  K. Brock,et al.  Automated Contouring of Contrast and Noncontrast Computed Tomography Liver Images With Fully Convolutional Networks , 2020, Advances in radiation oncology.

[8]  Nan Wu,et al.  An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization , 2020, Medical Image Anal..

[9]  Daniel C. Lee,et al.  Rapid dealiasing of undersampled, non‐Cartesian cardiac perfusion images using U‐net , 2020, NMR in biomedicine.

[10]  Bruno De Man,et al.  A dual-stream deep convolutional network for reducing metal streak artifacts in CT images , 2019, Physics in medicine and biology.

[11]  P. Ellen Grant,et al.  Fetal Pose Estimation in Volumetric MRI using a 3D Convolution Neural Network , 2019, MICCAI.

[12]  Ruzena Bajcsy,et al.  Fully Automated Echocardiogram Interpretation in Clinical Practice , 2018, Circulation.

[13]  Xiang Li,et al.  Shortcomings of Ventricle Segmentation Using Deep Convolutional Networks , 2018, MLCN/DLF/iMIMIC@MICCAI.

[14]  L. Jost What do we mean by diversity? The path towards quantification , 2018, Mètode Revista de difusió de la investigació.

[15]  Ramy Arnaout,et al.  Fast and accurate view classification of echocardiograms using deep learning , 2018, npj Digital Medicine.

[16]  Khan M. Iftekharuddin,et al.  Deep learning and texture-based semantic label fusion for brain tumor segmentation , 2018, Medical Imaging.

[17]  Daniel S. Kermany,et al.  Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning , 2018, Cell.

[18]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[19]  Ruimao Zhang,et al.  Cost-Effective Active Learning for Deep Image Classification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[21]  Richard Duszak,et al.  The U.S. Radiologist Workforce: An Analysis of Temporal and Geographic Variation by Using Large National Datasets. , 2016, Radiology.

[22]  Joseph Kaplinsky,et al.  Robust estimates of overall immune-repertoire diversity from high-throughput measurements on samples , 2016, Nature Communications.

[23]  José Francisco Martínez Trinidad,et al.  A review of instance selection methods , 2010, Artificial Intelligence Review.

[24]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[25]  Gobert N. Lee,et al.  Deep Learning in Medical Image Analysis: Challenges and Applications , 2020, Advances in Experimental Medicine and Biology.