Uncovering the effects of model initialization on deep model generalization: A study with adult and pediatric chest X-ray images

Model initialization techniques are vital for improving the performance and reliability of deep learning models in medical computer vision applications. While much literature exists on non-medical images, the impact of initialization on medical images, particularly chest X-rays (CXRs), is less well understood. To address this gap, our study explores three deep model initialization techniques, Cold-start, Warm-start, and Shrink and Perturb start, in adult and pediatric populations. We focus on scenarios in which training data arrive periodically, reflecting real-world conditions of ongoing data influx and the need for model updates. We evaluate the generalizability of these models on external adult and pediatric CXR datasets. We also propose novel ensemble methods, F-score-weighted Sequential Least-Squares Quadratic Programming (F-SLSQP) and Attention-Guided Ensembles with Learnable Fuzzy Softmax, which aggregate weight parameters from multiple models to capitalize on their collective knowledge and complementary representations. We analyze model performance using statistical significance tests with 95% confidence intervals and p-values. Our evaluations indicate that models initialized with ImageNet-pretrained weights generalize better than their randomly initialized counterparts, contradicting some findings for non-medical images. Notably, ImageNet-pretrained models perform consistently during internal and external testing across different training scenarios. Weight-level ensembles of these models show significantly higher recall (p<0.05) during testing than individual models. Thus, our study underscores the benefits of ImageNet-pretrained weight initialization, especially when used with weight-level ensembles, for creating robust and generalizable deep learning solutions.
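As a rough illustration of the three initialization strategies and of weight-level aggregation, the following sketch uses plain Python lists in place of real network tensors. The function names, shrink factor, noise scale, and ensemble coefficients are illustrative assumptions, not values from this study. In particular, the coefficients are supplied by hand here rather than obtained through F-SLSQP or attention-guided weighting.

```python
import random

def cold_start(sizes, rng, std=0.1):
    """Cold-start: draw fresh random weights, ignoring any earlier model."""
    return {name: [rng.gauss(0.0, std) for _ in range(n)]
            for name, n in sizes.items()}

def warm_start(prev):
    """Warm-start: reuse the previous model's weights unchanged."""
    return {name: list(ws) for name, ws in prev.items()}

def shrink_and_perturb(prev, rng, shrink=0.5, noise_std=0.01):
    """Shrink previous weights toward zero, then add small Gaussian noise.

    `shrink` and `noise_std` are hypothetical defaults chosen for illustration.
    """
    return {name: [shrink * w + rng.gauss(0.0, noise_std) for w in ws]
            for name, ws in prev.items()}

def weight_level_ensemble(models, coeffs):
    """Weighted average of corresponding parameters across models.

    `coeffs` are assumed to be given; they are normalized by their sum.
    """
    total = sum(coeffs)
    return {name: [sum(c * m[name][i] for c, m in zip(coeffs, models)) / total
                   for i in range(len(models[0][name]))]
            for name in models[0]}

# Toy usage: a "model" is a dict mapping layer names to flat weight lists.
rng = random.Random(0)
prev = cold_start({"conv": 4, "fc": 2}, rng)      # model trained on earlier data
init_a = warm_start(prev)                          # continue from old weights
init_b = shrink_and_perturb(prev, rng)             # shrink, then add noise
merged = weight_level_ensemble([init_a, init_b], coeffs=[0.6, 0.4])
```

When new data arrive, each strategy yields a different starting point for the next round of training; the ensemble step then combines trained models parameter by parameter rather than combining their predictions.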
