Mitigating domain shift in AI-based tuberculosis screening with unsupervised domain adaptation

We demonstrate that Domain Invariant Feature Learning (DIFL) can improve the out-ofdomain generalizability of a deep learning Tuberculosis screening algorithm. It is well known that state of the art deep learning algorithms often have difficulty generalizing to unseen data distributions due to "domain shift". In the context of medical imaging, this could lead to unintended biases such as the inability to generalize from one patient population to another. We analyze the performance of a ResNet50 classifier for the purposes of Tuberculosis screening using the four most popular public datasets with geographically diverse sources of imagery. We show that without domain adaptation, ResNet-50 has difficulty in generalizing between imaging distributions from a number of public Tuberculosis screening datasets with imagery from geographically distributed regions. However, with the incorporation of DIFL, the out-of-domain performance is greatly enhanced. Analysis criteria includes a comparison of accuracy, sensitivity, specificity and AUC over both the baseline, as well as the DIFL enhanced algorithms. We conclude that DIFL improves generalizability of Tuberculosis screening while maintaining acceptable accuracy over the source domain imagery when applied across a variety of public datasets. INDEX TERMS Tuberculosis, X-Ray Imaging, Domain Adaptation, Domain Invariant Feature Learning, Generative Adversarial Networks, Deep Learning, Computer Vision, Computer Aided Diagnosis

[1]  Trevor Darrell,et al.  Semi-Supervised Domain Adaptation via Minimax Entropy , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[2]  Moritz Hardt,et al.  A Meta-Analysis of Overfitting in Machine Learning , 2019, NeurIPS.

[3]  Daumé,et al.  Frustratingly Easy Semi-Supervised Domain Adaptation , 2010 .

[4]  Avishek Saha,et al.  Active Supervised Domain Adaptation , 2011, ECML/PKDD.

[5]  M. Zandehshahvar,et al.  Toward understanding COVID-19 pneumonia: a deep-learning-based approach for severity analysis and monitoring the disease , 2020, Scientific Reports.

[6]  Michael I. Jordan,et al.  Unsupervised Domain Adaptation with Residual Transfer Networks , 2016, NIPS.

[7]  Cewu Lu,et al.  Cross-Domain Adaptation for Animal Pose Estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[8]  C. Rout,et al.  Role of Gist and PHOG Features in Computer-Aided Diagnosis of Tuberculosis without Segmentation , 2014, PloS one.

[9]  Dong Yang,et al.  When Unseen Domain Generalization is Unnecessary? Rethinking Data Augmentation , 2019, ArXiv.

[10]  Bogdan Kwolek,et al.  DiagSet: a dataset for prostate cancer histopathological image classification , 2021, ArXiv.

[11]  Daniel Marcu,et al.  Domain Adaptation for Statistical Classifiers , 2006, J. Artif. Intell. Res..

[12]  Jieli Zhou,et al.  SODA: Detecting Covid-19 in Chest X-rays with Semi-supervised Open Set Domain Adaptation , 2020, ArXiv.

[13]  Ke Lu,et al.  Transfer Independently Together: A Generalized Framework for Domain Adaptation , 2019, IEEE Transactions on Cybernetics.

[14]  Brian C. Lovell,et al.  Unsupervised Domain Adaptation by Domain Invariant Projection , 2013, 2013 IEEE International Conference on Computer Vision.

[15]  James H Thrall,et al.  Artificial Intelligence and Machine Learning in Radiology: Opportunities, Challenges, Pitfalls, and Criteria for Success. , 2018, Journal of the American College of Radiology : JACR.

[16]  Barbara Caputo,et al.  Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[18]  Donald A. Adjeroh,et al.  Unified Deep Supervised Domain Adaptation and Generalization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Kaixin Liu,et al.  Domain adaptation transfer learning soft sensor for product quality prediction , 2019, Chemometrics and Intelligent Laboratory Systems.

[20]  Wouter M. Kouw An introduction to domain adaptation and transfer learning , 2018, ArXiv.

[21]  Nico Karssemeijer,et al.  Transfer Learning for Domain Adaptation in MRI: Application in Brain Lesion Segmentation , 2017, MICCAI.

[22]  Dumitru Erhan,et al.  Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Tanveer F. Syeda-Mahmood,et al.  Semi-supervised learning with generative adversarial networks for chest X-ray classification with ability of data domain adaptation , 2018, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[24]  Ivor W. Tsang,et al.  Heterogeneous Domain Adaptation for Multiple Classes , 2014, AISTATS.

[25]  Stefan Jaeger,et al.  Two public chest X-ray datasets for computer-aided screening of pulmonary diseases. , 2014, Quantitative imaging in medicine and surgery.

[26]  David J. Kriegman,et al.  Image to Image Translation for Domain Adaptation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[27]  Victor S. Lempitsky,et al.  Unsupervised Domain Adaptation by Backpropagation , 2014, ICML.

[28]  Chong-Wah Ngo,et al.  Semi-supervised Domain Adaptation with Subspace Learning for visual recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  A. Tavakkoli,et al.  Optic-Net: A Novel Convolutional Neural Network for Diagnosis of Retinal Diseases from Optical Tomography Images , 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA).

[30]  Sumit Chopra,et al.  DLID: Deep Learning for Domain Adaptation by Interpolating between Domains , 2013 .

[31]  Silvio Savarese,et al.  Generalizing to Unseen Domains via Adversarial Data Augmentation , 2018, NeurIPS.

[32]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.