Hetero-Modal Learning and Expansive Consistency Constraints for Semi-Supervised Detection from Multi-Sequence Data

Lesion detection serves a critical role in early diagnosis and has been well explored in recent years due to methodological advances and increased data availability. However, the high costs of annotations hinder the collection of large and completely labeled datasets, motivating semi-supervised detection approaches. In this paper, we introduce mean teacher hetero-modal detection (MTHD), which addresses two important gaps in current semi-supervised detection. First, it is not obvious how to enforce unlabeled consistency constraints across the very different outputs of various detectors, which has resulted in various compromises being used in the state of the art. Using an anchor-free framework, MTHD formulates a mean teacher approach without such compromises, enforcing consistency on the soft-output of object centers and size. Second, multi-sequence data is often critical, e.g., for abdominal lesion detection, but unlabeled data is often missing sequences. To deal with this, MTHD incorporates hetero-modal learning in its framework. Unlike prior art, MTHD is able to incorporate an expansive set of consistency constraints that include geometric transforms and random sequence combinations. We train and evaluate MTHD on liver lesion detection using the largest MR lesion dataset to date (1099 patients with > 5000 volumes). MTHD surpasses the best fully-supervised and semi-supervised competitors by 10.1% and 3.5%, respectively, in average sensitivity.

[1]  Adam P. Harrison,et al.  Lesion-Harvester: Iteratively Mining Unlabeled Lesions and Hard-Negative Examples at Scale , 2020, IEEE Transactions on Medical Imaging.

[2]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[3]  Timo Aila,et al.  Temporal Ensembling for Semi-Supervised Learning , 2016, ICLR.

[4]  Nojun Kwak,et al.  Consistency-based Semi-supervised Learning for Object detection , 2019, NeurIPS.

[5]  David Berthelot,et al.  MixMatch: A Holistic Approach to Semi-Supervised Learning , 2019, NeurIPS.

[6]  Adam P. Harrison,et al.  Learning From Multiple Datasets With Heterogeneous and Partial Labels for Universal Lesion Detection in CT , 2020, IEEE Transactions on Medical Imaging.

[7]  Ivan Bricault,et al.  EASL and AASLD recommendations for the diagnosis of HCC to the test of daily practice , 2017, Liver international : official journal of the International Association for the Study of the Liver.

[8]  Adam P. Harrison,et al.  Co-Heterogeneous and Adaptive Segmentation from Multi-Source and Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation , 2020, ECCV.

[9]  Nima Tajbakhsh,et al.  Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation , 2019, Medical Image Anal..

[10]  Han Zhang,et al.  A Simple Semi-Supervised Learning Framework for Object Detection , 2020, ArXiv.

[11]  Le Lu,et al.  DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning , 2018, Journal of medical imaging.

[12]  Yuan Zhang,et al.  FocalMix: Semi-Supervised Learning for 3D Medical Image Detection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Harri Valpola,et al.  Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[14]  Youbao Tang,et al.  MULAN: Multitask Universal Lesion Analysis Network for Joint Lesion Detection, Tagging, and Segmentation , 2019, MICCAI.

[15]  Stephanie R Wilson,et al.  Contrast-enhanced US Approach to the Diagnosis of Focal Liver Masses. , 2017, Radiographics : a review publication of the Radiological Society of North America, Inc.

[16]  Ling Shao,et al.  Collaborative Learning of Semi-Supervised Segmentation and Classification for Medical Images , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Mohammad Havaei,et al.  HeMIS: Hetero-Modal Image Segmentation , 2016, MICCAI.

[18]  Kaiqi Huang,et al.  MVP-Net: Multi-view FPN with Position-aware Attention for Deep Universal Lesion Detection , 2019, MICCAI.

[19]  Konstantina S. Nikita,et al.  A computer-aided diagnostic system to characterize CT focal liver lesions: design and optimization of a neural network classifier , 2003, IEEE Transactions on Information Technology in Biomedicine.

[20]  Xingyi Zhou,et al.  Objects as Points , 2019, ArXiv.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Jing Xiao,et al.  Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-rays , 2020, IPMI.

[23]  Ronald M. Summers,et al.  3D Context Enhanced Region-based Convolutional Neural Network for End-to-End Lesion Detection , 2018, MICCAI.

[24]  R. Castellino,et al.  Computer aided detection (CAD): an overview , 2005, Cancer imaging : the official publication of the International Cancer Imaging Society.

[25]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[26]  Kenji Suzuki A review of computer-aided diagnosis in thoracic and colonic imaging. , 2012, Quantitative imaging in medicine and surgery.

[27]  Shifeng Zhang,et al.  Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Peter Vajda,et al.  Unbiased Teacher for Semi-Supervised Object Detection , 2021, ICLR.

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Nong Xiao,et al.  ElixirNet: Relation-aware Network Architecture Adaptation for Medical Lesion Detection , 2020, AAAI.