Multi-Instance Multi-Stage Deep Learning for Medical Image Recognition

Abstract Over the last few decades, as the amount of annotated medical data is increasing speedily, deep learning-based approaches have been attracting more attention and enjoyed a great success in the medical imaging field, including computer-aided diagnosis, image segmentation, image registration, image database retrieval, and so on. In medical image recognition problems, by using beautiful biologically-inspired architectures, deep learning is able to learn a hierarchical representation of data to distinguish different image classes. However, if the discriminative information only lies in local image patches, classic deep learning framework may still have limitations in discovering them without local-level annotations. In this chapter, we introduce a multi-stage deep learning framework that aims to automatically discover local discriminative information for medical image classification and apply it on body part recognition in CT. Specifically, the pre-train stage of the proposed framework automatically discovers the local regions that are either discriminative or non-informative to the image classification problem by learning a convolutional neural network (CNN) in a multi-instance learning fashion. Then, the pre-learned CNN is fine-tuned based on these discovered local regions to produce an efficient image-level classifier. In this way, no manual annotation of local regions is required. Our method is validated on synthetic datasets and a large scale CT dataset with several thousands of images. It achieves better performances than conventional learning approaches using ad-hoc designed image features and the standard deep CNN.

[1]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Antonio Criminisi,et al.  Regression Forests for Efficient Anatomy Detection and Localization in CT Studies , 2010, MCV.

[3]  Nassir Navab,et al.  AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images , 2016, IEEE Trans. Medical Imaging.

[4]  S. Resnick,et al.  Measuring Size and Shape of the Hippocampus in MR Images Using a Deformable Shape Model , 2002, NeuroImage.

[5]  Dimitris N. Metaxas,et al.  Automated anatomical landmark detection ondistal femur surface using convolutional neural network , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[6]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.

[7]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[8]  Tomás Lozano-Pérez,et al.  A Framework for Multiple-Instance Learning , 1997, NIPS.

[9]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[10]  Bingbing Ni,et al.  HCP: A Flexible CNN Framework for Multi-Label Image Classification , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Demetri Terzopoulos,et al.  Deformable models in medical image analysis: a survey , 1996, Medical Image Anal..

[12]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[13]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[14]  Yiqiang Zhan,et al.  Active Scheduling of Organ Detection and Segmentation in Whole-Body Medical Images , 2008, MICCAI.

[15]  Junzhou Huang,et al.  Towards robust and effective shape modeling: Sparse shape composition , 2012, Medical Image Anal..

[16]  Yann LeCun,et al.  Traffic sign recognition with multi-scale Convolutional Networks , 2011, The 2011 International Joint Conference on Neural Networks.

[17]  Horst Bischof,et al.  Global localization of 3D anatomical structures by pre-filtered Hough Forests and discrete optimization , 2013, Medical Image Anal..

[18]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[19]  D. Hill,et al.  Medical image registration , 2001, Physics in medicine and biology.

[20]  Dumitru Erhan,et al.  Deep Neural Networks for Object Detection , 2013, NIPS.

[21]  Meng Wang,et al.  3D deep shape descriptor , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[23]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Gernot Brockmann,et al.  Automatic Aorta Segmentation and Valve Landmark Detection in C-Arm CT: Application to Aortic Valve Implantation , 2010, MICCAI.

[25]  R. Bellotti,et al.  A CAD system for nodule detection in low-dose lung CTs based on region growing and a new active contour model. , 2007, Medical physics.

[26]  Michael Kohnen,et al.  Quality of DICOM header information for image categorization , 2002, SPIE Medical Imaging.

[27]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[28]  Jiajun Wu,et al.  Deep multiple instance learning for image classification and auto-annotation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Ning Qian,et al.  On the momentum term in gradient descent learning algorithms , 1999, Neural Networks.

[30]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Lina Arbash Meinel,et al.  Breast MRI lesion classification: Improved performance of human readers with a backpropagation neural network computer‐aided diagnosis (CAD) system , 2007, Journal of magnetic resonance imaging : JMRI.

[32]  Dimitris N. Metaxas,et al.  Deformable segmentation via sparse representation and dictionary learning , 2012, Medical Image Anal..

[33]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[35]  Chunming Li,et al.  A Level Set Method for Image Segmentation in the Presence of Intensity Inhomogeneities With Application to MRI , 2011, IEEE Transactions on Image Processing.

[36]  Shu Liao,et al.  Multi-Instance Deep Learning: Discover Discriminative Local Anatomies for Bodypart Recognition , 2016, IEEE Transactions on Medical Imaging.

[37]  Marios Anthimopoulos,et al.  Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network , 2016, IEEE Transactions on Medical Imaging.

[38]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[39]  Lisa Tang,et al.  Deep 3D Convolutional Encoder Networks With Shortcuts for Multiscale Feature Integration Applied to Multiple Sclerosis Lesion Segmentation , 2016, IEEE Transactions on Medical Imaging.

[40]  Margrit Betke,et al.  Landmark detection in the chest and registration of lung surfaces with an application to nodule registration , 2003, Medical Image Anal..

[41]  Dinggang Shen,et al.  Segmenting Hippocampus from Infant Brains by Sparse Patch Matching with Deep-Learned Features , 2014, MICCAI.

[42]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Shu Liao,et al.  Bodypart Recognition Using Multi-stage Deep Learning , 2015, IPMI.

[44]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[45]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[47]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[48]  Dorin Comaniciu,et al.  Hierarchical, learning-based automatic liver segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  K. Doi,et al.  Current status and future potential of computer-aided diagnosis in medical imaging. , 2005, The British journal of radiology.

[50]  Ronald M. Summers,et al.  Anatomy-specific classification of medical images using deep convolutional nets , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[51]  Ronan Collobert,et al.  From image-level to pixel-level labeling with Convolutional Networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[53]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[54]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[55]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[56]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[57]  Jerry L Prince,et al.  Current methods in medical image segmentation. , 2000, Annual review of biomedical engineering.

[58]  Max A. Viergever,et al.  Automatic Segmentation of MR Brain Images With a Convolutional Neural Network , 2016, IEEE Transactions on Medical Imaging.

[59]  Shuiwang Ji,et al.  Deep convolutional neural networks for multi-modality isointense infant brain image segmentation , 2015, NeuroImage.

[60]  Max A. Viergever,et al.  A survey of medical image registration , 1998, Medical Image Anal..

[61]  George Papandreou,et al.  Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation , 2015, ArXiv.