Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?

Training a deep convolutional neural network (CNN) from scratch is difficult because it requires a large amount of labeled training data and a great deal of expertise to ensure proper convergence. A promising alternative is to fine-tune a CNN that has been pre-trained using, for instance, a large set of labeled natural images. However, the substantial differences between natural and medical images may advise against such knowledge transfer. In this paper, we seek to answer the following central question in the context of medical image analysis: Can the use of pre-trained deep CNNs with sufficient fine-tuning eliminate the need for training a deep CNN from scratch? To address this question, we considered four distinct medical imaging applications in three specialties (radiology, cardiology, and gastroenterology) involving classification, detection, and segmentation from three different imaging modalities, and investigated how the performance of deep CNNs trained from scratch compared with the pre-trained CNNs fine-tuned in a layer-wise manner. Our experiments consistently demonstrated that 1) the use of a pre-trained CNN with adequate fine-tuning outperformed or, in the worst case, performed as well as a CNN trained from scratch; 2) fine-tuned CNNs were more robust to the size of training sets than CNNs trained from scratch; 3) neither shallow tuning nor deep tuning was the optimal choice for a particular application; and 4) our layer-wise fine-tuning scheme could offer a practical way to reach the best performance for the application at hand based on the amount of available data.

[1]  Luca Maria Gambardella,et al.  Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks , 2013, MICCAI.

[2]  Gerard Lacey,et al.  Indistinct Frame Detection in Colonoscopy Videos , 2009, 2009 13th International Machine Vision and Image Processing Conference.

[3]  Fernando Vilariño,et al.  Impact of image preprocessing methods on polyp localization in colonoscopy frames , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[4]  Atsuto Maki,et al.  Factors of Transferability for a Generic ConvNet Representation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Yann LeCun,et al.  Understanding Deep Architectures using a Recursive Convolutional Network , 2013, ICLR.

[6]  Shuiwang Ji,et al.  Deep convolutional neural networks for multi-modality isointense infant brain image segmentation , 2015, NeuroImage.

[7]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8]  Jung-Hwan Oh,et al.  Polyp Detection in Colonoscopy Video using Elliptical Shape Feature , 2007, 2007 IEEE International Conference on Image Processing.

[9]  Fernando Vilariño,et al.  Towards automatic polyp detection with a polyp appearance model , 2012, Pattern Recognit..

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  Jinbo Bi,et al.  Computer Aided Detection of Pulmonary Embolism with Tobogganing and Mutiple Instance Classification in CT Pulmonary Angiography , 2007, IPMI.

[12]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[13]  H. El‐Serag,et al.  OOutcomes of Colorectal Cancer in the United States: No Change in Survival (1986–1997) , 2003, American Journal of Gastroenterology.

[14]  Yu Zhang,et al.  ECG-based frame selection and curvature-based ROI detection for measuring carotid intima-media thickness , 2014, Medical Imaging.

[15]  Ronald M. Summers,et al.  Deep convolutional networks for pancreas segmentation in CT imaging , 2015, Medical Imaging.

[16]  Matthew T. Freedman,et al.  Artificial convolution neural network techniques and applications for lung nodule detection , 1995, IEEE Trans. Medical Imaging.

[17]  Justin A. Blanco,et al.  Modeling electroencephalography waveforms with semi-supervised deep belief nets: fast classification and anomaly measurement , 2011, Journal of neural engineering.

[18]  Darrin C. Edwards,et al.  Maximum likelihood fitting of FROC curves under an initial-detection-and-candidate-analysis model. , 2002, Medical physics.

[19]  Dimitrios K. Iakovidis,et al.  A comparative study of texture features for the discrimination of gastric polyps in endoscopic video , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[20]  Nima Tajbakhsh,et al.  A Classification-Enhanced Vote Accumulation Scheme for Detecting Colonic Polyps , 2013, Abdominal Imaging.

[21]  Mel Herbert,et al.  The mortality of untreated pulmonary embolism in emergency department patients. , 2005, Annals of emergency medicine.

[22]  Gelareh Sadigh,et al.  Challenges, controversies, and hot topics in pulmonary embolism imaging. , 2011, AJR. American journal of roentgenology.

[23]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  H. El‐Serag,et al.  Outcomes of colorectal cancer in the United States , 2003 .

[25]  Dimitris A. Karras,et al.  Computer-aided tumor detection in endoscopic video using color wavelet features , 2003, IEEE Transactions on Information Technology in Biomedicine.

[26]  Nima Tajbakhsh,et al.  A Comprehensive Computer-Aided Polyp Detection System for Colonoscopy Videos , 2015, IPMI.

[27]  D. Heresbach,et al.  Miss rate for colorectal neoplastic polyps: a prospective multicenter study of back-to-back video colonoscopies , 2008, Endoscopy.

[28]  Miguel Ángel Guevara-López,et al.  Convolutional neural networks for mammography mass lesion classification , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[29]  Hayit Greenspan,et al.  Deep learning with non-medical training used for chest pathology identification , 2015, Medical Imaging.

[30]  Ronald M. Summers,et al.  A New 2.5D Representation for Lymph Node Detection Using Random Sets of Deep Convolutional Neural Network Observations , 2014, MICCAI.

[31]  Luís A. Alexandre,et al.  Color and Position versus Texture Features for Endoscopic Polyp Detection , 2008, 2008 International Conference on BioMedical Engineering and Informatics.

[32]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[33]  Nima Tajbakhsh,et al.  Automatic polyp detection from learned boundaries , 2014, 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI).

[34]  A. M. Leufkens,et al.  Factors influencing the miss rate of polyps in a back-to-back colonoscopy study , 2012, Endoscopy.

[35]  Nima Tajbakhsh,et al.  Computer-Aided Pulmonary Embolism Detection Using a Novel Vessel-Aligned Multi-planar Image Representation and Convolutional Neural Networks , 2015, MICCAI.

[36]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[37]  Christos P. Loizou,et al.  Segmentation of the Common Carotid Intima-Media Complex in Ultrasound Images Using Active Contours , 2012, IEEE Transactions on Biomedical Engineering.

[38]  Nima Tajbakhsh,et al.  Automating Carotid Intima-Media Thickness Video Interpretation with Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Luca Maria Gambardella,et al.  Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images , 2012, NIPS.

[40]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[41]  Peter Lance,et al.  Analysis of colorectal cancer occurrence during surveillance colonoscopy in the dietary Polyp Prevention Trial. , 2004, Gastrointestinal endoscopy.

[42]  Dorin Comaniciu,et al.  3D Deep Learning for Efficient and Robust Landmark Detection in Volumetric Data , 2015, MICCAI.

[43]  Nima Tajbakhsh,et al.  Automated Polyp Detection in Colonoscopy Videos Using Shape and Context Information , 2016, IEEE Transactions on Medical Imaging.

[44]  Nicholas Ayache,et al.  Fine-tuned convolutional neural nets for cardiac MRI acquisition plane recognition , 2017, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[45]  Ronald M. Summers,et al.  Interleaved Text/Image Deep Mining on a Large-Scale Radiology Image Database , 2017, Deep Learning and Convolutional Neural Networks for Medical Image Computing.

[46]  Sun Young Park,et al.  A Colon Video Analysis Framework for Polyp Detection , 2012, IEEE Transactions on Biomedical Engineering.

[47]  Jefersson Alex dos Santos,et al.  Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[48]  Gerald Penn,et al.  Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[49]  Demetri Terzopoulos,et al.  United Snakes , 1999, Medical Image Anal..

[50]  D. Hubel,et al.  Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[51]  Gustavo Carneiro,et al.  Unregistered Multiview Mammogram Analysis with Pre-trained Deep Learning Models , 2015, MICCAI.

[52]  Christian Igel,et al.  Deep Feature Learning for Knee Cartilage Segmentation Using a Triplanar Convolutional Neural Network , 2013, MICCAI.

[53]  Jung-Hwan Oh,et al.  Informative frame classification for endoscopy video , 2007, Medical Image Anal..

[54]  Christopher Joseph Pal,et al.  Brain tumor segmentation with Deep Neural Networks , 2015, Medical Image Anal..

[55]  Nima Tajbakhsh,et al.  Automatic polyp detection in colonoscopy videos using an ensemble of convolutional neural networks , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[56]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[57]  Atsuto Maki,et al.  From generic to specific deep representations for visual recognition , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[58]  Jung-Hwan Oh,et al.  Part-Based Multiderivative Edge Cross-Sectional Profiles for Polyp Detection in Colonoscopy , 2014, IEEE Journal of Biomedical and Health Informatics.

[59]  Nima Tajbakhsh,et al.  Automatic Polyp Detection Using Global Geometric Constraints and Local Intensity Variation Patterns , 2014, MICCAI.

[60]  Antonio González-López,et al.  Automatic Evaluation of Carotid Intima-Media Thickness in Ultrasounds Using Machine Learning , 2013, IWINAC.

[61]  Bram van Ginneken,et al.  Off-the-shelf convolutional neural network features for pulmonary nodule detection in computed tomography scans , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[62]  Marcin Polkowski,et al.  CT colonography versus colonoscopy for the detection of advanced neoplasia. , 2008, The New England journal of medicine.

[63]  K Doi,et al.  Computerized detection of clustered microcalcifications in digital mammograms using a shift-invariant artificial neural network. , 1994, Medical physics.

[64]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[65]  Hao Chen,et al.  Standard Plane Localization in Fetal Ultrasound via Domain Transferred Deep Neural Networks , 2015, IEEE Journal of Biomedical and Health Informatics.

[66]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[67]  Georg Langs,et al.  Unsupervised Pre-training Across Image Domains Improves Lung Tissue Classification , 2014, MCV.

[68]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[69]  Pascal Vincent,et al.  The Difficulty of Training Deep Architectures and the Effect of Unsupervised Pre-Training , 2009, AISTATS.

[70]  José-Luis Sancho-Gómez,et al.  Fully automatic segmentation of ultrasound common carotid artery images based on machine learning , 2015, Neurocomputing.

[71]  P. Bossuyt,et al.  Polyp Miss Rate Determined by Tandem Colonoscopy: A Systematic Review , 2006, The American Journal of Gastroenterology.

[72]  K L Lam,et al.  Computer-aided detection of mammographic microcalcifications: pattern recognition with an artificial neural network. , 1995, Medical physics.

[73]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[74]  Yuan Zhou,et al.  Ultrasound intima-media segmentation using Hough transform and dual snake model , 2012, Comput. Medical Imaging Graph..

[75]  J. Fairfield,et al.  Toboggan contrast enhancement for contrast segmentation , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[76]  Ronald M. Summers,et al.  Holistic classification of CT attenuation patterns for interstitial lung diseases via deep convolutional neural networks , 2018, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[77]  T. Ponchon3,et al.  Miss rate for colorectal neoplastic polyps: a prospective multicenter study of back-to-back video colonoscopies , 2008 .

[78]  Nima Tajbakhsh,et al.  Automatic Assessment of Image Informativeness in Colonoscopy , 2014, ABDI@MICCAI.