论文信息 - Comparing two classes of end-to-end machine-learning models in lung nodule detection and classification: MTANNs vs. CNNs - 字舞流文

Comparing two classes of end-to-end machine-learning models in lung nodule detection and classification: MTANNs vs. CNNs

End-to-end learning machines enable a direct mapping from the raw input data to the desired outputs, eliminating the need for hand-crafted features. Despite less engineering effort than the hand-crafted counterparts, these learning machines achieve extremely good results for many computer vision and medical image analysis tasks. Two dominant classes of end-to-end learning machines are massive-training artificial neural networks (MTANNs) and convolutional neural networks (CNNs). Although MTANNs have been actively used for a number of medical image analysis tasks over the past two decades, CNNs have recently gained popularity in the field of medical imaging. In this study, we have compared these two successful learning machines both experimentally and theoretically. For that purpose, we considered two well-studied topics in the field of medical image analysis: detection of lung nodules and distinction between benign and malignant lung nodules in computed tomography (CT). For a thorough analysis, we used 2 optimized MTANN architectures and 4 distinct CNN architectures that have different depths. Our experiments demonstrated that the performance of MTANNs was substantially higher than that of CNN when using only limited training data. With a larger training dataset, the performance gap became less evident even though the margin was still significant. Specifically, for nodule detection, MTANNs generated 2.7 false positives per patient at 100% sensitivity, which was significantly (p<0.05) lower than the best performing CNN model with 22.7 false positives per patient at the same level of sensitivity. For nodule classification, MTANNs yielded an area under the receiver-operating-characteristic curve (AUC) of 0.8806 (95% CI: 0.83890.9223), which was significantly (p<0.05) greater than the best performing CNN model with an AUC of 0.7755 (95% CI: 0.71200.8270). Thus, with limited training data, MTANNs would be a suitable end-to-end machine-learning model for detection and classification of focal lesions that do not require high-level semantic features. HighlightsMTANNs yielded higher performance than CNNs for nodule detection and classification.Deep CNN architectures achieved higher performance than shallow architectures for nodule detection.CNN architectures with varying depths performed comparably for nodule classification.MTANNs can achieve desired performance with a smaller training dataset than do the CNNs.MTANNs tend to learn the appearance of lesion parts, whereas CNNs attempt to learn the lesion appearance as a whole.

Nima Tajbakhsh | Kenji Suzuki | Kenji Suzuki | Nima Tajbakhsh

[1] Darrin C. Edwards,et al. Maximum likelihood fitting of FROC curves under an initial-detection-and-candidate-analysis model. , 2002, Medical physics.

[2] S. Armato,et al. Massive training artificial neural network (MTANN) for reduction of false positives in computerized detection of lung nodules in low-dose computed tomography. , 2003, Medical physics.

[3] K. Doi,et al. Investigation of new psychophysical measures for evaluation of similar images on thoracic computed tomography for distinction between benign and malignant nodules. , 2003, Medical physics.

[4] Gabriela Csurka,et al. Visual categorization with bags of keypoints , 2002, eccv 2004.

[5] Rich Caruana,et al. Do Deep Nets Really Need to be Deep? , 2013, NIPS.

[6] Kenji Suzuki,et al. CT colonography: advanced computer-aided detection scheme utilizing MTANNs for detection of "missed" polyps in a multicenter clinical trial. , 2009, Medical physics.

[7] Hang Li,et al. Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[8] Kunihiko Fukushima,et al. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[9] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[10] Kunio Doi,et al. Computer-aided diagnostic scheme for distinction between benign and malignant nodules in thoracic low-dose CT by use of massive training artificial neural network , 2005, IEEE Transactions on Medical Imaging.

[11] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[13] Josephine Sullivan,et al. Improved Boosting Performance by Explicit Handling of Ambiguous Positive Examples , 2013, ICPRAM.

[14] Nima Tajbakhsh,et al. Computer-Aided Pulmonary Embolism Detection Using a Novel Vessel-Aligned Multi-planar Image Representation and Convolutional Neural Networks , 2015, MICCAI.

[15] Shuiwang Ji,et al. Deep convolutional neural networks for multi-modality isointense infant brain image segmentation , 2015, NeuroImage.

[17] Ken-ichi Suzuki,et al. Neural Filter with Selection of Input Features and Its Application to Image Quality Improvement of Medical Image Sequences , 2002 .

[18] Sid Deutsch,et al. A simplified version of Kunihiko Fukushima's neocognitron , 1981, Biological Cybernetics.

[19] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20] Vijay S. Pande,et al. Massively Multitask Networks for Drug Discovery , 2015, ArXiv.

[21] Hiroyuki Yoshida,et al. Massive-training artificial neural network (MTANN) for reduction of false positives in computer-aided detection of polyps: Suppression of rectal tubes. , 2006, Medical physics.

[22] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[23] Kenji Suzuki,et al. Massive-training support vector regression and Gaussian process for false-positive reduction in computer-aided detection of polyps in CT colonography. , 2011, Medical physics.

[24] S. Armato,et al. Lung cancers missed at low-dose helical CT screening in a general population: comparison of clinical, histopathologic, and imaging findings. , 2002, Radiology.

[25] Alexander M. Rush,et al. Character-Aware Neural Language Models , 2015, AAAI.

[26] Ronald M. Summers,et al. A New 2.5D Representation for Lymph Node Detection Using Random Sets of Deep Convolutional Neural Network Observations , 2014, MICCAI.

[27] S. Armato,et al. Mixture of expert 3D massive-training ANNs for reduction of multiple types of false positives in CAD for detection of polyps in CT colonography. , 2008, Medical physics.

[28] Hayit Greenspan,et al. Deep learning with non-medical training used for chest pathology identification , 2015, Medical Imaging.

[29] Feng Li,et al. Mass screening for lung cancer with mobile spiral computed tomography scanner , 1998, The Lancet.

[30] Ronald M. Summers,et al. DeepOrgan: Multi-level Deep Convolutional Networks for Automated Pancreas Segmentation , 2015, MICCAI.

[31] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[32] Kunihiko Fukushima. Neocognitron capable of incremental learning , 2004, Neural Networks.

[33] Nima Tajbakhsh,et al. Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? , 2016, IEEE Transactions on Medical Imaging.

[34] Nima Tajbakhsh,et al. A Comprehensive Computer-Aided Polyp Detection System for Colonoscopy Videos , 2015, IPMI.

[35] Nicholas Ayache,et al. Fine-tuned convolutional neural nets for cardiac MRI acquisition plane recognition , 2017, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[36] Kenji Suzuki,et al. Efficient approximation of neural filters for removing quantum noise from images , 2002, IEEE Trans. Signal Process..

[37] Kunio Doi,et al. How can a massive training artificial neural network (MTANN) be trained with a small number of cases in the distinction between nodules and vessels in thoracic CT? , 2005, Academic radiology.

[38] Kenji Suzuki,et al. Massive-Training Artificial Neural Network Coupled With Laplacian-Eigenfunction-Based Dimensionality Reduction for Computer-Aided Detection of Polyps in CT Colonography , 2010, IEEE Transactions on Medical Imaging.

[39] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[40] Wei Shen,et al. Multi-scale Convolutional Neural Networks for Lung Nodule Classification , 2015, IPMI.

[41] Ronald M. Summers,et al. Improving Computer-Aided Detection Using Convolutional Neural Networks and Random View Aggregation , 2015, IEEE Transactions on Medical Imaging.

[42] Nima Tajbakhsh,et al. Automatic polyp detection in colonoscopy videos using an ensemble of convolutional neural networks , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[43] Sepp Hochreiter,et al. Toxicity Prediction using Deep Learning , 2015, ArXiv.

[44] Izhar Wallach,et al. AtomNet: A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery , 2015, ArXiv.

[45] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46] Jun Zhao,et al. Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[47] Lawrence D. Jackel,et al. Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.