Malignant-benign classification of pulmonary nodules based on random forest aided by clustering analysis

To help the radiologists better differentiate the benign from malignant pulmonary nodules on CT images, a novel classification scheme was proposed to improve the performance of benign and malignant classifier of pulmonary nodules. First, the pulmonary nodules were segmented with the references to the results from four radiologists. Then, some basic features of the segmented nodules such as the shape, gray and texture are given by calculation. Finally, malignant-benign classification of pulmonary nodules is performed by using random forest (RF) with the aid of clustering analysis. The data with a set of 952 nodules have been collected from lung image database consortium (LIDC). The effect of proposed classification scheme was verified by three experiments, in which the variant composite rank of malignancy were got from four radiologists (experiment 1: rank of malignancy '1', '2' as benign and '4', '5' as malignant; experiment 2: rank of malignancy '1', '2', '3' as benign and '4', '5' as malignant; experiment 3: rank of malignancy '1', '2' as benign and '3', '4', '5' as malignant) and the corresponding ([Formula: see text]) (area under the receiver operating characteristic curve) are 0.9702, 0.9190 and 0.8662, respectively. It can be drawn that the method in this work can greatly improve the accuracy of the classification of benign and malignant pulmonary nodules based on CT images.

[1]  Hiram Madero Orozco,et al.  Automated system for lung nodules classification based on wavelet feature descriptor and support vector machine , 2015 .

[2]  Ning Xu,et al.  Automated lung nodule segmentation using dynamic programming and EM-based classification , 2002, SPIE Medical Imaging.

[3]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[4]  Mohamed Medhat Gaber,et al.  Diversified Random Forests Using Random Subspaces , 2014, IDEAL.

[5]  A. Akan,et al.  A novel approach to malignant-benign classification of pulmonary nodules by using ensemble learning classifiers , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[6]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[7]  Niranjan Khandelwal,et al.  A Combination of Shape and Texture Features for Classification of Pulmonary Nodules in Lung CT Images , 2016, Journal of Digital Imaging.

[8]  Kunio Doi,et al.  Computer-aided diagnostic scheme for distinction between benign and malignant nodules in thoracic low-dose CT by use of massive training artificial neural network , 2005, IEEE Transactions on Medical Imaging.

[9]  D. Naidich,et al.  Evaluation of individuals with pulmonary nodules: when is it lung cancer? Diagnosis and management of lung cancer, 3rd ed: American College of Chest Physicians evidence-based clinical practice guidelines. , 2013, Chest.

[10]  Ricardo A. M. Valentim,et al.  Computer-aided detection system for lung cancer in computed tomography scans: Review and future prospects , 2014, BioMedical Engineering OnLine.

[11]  Richard C. Pais,et al.  Evaluation of Lung MDCT Nodule Annotation Across Radiologists and Methods 1 , 2006 .

[12]  Ashfaqur Rahman,et al.  Cluster-Oriented Ensemble Classifier: Impact of Multicluster Characterization on Ensemble Classifier Learning , 2012, IEEE Transactions on Knowledge and Data Engineering.

[13]  Michael C. Lee,et al.  Computer-aided diagnosis of pulmonary nodules using a two-step approach for feature selection and classifier ensemble construction , 2010, Artif. Intell. Medicine.

[14]  Berkman Sahiner,et al.  Computer-aided diagnosis of pulmonary nodules on CT scans: improvement of classification performance with nodule surface features. , 2009, Medical physics.

[15]  Niranjan Khandelwal,et al.  Erratum to: A Segmentation Framework of Pulmonary Nodules in Lung CT Images , 2015, Journal of Digital Imaging.

[16]  P. Pattynama,et al.  Receiver operating characteristic (ROC) analysis: basic principles and applications in radiology. , 1998, European journal of radiology.

[17]  Alexander McGregor,et al.  Lung cancer screening using low-dose computed tomography in at-risk individuals: the Toronto experience. , 2010, Lung cancer.

[18]  Antje Venjakob,et al.  Radiologists' eye gaze when reading cranial CT images , 2012, Medical Imaging.

[19]  Jing Zhang,et al.  Performance comparison of artificial neural network and logistic regression model for differentiating lung nodules on CT scans , 2012, Expert Syst. Appl..

[20]  M. Gould Randomized trials and bronchoscopy. , 2013, Chest.

[21]  Steven Guan,et al.  A class decomposition approach for GA-based classifiers , 2005, Eng. Appl. Artif. Intell..

[22]  Wei Shen,et al.  Multi-crop Convolutional Neural Networks for lung nodule malignancy suspiciousness classification , 2017, Pattern Recognit..

[23]  Paulo Mazzoncini de Azevedo Marques,et al.  Selecting relevant 3D image features of margin sharpness and texture for lung nodule retrieval , 2017, International Journal of Computer Assisted Radiology and Surgery.

[24]  Aboul Ella Hassanien,et al.  A random forest classifier for lymph diseases , 2014, Comput. Methods Programs Biomed..

[25]  Anselmo Cardoso de Paiva,et al.  Lung Nodules Classification in CT Images Using Shannon and Simpson Diversity Indices and SVM , 2012, MLDM.

[26]  Geoffrey McLennan,et al.  WE‐B‐201B‐02: The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): A Completed Public Database of CT Scans for Lung Nodule Analysis , 2010 .

[27]  Ashfaqur Rahman,et al.  Ensemble classifier generation using non-uniform layered clustering and Genetic Algorithm , 2013, Knowl. Based Syst..