Analysis of Architecture Combining Convolutional Neural Network (CNN) and Kernel K-Means Clustering for Lung Cancer Diagnosis

In this paper, we proposed the modified deep learning method that combined Convolutional Neural Network (CNN) and Kernel K-Means clustering for lung cancer diagnosis. The Anti-PD-1 Immunotherapy Lung dataset obtained from The Cancer Imaging Archive was used to evaluate our proposed method. From this dataset, we use 400 Magnetic Resonance Imaging (MRI) images that manually labeled consists of 150 healthy lung images and 250 lung cancer images. As the first step, all the data was examined through the CNN architecture. The flatten neuron of the feature map for every image resulted from the convolutional layers in CNN is gained and passed through the kernel k-means clustering algorithm. This algorithm then used to obtain the centroid of each cluster that determines the prediction class of every data point in the validation set. The performance of our proposed method was evaluated using several k values in k-fold cross-validation. According to our experiments, our proposed method achieved the highest performance measure with 98.85 percent accuracy, 98.32 percent sensitivity, 99.40 percent precision, 99.39 percent specificity, and 98.86 percent F1-Score when using RBF kernel function with sigma=0.05 in 9-fold cross-validation. Those performance improves 1.31% sensitivity, 1.12% accuracy, 1.11% F1-Score, 0.92% specificity, and 0.91% precision compared to when using 5-fold cross-validation. It is even obtained in less than 8 seconds for passing the dataset to the CNN model and 40 ± 0.77 seconds for examined in kernel k-means clustering. Therefore, it was proved that our proposed method has an efficient and promised performance for lung cancer diagnosis from MRI images.