Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set

Lung cancer is one of the most common forms of cancer, resulting in over a million deaths per year worldwide. This paper investigates support vector machine (SVM) classification for lung cancer diagnosis and presents a systematic quantitative evaluation against boosting, decision trees, k-nearest neighbor, LASSO regression, neural networks, and random forests. A large database of 5984 regions of interest (ROIs) and 488 input features (including textural features, patient characteristics, and morphological features) was used to train the classifiers and evaluate their performance. Classifier performance was assessed within a tenfold cross-validation framework using receiver operating characteristic (ROC) curves and the Matthews correlation coefficient. The areas under the curve (AUC) for SVM, boosting, decision trees, k-nearest neighbor, LASSO, neural networks, and random forests were 0.94, 0.86, 0.73, 0.72, 0.91, 0.92, and 0.85, respectively. The results showed that SVM classification offered significantly increased classification performance compared to the reference methods. This scheme may be used in the future as an auxiliary tool to differentiate between benign and malignant solitary pulmonary nodules (SPNs) in CT images.
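The two evaluation metrics named above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the round-robin fold assignment, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration. The AUC is computed with the standard pairwise-ranking formulation, and the Matthews correlation coefficient from the binary confusion matrix.

```python
from math import sqrt


def kfold_indices(n_samples, k=10):
    """Assign sample indices to k folds round-robin (hypothetical split rule)."""
    return [list(range(fold, n_samples, k)) for fold in range(k)]


def roc_auc(y_true, scores):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def matthews_corrcoef(y_true, y_pred):
    """Matthews correlation coefficient for binary labels in {0, 1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: return 0.0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy data (made up): 1 = malignant, 0 = benign, scores = classifier outputs.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

auc = roc_auc(y_true, scores)
mcc = matthews_corrcoef(y_true, y_pred)
folds = kfold_indices(5984, k=10)  # 5984 ROIs, as in the data set described above
```

In a tenfold scheme, each of the ten folds would serve once as the held-out test set while a classifier is trained on the other nine, and the metrics would then be summarized across folds.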
