Support Vector Machine Model for Diagnosing Pneumoconiosis Based on Wavelet Texture Features of Digital Chest Radiographs

This study aims to explore the classification ability of decision trees (DTs) and support vector machines (SVMs) to discriminate between the digital chest radiographs (DRs) of pneumoconiosis patients and control subjects. Twenty-eight wavelet-based energy texture features were calculated at the lung fields on DRs of 85 healthy controls and 40 patients with stage I and stage II pneumoconiosis. DTs with algorithm C5.0 and SVMs with four different kernels were trained by samples with two combinations of the texture features to classify a DR as of a healthy subject or of a patient with pneumoconiosis. All of the models were developed with fivefold cross-validation, and the final performances of each model were compared by the area under receiver operating characteristic (ROC) curve. For both SVM (with a radial basis function kernel) and DT (with algorithm C5.0), areas under ROC curves (AUCs) were 0.94 ± 0.02 and 0.86 ± 0.04 (P = 0.02) when using the full feature set and 0.95 ± 0.02 and 0.88 ± 0.04 (P = 0.05) when using the selected feature set, respectively. When built on the selected texture features, the SVM with a polynomial kernel showed a higher diagnostic performance with an AUC value of 0.97 ± 0.02 than SVMs with a linear kernel, a radial basis function kernel and a sigmoid kernel with AUC values of 0.96 ± 0.02 (P = 0.37), 0.95 ± 0.02 (P = 0.24), and 0.90 ± 0.03 (P = 0.01), respectively. The SVM model with a polynomial kernel built on the selected feature set showed the highest diagnostic performance among all tested models when using either all the wavelet texture features or the selected ones. The model has a good potential in diagnosing pneumoconiosis based on digital chest radiographs.

[1]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[2]  Jun-ichi Hasegawa,et al.  Automated classification of pneumoconiosis radiographs based on recognition of small rounded opacities , 1990, Systems and Computers in Japan.

[3]  Liang-ping Hu,et al.  Performance comparison between Logistic regression, decision trees, and multilayer perceptron in predicting peripheral neuropathy in type 2 diabetes mellitus. , 2012, Chinese medical journal.

[4]  Chao Yang,et al.  An Automatic Computer-Aided Detection Scheme for Pneumoconiosis on Digital Chest Radiographs , 2011, Journal of Digital Imaging.

[5]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[6]  E. Hall,et al.  Computer Classification of Pneumoconiosis from Radiographs of Coal Workers , 1975, IEEE Transactions on Biomedical Engineering.

[7]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.

[8]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[9]  M. Marjanović,et al.  Landslide susceptibility assessment using SVM machine learning algorithm , 2011 .

[10]  Lubomir M. Hadjiiski,et al.  Classifier performance prediction for computer-aided diagnosis using a limited dataset. , 2008, Medical physics.

[11]  Hui Chen,et al.  Computer-Aided Diagnosis for Pneumoconiosis Based on Texture Analysis on Digital Chest Radiographs , 2012 .

[12]  Andrzej Materka,et al.  DISCRETE WAVELET TRANSFORM – DERIVED FEATURES FOR DIGITAL IMAGE TEXTURE ANALYSIS , 2002 .

[13]  A. Fenster,et al.  Prostate cancer diagnosis based on Gabor filter texture segmentation of ultrasound image , 2003, CCECE 2003 - Canadian Conference on Electrical and Computer Engineering. Toward a Caring and Humane Technology (Cat. No.03CH37436).

[14]  A. M. Savol,et al.  Computer-aided recognition of small rounded pneumoconiosis opacities in chest X-rays , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Jing Zhang,et al.  Performance comparison of artificial neural network and logistic regression model for differentiating lung nodules on CT scans , 2012, Expert Syst. Appl..

[16]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[17]  Lubomir M. Hadjiiski,et al.  Effect of finite sample size on feature selection and classification: a simulation study. , 2010, Medical physics.

[18]  Marios S. Pattichis,et al.  Multiscale AM-FM analysis of pneumoconiosis x-ray images , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[19]  Lior Rokach,et al.  Data Mining and Knowledge Discovery Handbook, 2nd ed , 2010, Data Mining and Knowledge Discovery Handbook, 2nd ed..

[20]  V. Sugumaran,et al.  Effect of SVM kernel functions on classification of vibration signals of a single point cutting tool , 2011, Expert Syst. Appl..

[21]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[22]  Hiroshi Kondo,et al.  Automated quantitative analysis for pneumoconiosis , 1998, Other Conferences.

[23]  José Francisco Martínez Trinidad,et al.  General framework for class-specific feature selection , 2011, Expert Syst. Appl..

[24]  Hui Chen,et al.  Morphological Reconstruction Based Segmentation of Lung Fields on Digital Radiographs , 2012 .

[25]  Arivazhagan Selvaraj,et al.  Texture segmentation using wavelet transform , 2003, Pattern Recognit. Lett..

[26]  Guozhen Zhang,et al.  Feature Selection and Performance Evaluation of Support Vector Machine (SVM)-Based Classifier for Differentiating Benign and Malignant Pulmonary Nodules by Computed Tomography , 2010, Journal of Digital Imaging.

[27]  Dawei Han,et al.  Artificial intelligence techniques for clutter identification with polarimetric radar signatures , 2012 .

[28]  Takayuki Ishida,et al.  Computerized Analysis of Pneumoconiosis in Digital Chest Radiography: Effect of Artificial Neural Network Trained with Power Spectra , 2011, Journal of Digital Imaging.

[29]  Wei-Yin Loh,et al.  A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms , 2000, Machine Learning.

[30]  Weizhong Yan,et al.  Computer Aided Detection for Pneumoconiosis Screening on Digital Chest Radiographs , 2010 .

[31]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[32]  Min-Ying Su,et al.  Prediction of malignant breast lesions from MRI features: a comparison of artificial neural network and logistic regression techniques. , 2009, Academic radiology.

[33]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.