Soft Set Based Feature Selection Approach for Lung Cancer Images

Lung cancer is the deadliest type of cancer for both men and women. Feature selection plays a vital role in cancer classification. This paper investigates the feature selection process in Computed Tomographic (CT) lung cancer images using soft set theory. We propose a new soft set based unsupervised feature selection algorithm. Nineteen features are extracted from the segmented lung images using gray level co-occurence matrix (GLCM) and gray level different matrix (GLDM). In this paper, an efficient Unsupervised Soft Set based Quick Reduct (SSUSQR) algorithm is presented. This method is used to select features from the data set and compared with existing rough set based unsupervised feature selection methods. Then K-Means and Self Organizing Map (SOM) clustering algorithms are used to cluster the data. The performance of the feature selection algorithms is evaluated based on performance of clustering techniques. The results show that the proposed method effectively removes redundant features.

[1]  Hanan M. Amer,et al.  A Computer-Aided System for Classifying Computed Tomographic (CT) Lung Images Using Artificial Neural Network and Data Fusion , 2011 .

[2]  Johan Karlsson,et al.  Decision Support System for Lung Cancer using PET/CT Images , 2004 .

[3]  Thangavel,et al.  Unsupervised Quick Reduct Algorithm Using Rough Set Theory , 2011 .

[4]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[6]  Constantin F. Aliferis,et al.  Machine Learning Models for Classification of Lung Cancer and Selection of Genomic Markers Using Array Gene Expression Data , 2003, FLAIRS.

[7]  Hao Wang,et al.  Feature Similarity Based Redundancy Reduction for Gene Selection , 2006, DMIN.

[8]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[9]  Ahmad Nazari Mohd Rose,et al.  Soft Set Theoretic Approach for Dimensionality Reduction , 2009, FGIT-DTA.

[10]  Mustafa Mat Deris,et al.  On Multi-soft Sets Construction in Information Systems , 2009, ICIC.

[11]  In-Hee Lee,et al.  A filter-based feature selection approach for identifying potential biomarkers for lung cancer , 2011, Journal of Clinical Bioinformatics.

[12]  Norhalina Senan,et al.  Soft Set Theory for Feature Selection of Traditional Malay Musical Instrument Sounds , 2010, ICICA.

[13]  Sellappan Palaniappan,et al.  Discretization of Continuous Valued Dimensions in OLAP Data Cubes , 2008 .

[14]  D. Molodtsov Soft set theory—First results , 1999 .

[15]  A. R. Roy,et al.  Soft set theory , 2003 .

[16]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[17]  Huanglin Zeng,et al.  A Heuristic Algorithm Based on Attribute Importance for Feature Selection , 2008, RSKT.