A Comparative Analysis of Rough Set Based Intelligent Techniques for Unsupervised Gene Selection

As microarray databases grow in dimensionality and complexity, identifying the most informative genes becomes a challenging task. The difficulty stems largely from the huge number of genes relative to the very few samples available. Research in medical data mining addresses this problem by applying data mining and machine learning techniques to microarray datasets. This paper proposes Unsupervised Tolerance Rough Set based Quick Reduct (U-TRS-QR), a distinct feature selection algorithm that extends existing equivalence-based rough sets to unsupervised learning. Genes selected by the proposed method lead to considerably improved class predictions in extensive experiments on two gene expression datasets: Brain Tumor and Colon Cancer. The results indicate consistent improvement across 12 classifiers.
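The abstract does not spell out the steps of U-TRS-QR, so the following is only a minimal sketch of how a tolerance rough set based unsupervised quick reduct could look, not the authors' implementation. It assumes a range-normalised per-feature similarity, a tolerance relation defined by mean similarity reaching a threshold `tau`, and a greedy search that maximises the mean dependency of every feature on the selected subset. The function names, the similarity measure, and the default `tau` are all hypothetical choices made for illustration.

```python
import numpy as np

def tolerance_classes(X, features, tau):
    """Boolean matrix T with T[i, j] True when samples i and j are tolerant
    on `features` (assumed rule: mean per-feature similarity >= tau)."""
    cols = X[:, features]
    span = cols.max(axis=0) - cols.min(axis=0)
    span[span == 0] = 1.0  # constant feature: every pair is fully similar
    sim = 1.0 - np.abs(cols[:, None, :] - cols[None, :, :]) / span
    return sim.mean(axis=2) >= tau

def dependency(X, subset, target, tau):
    """Share of samples whose tolerance class under `subset` lies inside
    some tolerance class of the single feature `target`."""
    tp = tolerance_classes(X, subset, tau)
    tq = tolerance_classes(X, [target], tau)
    # violations[x, y] is True when the class of x is NOT contained in the class of y
    violations = (tp[:, None, :] & ~tq[None, :, :]).any(axis=2)
    return (~violations).any(axis=1).mean()

def mean_dependency(X, subset, tau):
    """Average dependency of every feature on `subset` (no class labels used)."""
    return float(np.mean([dependency(X, subset, a, tau) for a in range(X.shape[1])]))

def unsupervised_tolerance_quick_reduct(X, tau=0.9):
    """Greedily add the feature that most increases the mean dependency
    until it matches the value obtained with all features."""
    X = np.asarray(X, dtype=float)
    n_features = X.shape[1]
    full = mean_dependency(X, list(range(n_features)), tau)
    reduct, best = [], 0.0
    while best < full - 1e-12 and len(reduct) < n_features:
        candidates = [f for f in range(n_features) if f not in reduct]
        scores = [mean_dependency(X, reduct + [f], tau) for f in candidates]
        idx = int(np.argmax(scores))
        reduct.append(candidates[idx])
        best = scores[idx]
    return reduct
```

Calling `unsupervised_tolerance_quick_reduct(X, tau=0.9)` on a samples-by-genes matrix returns the indices of the selected genes. The containment check builds an array of size samples x samples x samples, which is acceptable for microarray data with few samples but would need restructuring for larger sample counts.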
