Gene Selection Using Neighborhood Rough Set from Gene Expression Profiles

Although adopting feature reduction in classic rough set theory to select informative genes is an effective method, its classification accuracy rate is usually not higher compared with other tumor-related gene selection and tumor classification approaches; for gene expression values must be discretized before gene reduction, which leads to information loss in tumor classification. Therefore, the neighborhood rough set model proposed by Hu Qing-Hua is introduced to tumor classification, which omits the discretization procedure, so no information loss occurs before gene reduction. Experiments on two well-known tumor datasets show that gene selection using neighborhood rough set model obviously outperforms using classic rough set theory and experiment results also prove that the most of the selected gene subset not only has higher accuracy rate but also are related to tumor.

[1]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Qinghua Hu,et al.  Neighborhood classifiers , 2008, Expert Syst. Appl..

[3]  Jan Komorowski,et al.  Learning Rough Set Classifiers from Gene Expressions and Clinical Data , 2002, Fundam. Informaticae.

[4]  Peter Eisert,et al.  Digital watermarking of MPEG-4 facial animation parameters , 1998, Comput. Graph..

[5]  C. F. Osborne,et al.  A digital watermark , 1994, Proceedings of 1st International Conference on Image Processing.

[6]  Bernd Girod,et al.  Watermarking of uncompressed and compressed video , 1998, Signal Process..

[7]  Bijan G. Mobasseri,et al.  Content authentication and tamper detection in digital video , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[8]  Lawrence Carin,et al.  Gene expression analysis: Joint feature selection and classifler design , 2004 .

[9]  Roland Eils,et al.  Mining Gene Expression Data using Rough Set Theory , 1999 .

[10]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[11]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[12]  Qinghua Hu,et al.  Consistency Based Attribute Reduction , 2007, PAKDD.

[13]  Wang Li-na,et al.  Wavelet-based denoising attack on image watermarking , 2008, Wuhan University Journal of Natural Sciences.

[14]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[15]  Jerzy W. Grzymala-Busse,et al.  Leukemia Prediction from Gene Expression Data-A Rough Set Approach , 2006, ICAISC.

[16]  Iwan Setyawan,et al.  Low-bit-rate video watermarking using temporally extended differential energy watermarking (DEW) algorithm , 2001, IS&T/SPIE Electronic Imaging.

[17]  Ingemar J. Cox,et al.  Secure spread spectrum watermarking for multimedia , 1997, IEEE Trans. Image Process..