Rough mereology classifier vs. simple DNA microarray gene extraction methods

This work extends the author's contribution to the Second International Conference of Soft Computing and Pattern Recognition (SocPar 2010) held at the University of Cergy Pontoise in December 2010. The current version is dedicated to the topic of gene separation algorithms and our best classification method based on weighted voting, which was investigated recently by Polkowski and Artiemjew. The DNA microarrays are a popular tool, useful to the research of gene expression. The exemplary application is, among others, to differentiate healthy and ill tissues, distinguishing some organisms' features or to check changes in gene expression by means of some additional factors. The huge amount of information obtained from DNA microarrays in the range of tens of thousands of genes causes many difficulties. Many algorithms, especially brute force methods, cannot be applied for this reason, and due to the low number of training objects tend to overfit. In this paper, we present two simple gene extraction methods compared by means of our best weighted voting classifier. The results of research show the high effectiveness of our approach, and full comparability with the results of the recent DNA microarray data mining competition.

[1]  Lech Polkowski,et al.  On Classifying Mappings Induced by Granular Structures , 2008, Trans. Rough Sets.

[2]  Lech Polkowski,et al.  A Study in Granular Computing: On Classifiers Induced from Granular Reflections of Data , 2008, Trans. Rough Sets.

[3]  Piotr Artiemjew,et al.  The Extraction Method of DNA Microarray Features Based on Modified F Statistics vs. Classifier Based on Rough Mereology , 2011, ISMIS.

[4]  Lech Polkowski A Unified Approach to Granulation of Knowledge and Granular Computing Based on Rough Mereology: A Survey , 2008 .

[5]  Piotr Artiemjew,et al.  Classifiers based on rough mereology in analysis of DNA microarray data , 2010, 2010 International Conference of Soft Computing and Pattern Recognition.

[6]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[7]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[8]  Piotr Artiemjew,et al.  The Extraction Method of DNA Microarray Features Based on Experimental A Statistics , 2011, RSKT.

[9]  Petr Hájek,et al.  Metamathematics of Fuzzy Logic , 1998, Trends in Logic.

[10]  Piotr Artiemjew,et al.  Natural versus Granular Computing: Classifiers from Granular Structures , 2008, RSCTC.

[11]  Guoyin Wang,et al.  RSCTC'2010 Discovery Challenge: Mining DNA Microarray Data for Medical Diagnosis and Treatment , 2010, RSCTC.

[12]  Lotfi A. Zadeh,et al.  Fuzzy sets and information granularity , 1996 .

[13]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[14]  Piotr Artiemjew,et al.  DNA microarray classification by means of weighted voting based on rough set classifier , 2010, 2010 International Conference of Soft Computing and Pattern Recognition.