Simple rule-based ensemble classifiers for cancer DNA microarray data classification

DNA microarray, which is one of the most important molecular biology technologies in post-genomic era, has been widely applied in medical field, especially for cancer classification. However, it is difficult to acquire excellent classification accuracy by using traditional classification approaches due to microarray datasets are extremely asymmetric in dimensionality. In recent years, ensemble classifiers which may obtain better classification accuracy and robustness have attracted more interests in this field but it is more time-consuming. Therefore, this paper proposed a novel ensemble classification method named as SREC(Simple Rule-based Ensemble Classifiers). Firstly, the classification contribution of each gene is evaluated by a novel strategy and the corresponding classification rule is extracted. Then we rank all genes to select some important ones. At last, the rules of the selected genes are assembled by weighted-voting to make decision for testing samples. It has been demonstrated the proposed method may improve classification accuracy with lower time-complexity than traditional classification methods.

[1]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[2]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[4]  Yonghong Peng,et al.  A novel ensemble machine learning for robust microarray data classification , 2006, Comput. Biol. Medicine.

[5]  Sung-Bae Cho,et al.  An Evolutionary Algorithm Approach to Optimal Ensemble Classifiers for DNA Microarray Data Analysis , 2008, IEEE Transactions on Evolutionary Computation.

[6]  Sung-Bae Cho,et al.  Prediction of colon cancer using an evolutionary neural network , 2004, Neurocomputing.

[7]  Jorng-Tzong Horng,et al.  An expert system to classify microarray gene expression data using gene selection by decision tree , 2009, Expert Syst. Appl..

[8]  Pedro Larrañaga,et al.  Filter versus wrapper gene selection approaches in DNA microarray domains , 2004, Artif. Intell. Medicine.

[9]  Jing Zhao,et al.  A Modified Ant Colony Optimization Algorithm for Tumor Marker Gene Selection , 2009, Genom. Proteom. Bioinform..

[10]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[11]  Musa H. Asyali,et al.  Gene expression profile class prediction using linear Bayesian classifiers , 2007, Comput. Biol. Medicine.

[12]  Marcel Dettling,et al.  BagBoosting for tumor classification with gene expression data , 2004, Bioinform..