A Family of GEP-Induced Ensemble Classifiers

The paper proposes applying Gene Expression Programming (GEP) to induce ensemble classifiers. Four algorithms for inducing such classifiers are proposed. The first, denoted GEPA, is based on the AdaBoost method and is specific to two-class problems. The second, denoted MV, is based on majority voting. The third, denoted MVI, assumes incremental learning, in which some classes may require more genes than others. The last, denoted MVC, partitions the training dataset into clusters prior to expression tree induction. The proposed algorithms were validated experimentally on several datasets.
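As a rough illustration of the majority-voting idea behind MV, the sketch below combines several weak classifiers by taking the most frequent predicted label. It is not the paper's implementation: the name `majority_vote` and the three threshold rules in `weak` are hypothetical stand-ins for GEP-induced expression trees, and ties are broken in favor of the label that was voted for first.

```python
from collections import Counter
from typing import Callable, Hashable, Sequence

# A weak classifier maps a feature vector to a class label.
Classifier = Callable[[Sequence[float]], Hashable]


def majority_vote(classifiers: Sequence[Classifier], x: Sequence[float]) -> Hashable:
    """Return the label predicted by the largest number of classifiers."""
    votes = Counter(clf(x) for clf in classifiers)
    # most_common keeps first-seen order among equal counts,
    # so a tie goes to the label that received a vote first.
    return votes.most_common(1)[0][0]


# Hypothetical stand-ins for induced expression trees (assumption, not from the paper).
weak = [
    lambda x: 1 if x[0] > 0.5 else 0,
    lambda x: 1 if x[1] > 0.5 else 0,
    lambda x: 1 if x[0] + x[1] > 1.2 else 0,
]

if __name__ == "__main__":
    print(majority_vote(weak, [0.9, 0.8]))
    print(majority_vote(weak, [0.9, 0.1]))
```

Individual members of the ensemble may disagree on a given input; only the label with the most votes is returned.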
