Symbiotic coevolutionary genetic programming: a benchmarking study under large attribute spaces

Classification under large attribute spaces represents a dual learning problem in which attribute subspaces need to be identified at the same time as the classifier design is established. Embedded as opposed to filter or wrapper methodologies address both tasks simultaneously. The motivation for this work stems from the observation that team based approaches to Genetic Programming (GP) have the potential to design multiple classifiers per class—each with a potentially unique attribute subspace—without recourse to filter or wrapper style preprocessing steps. Specifically, competitive coevolution provides the basis for scaling the algorithm to data sets with large instance counts; whereas cooperative coevolution provides a framework for problem decomposition under a bid-based model for establishing program context. Symbiosis is used to separate the tasks of team/ensemble composition from the design of specific team members. Team composition is specified in terms of a combinatorial search performed by a Genetic Algorithm (GA); whereas the properties of individual team members and therefore subspace identification is established under an independent GP population. Teaming implies that the members of the resulting ensemble of classifiers should have explicitly non-overlapping behaviour. Performance evaluation is conducted over data sets taken from the UCI repository with 649–102,660 attributes and 2–10 classes. The resulting teams identify attribute spaces 1–4 orders of magnitude smaller than under the original data set. Moreover, team members generally consist of less than 10 instructions; thus, small attribute subspaces are not being traded for opaque models.

[1]  Jason Weston,et al.  Large-scale kernel machines , 2007 .

[2]  Foster J. Provost,et al.  Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction , 2003, J. Artif. Intell. Res..

[3]  Terence Soule,et al.  Novel ways of improving cooperation and performance in ensemble classifiers , 2007, GECCO '07.

[4]  Rajeev Kumar,et al.  Evolution of hyperheuristics for the biobjective 0/1 knapsack problem by multiobjective genetic programming , 2008, GECCO '08.

[5]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[6]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[7]  Larry Bull,et al.  Genetic Programming with a Genetic Algorithm for Feature Construction and Selection , 2005, Genetic Programming and Evolvable Machines.

[8]  R. Watson,et al.  Pareto coevolution: using performance against coevolved opponents in a game as dimensions for Pareto selection , 2001 .

[9]  Risto Miikkulainen,et al.  Forming Neural Networks Through Efficient and Adaptive Coevolution , 1997, Evolutionary Computation.

[10]  Christian Gagné,et al.  Ensemble learning for free with evolutionary algorithms? , 2007, GECCO '07.

[11]  William B. Langdon,et al.  GP on SPMD parallel graphics hardware for mega Bioinformatics data mining , 2008, Soft Comput..

[12]  Jordan B. Pollack,et al.  Pareto Optimality in Coevolutionary Learning , 2001, ECAL.

[13]  Jason Weston,et al.  Fast Kernel Learning with Sparse Inverted Index , 2007 .

[14]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[15]  Richard K. Belew,et al.  New Methods for Competitive Coevolution , 1997, Evolutionary Computation.

[16]  Andrew R. McIntyre,et al.  Pareto Cooperative-Competitive Genetic Programming: A Classification Benchmarking Study , 2009 .

[17]  Kenneth A. De Jong,et al.  Cooperative Coevolution: An Architecture for Evolving Coadapted Subcomponents , 2000, Evolutionary Computation.

[18]  Yang Zhang,et al.  A Generic Multi-dimensional Feature Extraction Method Using Multiobjective Genetic Programming , 2009, Evolutionary Computation.

[19]  Yang Zhang,et al.  Feature Extraction Using Multi-Objective Genetic Programming , 2006, Multi-Objective Machine Learning.

[20]  Xin Yao,et al.  Evolutionary ensembles with negative correlation learning , 2000, IEEE Trans. Evol. Comput..

[21]  Huanhuan Chen,et al.  Trade-Off Between Diversity and Accuracy in Ensemble Generation , 2006, Multi-Objective Machine Learning.

[22]  Patrick Haffner,et al.  Scaling large margin classifiers for spoken language understanding , 2006, Speech Commun..

[23]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[24]  Malcolm I. Heywood,et al.  Managing team-based problem solving with symbiotic bid-based genetic programming , 2008, GECCO '08.

[25]  John A. Doucette,et al.  Evolving Coevolutionary Classifiers Under Large Attribute Spaces , 2010 .

[26]  Steve R. Gunn,et al.  Design and Analysis of the NIPS2003 Challenge , 2006, Feature Extraction.

[27]  Wolfgang Banzhaf,et al.  Evolving Teams of Predictors with Linear Genetic Programming , 2001, Genetic Programming and Evolvable Machines.

[28]  John Cartlidge,et al.  Autonomous Virulence Adaptation Improves Coevolutionary Optimization , 2011, IEEE Transactions on Evolutionary Computation.

[29]  Malcolm I. Heywood,et al.  Coevolutionary bid-based genetic programming for problem decomposition in classification , 2008, Genetic Programming and Evolvable Machines.

[30]  Andrew R. McIntyre,et al.  MOGE: GP classification problem decomposition using multi-objective optimization , 2006, GECCO '06.

[31]  Lynn Margulis,et al.  Symbiosis as a source of evolutionary innovation : speciation and morphogenesis , 1991 .

[32]  Wolfgang Banzhaf,et al.  A hierarchical cooperative evolutionary algorithm , 2010, GECCO '10.

[33]  Krzysztof Krawiec,et al.  Genetic Programming-based Construction of Features for Machine Learning and Knowledge Discovery Tasks , 2002, Genetic Programming and Evolvable Machines.

[34]  Pat Langley,et al.  Machine learning as an experimental science , 2004, Machine Learning.

[35]  Andrew R. McIntyre,et al.  Cooperative Problem Decomposition in Pareto Competitive Classifier Models of Coevolution , 2008, EuroGP.

[36]  Francisco Herrera,et al.  A Survey on the Application of Genetic Programming to Classification , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[37]  Malcolm I. Heywood,et al.  GP Classification under Imbalanced Data sets: Active Sub-sampling and AUC Approximation , 2008, EuroGP.

[38]  Wolfgang Banzhaf,et al.  A comparison of linear genetic programming and neural networks in medical data mining , 2001, IEEE Trans. Evol. Comput..

[39]  M. Heywood,et al.  Classification as Clustering: A Pareto Cooperative-Competitive GP Approach , 2011, Evolutionary Computation.

[40]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[41]  Thomas H. Westerdale,et al.  Local Reinforcement and Recombination in Classifier Systems , 2001, Evolutionary Computation.

[42]  Yaochu Jin,et al.  Multi-Objective Machine Learning , 2006, Studies in Computational Intelligence.

[43]  Chris. Drummond,et al.  Machine Learning as an Experimental Science ( Revisited ) ∗ , 2006 .

[44]  Malcolm I. Heywood,et al.  Symbiogenesis as a Mechanism for Building Complex Adaptive Systems: A Review , 2010, EvoApplications.

[45]  Malcolm I. Heywood,et al.  The Rubik cube and GP Temporal Sequence learning: An initial study , 2011 .

[46]  Edwin D de Jong A monotonic archive for pareto-coevolution. , 2007, Evolutionary computation.