Efficient Evolution of Accurate Classification Rules Using a Combination of Gene Expression Programming and Clonal Selection

A hybrid evolutionary technique is proposed for data mining tasks. It combines a principle inspired by the immune system, the clonal selection principle, with a more common and very efficient evolutionary technique, gene expression programming (GEP). The clonal selection principle regulates the immune response so that any foreign antigen is successfully recognized and confronted, and it also allows the immune response to improve across successive appearances of the same antigen. Gene expression programming, in turn, descends from genetic algorithms and genetic programming; it eliminates their main disadvantages, such as the genotype-phenotype coincidence, while preserving their advantageous features. To perform the data mining task, the proposed algorithm introduces the notion of a data class antigen, which represents a class of data. The produced rules are evolved by our clonal selection algorithm (CSA), which extends the recently proposed CLONALG algorithm; among other new features, CSA incorporates a receptor editing step. The rules themselves are represented as antibodies coded as GEP chromosomes, in order to exploit the flexibility and expressiveness of this encoding. The proposed hybrid technique is tested on a set of benchmark problems in comparison to GEP. In almost all problems considered, the results are very satisfactory, and the technique outperforms conventional GEP in terms of both prediction accuracy and computational efficiency.
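The clonal selection loop described above can be illustrated with a minimal Python sketch. It is not the authors' CSA. The antibody encoding is a deliberately simplified conjunction of threshold tests rather than a GEP chromosome, receptor editing is modelled (as an assumption) by replacing the lowest-affinity antibodies with fresh random ones, and every function name, parameter, and default value is hypothetical.

```python
import random

# Illustrative sketch only. An antibody (rule) is a list of conditions
# (feature_index, threshold, greater) joined by AND. This is a simplified
# stand-in for the GEP chromosome encoding, chosen to keep the example short.

def rule_matches(rule, x):
    """True if every condition of the rule holds for feature vector x."""
    return all((x[i] > t) == gt for i, t, gt in rule)

def affinity(rule, antigen):
    """Accuracy of the rule on a data class antigen (target_class, examples)."""
    target, examples = antigen
    hits = sum(rule_matches(rule, x) == (y == target) for x, y in examples)
    return hits / len(examples)

def random_rule(rng, n_features, max_conds=2):
    """Draw a random rule with 1..max_conds conditions on features in [0, 1]."""
    return [(rng.randrange(n_features), rng.random(), rng.random() < 0.5)
            for _ in range(rng.randint(1, max_conds))]

def mutate(rule, rng, rate, n_features):
    """Hypermutation: perturb each part of each condition with probability rate."""
    mutated = []
    for i, t, gt in rule:
        if rng.random() < rate:
            i = rng.randrange(n_features)
        if rng.random() < rate:
            t = min(1.0, max(0.0, t + rng.gauss(0.0, 0.2)))
        if rng.random() < rate:
            gt = not gt
        mutated.append((i, t, gt))
    return mutated

def clonal_selection(antigen, n_features, pop_size=20, n_select=5,
                     clone_factor=2.0, n_edit=3, generations=30, seed=0):
    """CLONALG-style loop; returns the best rule and the best affinity per generation."""
    rng = random.Random(seed)
    fitness = lambda r: affinity(r, antigen)
    pop = [random_rule(rng, n_features) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))
        clones = []
        for rank, parent in enumerate(pop[:n_select], start=1):
            # Better-ranked antibodies receive more clones...
            n_clones = max(1, round(clone_factor * pop_size / rank))
            # ...and are mutated less (rate shrinks as affinity grows).
            rate = max(0.05, 1.0 - fitness(parent))
            clones += [mutate(parent, rng, rate, n_features)
                       for _ in range(n_clones)]
        # Elitist reselection from parents and clones.
        pool = sorted(pop + clones, key=fitness, reverse=True)
        # Assumed receptor editing: refill the last n_edit slots with new rules.
        pop = pool[:pop_size - n_edit] + [random_rule(rng, n_features)
                                          for _ in range(n_edit)]
    return max(pop, key=fitness), history

if __name__ == "__main__":
    # Toy data (hypothetical): class 1 exactly when the first feature exceeds 0.5.
    data_rng = random.Random(42)
    points = [(data_rng.random(), data_rng.random()) for _ in range(100)]
    examples = [(x, 1 if x[0] > 0.5 else 0) for x in points]
    best, history = clonal_selection((1, examples), n_features=2)
    print(best, affinity(best, (1, examples)))
```

Because the best antibodies always survive reselection, the best affinity in this sketch cannot decrease from one generation to the next. Replacing the simplified rule encoding with GEP chromosomes would change only `random_rule`, `mutate`, and `rule_matches`.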
