Parallel Cooperative Classifier Systems

Learning and adaptation are essential capabilities for intelligent arti cial systems that operate autonomously in complex and uncertain environments. Classi er Systems (CSs) exploit genetic algorithms to learn behavioral strategies, in terms of a set of condition/action rules, from the direct interaction either with powerful simulations of the task environment or with the environment itself. Arti cial evolution is however a computationally expensive process, which may prevent the application of CSs to real world tasks. The wide availability of parallel architectures and parallel programming environments suggests parallelism as a viable approach to speed up learning. This thesis proposes and investigates a parallel model for CSs, Parallel Cooperative Classi er Systems (PCCSs), where a number of CSs learn in parallel the same task and cooperate by exchanging information. We discuss the di erences between a parallel implementation of a CS and parallel cooperative learning. We then focus on the new behaviors and learning capabilities resulting from parallel cooperative learning, and on the learning speed up induced by the parallel model with respect to a simple sequential model of reference. To study PCCSs, we identify a number of design choices, and explore their implications on the cooperation among the CSs as well as on the mapping of PCCSs on parallel architectures. The thesis emphasizes how the e ectiveness of the cooperation among the CSs strongly depends on its ability to integrate the knowledge acquired by distinct CSs, so that each CS can exploit what the other CSs have already learnt to focus its own learning. To improve cooperation, we de ne and experimentally evaluate a number of specialized cooperation mechanisms. We also show how a parallel CS model permits the use in CSs of more powerful learning mechanisms but with a large computational cost. An example is the Q-Credit Assignment, described and analysed in this thesis, which is able to evaluate rules whose application may have di erent outcomes depending on the context where they re. Acknowledgments First, I would like to thank my supervisor, Antonina Starita, for having introduced me to the eld of evolutionary computation and for her constant friendship, support, and encouragement during my undergraduate and graduate studies. I'm also greatly indebted to Fabrizio Baiardi, for always being ready to listen to, discuss, and suggest ideas, and for always having promptly read and improved my invariably late writings. We developed together most of the ideas and results in this thesis. I wish to thank Marco Vanneschi and Franco Turini, for agreeing to be supervisor and member of my thesis committee, respectively. Their comments and criticisms have undoubtedly contributed to improve this work and stimulated me to see my research in a wider perspective. I would like to express my gratitude to Stewart Wilson and Colin Reeves, who accepted to be my external referees, for their insightful comments and careful suggestions on a rst draft of this work. Furthermore, I sincerely thank the undergraduate students I've collaborated with and whose work has signi cantly contributed to this thesis: Paola Gambini, Marco Manetti, Massimiliano Porcu, and Andrea Sticca. I also wish to express my gratitude to all the people at the Computer Science Department who have made my doctoral years so enjoyable: all the technical and administrative sta , for their kindness in solving my uncountable problems; the colleagues with whom I've shared my room at the department and all the other graduate students, for their pleasantness and their always prompt answers to my technical questions; the friends of the Parallel Architectures Lab, for the merry hours we have spent together. A very special thank goes to Davide, for his invaluable support, his constant willingness to help me in whatever I needed, his always being an example to me for his reliability, integrity, and commitment. It's almost impossible to nd the words to thank my parents, Franco and Liliana. What I've done, what I am, I owe it to their dedication, love, and support in every sense. Thanks also to my little nephew Valentino, whose cheerfulness has been an enjoyable background music while I was working on this manuscript. Most of all, my greatest thanks go to Giuliano, for having always helped and supported me, for his faith in me, and for having put up with my ups and downs, with immeasurable friendship and love.

[1]  Robert E. Smith,et al.  Is a Learning Classifier System a Type of Neural Network? , 1994, Evolutionary Computation.

[2]  Fabrizio Baiardi,et al.  On Dropping Niches in Parallel Niching Genetic Algorithms , 1998 .

[3]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[4]  Samir W. Mahfoud Crowding and Preselection Revisited , 1992, PPSN.

[5]  J. David Schaffer,et al.  An Adaptive Crossover Distribution Mechanism for Genetic Algorithms , 1987, ICGA.

[6]  Marco Dorigo,et al.  Genetic and Non-Genetic Operators in ALECSYS , 1993, Evolutionary Computation.

[7]  W. Daniel Hillis,et al.  The connection machine , 1985 .

[8]  Long-Ji Lin,et al.  Self-improving reactive agents: case studies of reinforcement learning frameworks , 1991 .

[9]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[10]  Erick Cantú-Paz,et al.  A Summary of Research on Parallel Genetic Algorithms , 1995 .

[11]  Thomas Bäck,et al.  An Overview of Evolutionary Algorithms for Parameter Optimization , 1993, Evolutionary Computation.

[12]  Bernard Manderick,et al.  Fine-Grained Parallel Genetic Algorithms , 1989, ICGA.

[13]  Yuval Davidor,et al.  A Naturally Occurring Niche and Species Phenomenon: The Model and First Results , 1991, ICGA.

[14]  A. Barto,et al.  Learning and Sequential Decision Making , 1989 .

[15]  Gary McGraw,et al.  Emergent Control and Planning in an Autonomous Vehicle , 1993 .

[16]  John H. Holland,et al.  Properties of the bucket brigade algorithm , 1985 .

[17]  John H. Holland,et al.  Empirical studies of default hierarchies and sequences of rules in learning classifier systems , 1988 .

[18]  Marco Dorigo,et al.  A comparison of Q-learning and classifier systems , 1994 .

[19]  Larry J. Eshelman,et al.  Biases in the Crossover Landscape , 1989, ICGA.

[20]  R. Andrew Hidden State and Reinforcement Learning with Instance-Based State Identification , 1996 .

[21]  Richard S. Sutton,et al.  Temporal credit assignment in reinforcement learning , 1984 .

[22]  Filippo Neri,et al.  A Parallel Genetic Algorithm for Concept Learning , 1995, ICGA.

[23]  Theodore C. Belding,et al.  The Distributed Genetic Algorithm Revisited , 1995, ICGA.

[24]  Dana S. Richards,et al.  Punctuated Equilibria: A Parallel Genetic Algorithm , 1987, ICGA.

[25]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[26]  L. Darrell Whitley,et al.  Serial and Parallel Genetic Algorithms as Function Optimizers , 1993, ICGA.

[27]  Peter J. Angeline,et al.  An evolutionary algorithm that constructs recurrent neural networks , 1994, IEEE Trans. Neural Networks.

[28]  Astro Teller,et al.  The evolution of mental models , 1994 .

[29]  Stefano Nolfi,et al.  How to Evolve Autonomous Robots: Different Approaches in Evolutionary Robotics , 1994 .

[30]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[31]  W. Lovejoy A survey of algorithmic methods for partially observed Markov decision processes , 1991 .

[32]  Kenneth A. De Jong,et al.  A Cooperative Coevolutionary Approach to Function Optimization , 1994, PPSN.

[33]  Cosimo Anglano,et al.  Coevolutionary, Distributed Search for Inducing Concept Description , 1998, ECML.

[34]  Reinhard Männer,et al.  Implementation of Standard Genetic Algorithm on MIMD Machines , 1994, PPSN.

[35]  George G. Robertson,et al.  Parallel Implementation of Genetic Algorithms in a Classifier Rystem , 1987, ICGA.

[36]  Sandip Sen,et al.  Newboole: A Fast GBML System , 1990, ML.

[37]  James E. Baker,et al.  Reducing Bias and Inefficienry in the Selection Algorithm , 1987, ICGA.

[38]  James D. Johannes,et al.  Hierarchical exemplar based credit allocation for genetic classifier systems , 1998 .

[39]  Martina Gorges-Schleuter,et al.  ASPARAGOS An Asynchronous Parallel Genetic Optimization Strategy , 1989, ICGA.

[40]  Stephen F. Smith,et al.  Flexible Learning of Problem Solving Heuristics Through Adaptive Search , 1983, IJCAI.

[41]  John J. Grefenstette,et al.  ROBOT LEARNING WITH PARALLEL GENETIC ALGORITHMS ON NETWORKED COMPUTERS , 1995 .

[42]  Marco Colombetti,et al.  Robot Shaping: Developing Autonomous Agents Through Learning , 1994, Artif. Intell..

[43]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[44]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[45]  Stephen F. Smith,et al.  A learning system based on genetic adaptive algorithms , 1980 .

[46]  Paul J. Werbos,et al.  Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[47]  Thomas Bäck,et al.  Selective Pressure in Evolutionary Algorithms: A Characterization of Selection Mechanisms , 1994, International Conference on Evolutionary Computation.

[48]  Kari Karhunen,et al.  Über lineare Methoden in der Wahrscheinlichkeitsrechnung , 1947 .

[49]  Larry Bull,et al.  Evolution in Multi-agent Systems: Evolving Communicating Classifier Systems for Gait in a Quadrupedal Robot , 1995, ICGA.

[50]  Kalyanmoy Deb,et al.  A Comparative Analysis of Selection Schemes Used in Genetic Algorithms , 1990, FOGA.

[51]  Geoffrey E. Hinton,et al.  How Learning Can Guide Evolution , 1996, Complex Syst..

[52]  Jeffrey Horn,et al.  The nature of niching: genetic algorithms and the evolution of optimal, cooperative populations , 1997 .

[53]  John H. Holland,et al.  Cognitive systems based on adaptive algorithms , 1977, SGAR.

[54]  Marco Dorigo,et al.  Alecsys: A Parallel Laboratory for Learning Classifier Systems , 1991, ICGA.

[55]  Dana S. Richards,et al.  A Multi-Population Genetic Algorithm for Solving the K-Partition Problem on Hyper-Cubes , 1991, ICGA.

[56]  Pier Luca Lanzi,et al.  A Study of the Generalization Capabilities of XCS , 1997, ICGA.

[57]  Hitoshi Iba,et al.  Genetic Programming 1998: Proceedings of the Third Annual Conference , 1999, IEEE Trans. Evol. Comput..

[58]  L. Darrell Whitley,et al.  Lamarckian Evolution, The Baldwin Effect and Function Optimization , 1994, PPSN.

[59]  Vasant Honavar,et al.  Evolutionary Design of Neural Architectures , 1995 .

[60]  John H. Holland,et al.  Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .

[61]  Jim Smith,et al.  Self adaptation of mutation rates in a steady state genetic algorithm , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[62]  Dimitri P. Bertsekas,et al.  Dynamic Programming: Deterministic and Stochastic Models , 1987 .

[63]  Antonina Starita,et al.  Q-Learning and Parallelism in Evolutionary Rule Based Systems , 1995, ICANNGA.

[64]  Antonina Starita,et al.  PANIC: A Parallel Evolutionary Rule Based System , 1995, Evolutionary Programming.

[65]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[66]  Takashi Gomi,et al.  Evolutionary robotics-an overview , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[67]  Pier Luca Lanzi Adding memory to XCS , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[68]  Lashon B. Booker,et al.  Intelligent Behavior as an Adaptation to the Task Environment , 1982 .

[69]  Richard K. Belew,et al.  Evolving networks: using the genetic algorithm with connectionist learning , 1990 .

[70]  Donald A. Waterman,et al.  Generalization Learning Techniques for Automating the Learning of Heuristics , 1970, Artif. Intell..

[71]  Lashon B. Booker,et al.  Improving the Performance of Genetic Algorithms in Classifier Systems , 1985, ICGA.

[72]  Samir W. Mahfoud Niching methods for genetic algorithms , 1996 .

[73]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[74]  Marco Tomassini,et al.  Distributed Genetic Algorithms with an Application to Portfolio Selection Problems , 1995, ICANNGA.

[75]  Isabelle Guyon Applications of Neural Networks to Character Recognition , 1991, Int. J. Pattern Recognit. Artif. Intell..

[76]  L. Darrell Whitley,et al.  Adding Learning to the Cellular Development of Neural Networks: Evolution and the Baldwin Effect , 1993, Evolutionary Computation.

[77]  Jürgen Schmidhuber,et al.  Reinforcement Learning in Markovian and Non-Markovian Environments , 1990, NIPS.

[78]  J. Holmes Differential Negative Reinforcement Improves Classifier System Learning Rate in Two-Class Problems with Unequal Base Rates , 1990 .

[79]  Stewart W. Wilson Bid Competition and Specificity Reconsidered , 1988, Complex Syst..

[80]  Antonina Starita,et al.  Q-Learning in Evolutionary Rule Based Systems , 1994, PPSN.

[81]  John J. Grefenstette,et al.  A Parallel Genetic Algorithm , 1987, ICGA.

[82]  John J. Grefenstette,et al.  Lamarckian Learning in Multi-Agent Environments , 1991, ICGA.

[83]  William M. Spears,et al.  Adapting Crossover in Evolutionary Algorithms , 1995, Evolutionary Programming.

[84]  SchwefelHans-Paul,et al.  An overview of evolutionary algorithms for parameter optimization , 1993 .

[85]  John J. Grefenstette,et al.  Multilevel Credit Assignment in a Genetic Learning System , 1987, International Conference on Genetic Algorithms.

[86]  David E. Goldberg,et al.  Genetic Algorithms with Sharing for Multimodalfunction Optimization , 1987, ICGA.

[87]  Bernard Manderick,et al.  A Massively Parallel Genetic Algorithm: Implementation and First Analysis , 1991, ICGA.

[88]  Leslie Pack Kaelbling,et al.  Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.

[89]  TesauroGerald Practical Issues in Temporal Difference Learning , 1992 .

[90]  Takeshi Yamada,et al.  The ECOlogical Framework II : Improving GA Performance At Virtually Zero Cost , 1993, ICGA.

[91]  Andrew McCallum,et al.  Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State , 1995, ICML.

[92]  Markus Schwehm,et al.  A Massively Parallel Genetic Algorithm on the MasPar MP-1 , 1993 .

[93]  Antonina Starita,et al.  Q-Learning and Redundancy Reduction in Classifier Systems with Internal State , 1998, ECML.

[94]  L. Darrell Whitley,et al.  The GENITOR Algorithm and Selection Pressure: Why Rank-Based Allocation of Reproductive Trials is Best , 1989, ICGA.

[95]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[96]  Lawrence J. Fogel,et al.  Artificial Intelligence through Simulated Evolution , 1966 .

[97]  Steven Douglas Whitehead,et al.  Reinforcement learning for the adaptive control of perception and action , 1992 .

[98]  John J. Grefenstette,et al.  A Coevolutionary Approach to Learning Sequential Decision Rules , 1995, ICGA.

[99]  Reiko Tanese,et al.  Distributed Genetic Algorithms , 1989, ICGA.

[100]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[101]  Stewart W. Wilson Hierarchical Credit Allocation in a Classifier System , 1987, IJCAI.

[102]  Chrisila C. Pettey,et al.  A Theoretical Investigation of a Parallel Genetic Algorithm , 1989, ICGA.