Learning Sequential Decision Rules Using Simulation Models and Competition

This paper addresses the problem of learning decision rules for sequential tasks, focusing on learning tactical decision rules from a simple flight simulator. The learning method relies on the notion of competition and employs genetic algorithms to search the space of decision policies. Several experiments address issues that arise when the simulation model on which learning occurs differs from the target environment on which the decision rules are ultimately tested.
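
The general idea of searching policy space with a genetic algorithm on a simulation model, then testing the result on a different target environment, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy task (`make_env`), the policy encoding as one action per state, the operators in `evolve` (truncation selection, one-point crossover, elitism), and all parameter values are hypothetical and are not the system or the flight simulator described in the paper.

```python
import random

N_STATES, N_ACTIONS = 6, 3  # hypothetical toy task sizes


def make_env(safe_actions):
    """Build an episode evaluator; safe_actions[s] is the only action that survives state s."""
    def run_episode(policy):
        steps = 0
        for s in range(N_STATES):            # states are visited in sequence
            if policy[s] != safe_actions[s]:
                break                        # a wrong action ends the episode
            steps += 1
        return steps / N_STATES              # payoff reflects the whole sequence
    return run_episode


def evolve(fitness, rng, pop_size=30, generations=40, p_mut=0.1):
    """Plain generational GA over policies (one action index per state)."""
    pop = [[rng.randrange(N_ACTIONS) for _ in range(N_STATES)]
           for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        parents = ranked[: pop_size // 2]    # truncation selection
        children = [ranked[0][:]]            # elitism: keep the best policy unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, N_STATES)
            child = a[:cut] + b[cut:]        # one-point crossover
            child = [rng.randrange(N_ACTIONS) if rng.random() < p_mut else g
                     for g in child]         # per-gene mutation
            children.append(child)
        pop = children
    return max(pop, key=fitness)


# The simulation model and the target environment disagree in the last state.
sim_model = make_env([0, 1, 2, 0, 1, 2])
target_env = make_env([0, 1, 2, 0, 1, 1])

best = evolve(sim_model, random.Random(0))   # learn on the simulation model only
sim_score = sim_model(best)
target_score = target_env(best)              # test on the target environment
print(best, sim_score, target_score)
```

Comparing `sim_score` with `target_score` shows how a policy tuned to the model can score differently once the environment departs from it, which is the kind of mismatch the experiments examine.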
