论文信息 - Machine Learning 2: 199 228, 1987 © 1987 Kluwer Academic Publishers, Boston Manufactured in The Netherlands

Machine Learning 2: 199 228, 1987 © 1987 Kluwer Academic Publishers, Boston Manufactured in The Netherlands

This paper characterizes and investigates, from the perspective of machine learning and, particularly, classifier systems, the learning problem faced by animals and autonomous robots (here collectively termed animats). We suggest that, to survive in their environments, animats must in effect learn multiple disjunctive concepts incremen- tally under payoff (needs-satisfying) feedback. A review of machine learning techniques indicates that most relax at least one of these constraints. In theory, classifier systems satisfy the constraints, but tests have been limited. We show how the standard classifier system model applies to the animat learning problem. Then, in the experimental part of the paper, we specialize the model and test it in a problem environment satisfying the constraints and consisting of a difficult, disjunctive Boolean function drawn from the machine learning literature. Results include: learning the function in significantly fewer trials than a neural-network method; learning under payoff regimes that include both noisy payoff and partial reward for suboptimal performance; demonstration, in a classi- fier system, of a theoretically predicted property of genetic algorithms: the superiority of crossovers to point mutations; and automatic control of variation (search) rate based on system entropy. We conclude that the results support the classifier system approach to the animat problem, but suggest work aimed at the emergence of behavioral hierarchies of classifiers to offset slower learning rates in larger problems.

Stewart W. Wilson | Classifier Systems | Classifier Systems

[1] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[2] Ryszard S. Michalski,et al. Selection of Most Representative Training Examples and Incremental Generation of VL1 Hypotheses: The Underlying Methodology and the Description of Programs ESEL and AQ11 , 1978 .

[3] John J. Grefenstette. Proceedings of the First International Conference on Genetic Algorithms and their Applications, July 24-26, 1985, at the Carnegie-Mellon University, Pittsburgh, PA , 1988 .

[4] John H. Holland,et al. Cognitive systems based on adaptive algorithms , 1977, SGAR.

[5] J. Ross Quinlan,et al. Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[6] Kenneth Alan De Jong,et al. An analysis of the behavior of a class of genetic adaptive systems. , 1975 .

[7] John H. Holland,et al. Properties of the bucket brigade algorithm , 1985 .

[8] Lashon B. Booker,et al. Improving the Performance of Genetic Algorithms in Classifier Systems , 1985, ICGA.

[9] C. D. Gelatt,et al. Optimization by Simulated Annealing , 1983, Science.

[10] John H. Holland,et al. COGNITIVE SYSTEMS BASED ON ADAPTIVE ALGORITHMS1 , 1978 .

[11] R. Michalski. Understanding the Nature of Learning: Issues and Research Directions , 1985 .