Why and How to Learn Why: Analysis-Based Generalization of Procedures

Max Wertheimer, in his classic Productive Thinking, linked understanding to transfer: Understanding is important because it provides the ability to generalize the solution of one problem to apply to another. Recent work in human and machine learning has led to the development of a new class of generalization mechanism, called here analysis-based generalization, which can be used to provide a concrete account of the linkage Wertheimer suggested: these mechanisms all, in different ways, use understanding of examples in the generalization process. In this paper I review this class of mechanism, and describe a method for causal attribution that can produce the analyses of examples that the generalization methods require, in the domain of simple procedures in human-computer interaction. This causal analysis method is linked with analysis-based generalization to form EXPL, an implemented model which is a concrete, though limited, instontiation of Wertheimer s scheme. EXPL constructs an understanding of an example procedure and generalizes it on the basis of that understanding. Results of an empirical study suggest that some of EXPL's attribution heuristics are used by people, and that while a subclass of analysis-based methods, called superstitious methods, seem to provide a more plausible account of people's generalization under the conditions of the study than a contrasting class of rationalistic methods, at least some participants appear to use methods from both classes. The results also show that explanation-based methods, which rely on comprehensive domain theories, must be used in conjunction with a means for extending the domain theory. If thus enhanced, explanation-based methods are able to mimic the effects of other analysis-based methods, and can provide a good account of the data, though combinations of other methods must also be considered. Finally, I return to Wertheimer s ideas to argue that none of the current analysis-based generalization methods fully captures Wertheimer s notion of understanding. Proper choice among different possible analyses of an example is crucial for Wertheimer, but I argue that this problem may be beyond the reach of learning systems.

[1]  T. Shultz,et al.  Similarity as a Principle of Causal Inference. , 1977 .

[2]  Michael R. Lowry,et al.  Learning Physical Descriptions From Functional Definitions, Examples, and Precedents , 1983, AAAI.

[3]  Clayton Lewis,et al.  Understanding what's happening in system interactions , 1986 .

[4]  James G. Greeno,et al.  Understanding and procedural knowledge in mathematics instruction 1 , 1978 .

[5]  John R. Anderson,et al.  Use of analogy in a production system architecture , 1989 .

[6]  John M. Carroll,et al.  Learning to use word processors: problems and prospects , 1983, TOIS.

[7]  W. Friedman The Developmental psychology of time , 1982 .

[8]  P. Pirolli,et al.  The role of learning from examples in the acquisition of recursive programming skills. , 1985 .

[9]  R. Greenberg Biometry , 1969, The Yale Journal of Biology and Medicine.

[10]  Patrick Henry Winston,et al.  Learning and reasoning by analogy , 1980, CACM.

[11]  Gerald DeJong,et al.  Generalizations Based on Explanations , 1981, IJCAI.

[12]  Gerald DeJong,et al.  Acquiring Schemata Through Understanding and Generalizing Plans , 1983, IJCAI.

[13]  Thomas G. Dietterich,et al.  A Comparative Review of Selected Methods for Learning from Examples , 1983 .

[14]  M. Scheerer,et al.  Problem Solving , 1967, Nature.

[15]  Peter L. Pirolli,et al.  Problem solving by analogy and skill acquisition in the domain of programming , 1985 .

[16]  Stuart J. Russell Analogy and Single-Instance Generalization , 1987 .

[17]  Patrick Henry Winston,et al.  Learning New Principles from Precedents and Exercises , 1982, Artif. Intell..

[18]  Donald A. Norman,et al.  User Centered System Design: New Perspectives on Human-Computer Interaction , 1988 .

[19]  Ryszard S. Michalski,et al.  Machine learning: an artificial intelligence approach volume III , 1990 .

[20]  C. Lewis A model of mental model construction , 1986, CHI '86.

[21]  John R. Anderson Causal Analysis and Inductive Learning , 1987 .

[22]  Dedre Gentner,et al.  Structure-Mapping: A Theoretical Framework for Analogy , 1983, Cogn. Sci..

[23]  Clayton Lewis,et al.  Learning to use a text processing system: Evidence from “thinking aloud” protocols , 1982, CHI '82.

[24]  M. Pazzani Inducing Causal and Social Theories: A Prerequisite for Explanation-based Learning , 1987 .