The Integration of Reinforcements over Time

The title of this paper could as well have been “The Integration of Prey over Time.” We are concerned with a problem that is common to the experimental analysis of animal behavior as it occurs in Skinner boxes, and to the empirical investigation of foraging as it occurs in the animal’s natural environment. It is the problem of how a series of discrete events, deliveries of reinforcers or captures of prey, are transformed into an estimate of the density of reinforcements available on a schedule, or the density of prey present in a patch. All popular accounts both of foraging behavior and of behavior under schedules of reinforcement make use of the results of such an integration. Herrnstein’s’.’ matching law and “quantitative law of effect” predict that behavior under schedules of reinforcement will be a function of the reinforcement rates experienced under the schedules. Charnov’s’ marginal value theorem predicts that natural foraging behavior in a patchy environment will be a function of the rates a t which prey can be captured within each patch and within the environment as a whole. The functions concerned are different, but the problem is common to both. You or I , faced with the problem of estimating the rate a t which discrete events occur, would doubtless solve it by counting them, then dividing by the time that had elapsed, or the number of responses we had made. We have no evidence that animals can use this solution. So far as we can tell, the capacities of even the largest-brained birds to discriminate between numbers are quite m ~ d e s t . ~ (Other kinds of “counting” behavior, which appear to be much more a c c ~ r a t e , ” ~ turn out to be no different from the kind of noncounting mechanism we propose below.) We have no evidence at all that animals can manipulate numbers in the kind of way required for division. Besides, this solution ignores an additional, fundamental aspect of the problem: the density that is being estimated is liable to change. I n the laboratory we do, of course, change the schedules of reinforcement that our subjects experience. But, except i n the case of the transition to extinction, we very rarely study the effects of these transitions. Instead, we maintain the same conditions in force until we are sure we have achieved “steady-state” behavior, and look only at

[1]  R. R. Bush,et al.  A mathematical model for simple learning. , 1951, Psychological review.

[2]  R. R. Bush,et al.  A Mathematical Model for Simple Learning , 1951 .

[3]  J D Findley,et al.  Preference and Switching under Concurrent Scheduling. , 1958, Journal of the experimental analysis of behavior.

[4]  R J HERRNSTEIN,et al.  Relative and absolute strength of response as a function of frequency of reinforcement. , 1961, Journal of the experimental analysis of behavior.

[5]  M. Rilling Number of responses as a stimulus in fixed interval and fixed ratio schedules. , 1967, Journal of comparative and physiological psychology.

[6]  P. Killeen On the measurement of reinforcement frequency in the study of preference. , 1968, Journal of the experimental analysis of behavior.

[7]  R. Herrnstein On the law of effect. , 1970, Journal of the experimental analysis of behavior.

[8]  M. Cody Finch flocks in the Mohave desert. , 1971, Theoretical population biology.

[9]  G. Ainslie Specious reward: a behavioral theory of impulsiveness and impulse control. , 1975, Psychological bulletin.

[10]  Richard J. Herrnstein,et al.  MAXIMIZING AND MATCHING ON CONCURRENT RATIO SCHEDULES1 , 1975 .

[11]  E. Charnov Optimal foraging, the marginal value theorem. , 1976, Theoretical population biology.

[12]  A. Oaten,et al.  Optimal foraging in patches: a case for stochasticity. , 1977, Theoretical population biology.

[13]  P. Taylor,et al.  Test of optimal sampling by foraging great tits , 1978 .

[14]  S. Lea Foraging and reinforcement schedules in the pigeon: Optimal and non-optimal aspects of choice , 1979, Animal Behaviour.

[15]  La valeur adaptative des comportements aléatoires , 1979 .

[16]  W. Voxman,et al.  Predator Response to Variation of Prey Density in a Patchy Environment: A Model , 1979, The American Naturalist.

[17]  A Houston,et al.  The application of statistical decision theory to animal behaviour. , 1980, Journal of theoretical biology.

[18]  J. Myerson,et al.  The kinetics of choice: An operant systems analysis. , 1980 .

[19]  Stephen José Hanson,et al.  Central place foraging in Rattus norvegicus , 1981, Animal Behaviour.

[20]  Foraging in a simulated natural environment: There's a rat loose in the lab. , 1982, Journal of the experimental analysis of behavior.