This paper presents a simple model of the decision problems which face many animals in the wild. Animals have an upper limit to the amount of energy reserves which can be stored as body fat. They constantly use energy and find food as a stochastic process. Death occurs if energy reserves fall to zero or if the animal is taken by a predator. Typically animals have a range of behavioural options which differ in the distribution of food gained and in the associated predation risk. There is often a trade-off between starvation and predation in that animals have to expose themselves to higher predation risks in order to gain more food. The above situation is modelled as a finite-state, finite-time-horizon Markov decision problem. The policy which maximises long-term survival probability is characterised. As special cases two trade-offs are analysed. It is shown that an animal should take fewer risks in terms of predation as its reserves increase, and that an animal should reduce the variability of its food supply at the expense of its mean gain as its reserves increase.
[1]
E. Seneta,et al.
THE THEORY OF NON-NEGATIVE MATRICES IN A DYNAMIC PROGRAMMING PROBLEM
,
1969
.
[2]
Karel Sladký,et al.
On dynamic programming recursions for multiplicative Markov decision chains
,
1976
.
[3]
D. P. Kennedy,et al.
On sets of countable non-negative matrices and Markov decision processes
,
1978,
Advances in Applied Probability.
[4]
D. R. Grey.
Non-negative matrices, dynamic programming and a harvesting problem
,
1984
.
[5]
Kathryn J. Schneider.
Dominance, predation, and optimal foraging in white-throated sparrow flocks
,
1984
.
[6]
A. Houston,et al.
The Common Currency for Behavioral Decisions
,
1986,
The American Naturalist.