Non-Markovian Policies in Sequential Decision Problems

In this article we prove the validity of the Dellman Optimality Equa­ tion a.nd related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also non-Markovian policies are taken into account. The theory is moti­ vated by some experiments with a learning robot.