ADAPTIVE CONTROL OF MARKOV CHAINS: A SURVEY

Abstract Adaptive policies are computationally attractive procedures for controlling systems whose dynamics involve unknown parameters. The paper organizes a survey of the literature in terms of the convergence properties of proposed adaptive policies and the restrictions imposed on the unknown parameters. Suggestions for future research are summarized in the conclusion.