From the Publisher:
The past decade has seen considerable theoretical and applied research on Markov decision processes, as well as the growing use of these models in ecology, economics, communications engineering, and other fields where outcomes are uncertain and sequential decision-making processes are needed. A timely response to this increased activity, Martin L. Puterman's new work provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models. It discusses all major research directions in the field, highlights many significant applications of Markov decision processes models, and explores numerous important topics that have previously been neglected or given cursory coverage in the literature. Markov Decision Processes focuses primarily on infinite horizon discrete time models and models with discrete time spaces while also examining models with arbitrary state spaces, finite horizon models, and continuous-time discrete state models. The book is organized around optimality criteria, using a common framework centered on the optimality (Bellman) equation for presenting results. The results are presented in a "theorem-proof" format and elaborated on through both discussion and examples, including results that are not available in any other book. A two-state Markov decision process model, presented in Chapter 3, is analyzed repeatedly throughout the book and demonstrates many results and algorithms. Markov Decision Processes covers recent research advances in such areas as countable state space models with average reward criterion, constrained models, and models with risk sensitive optimality criteria. It also explores several topics that have received little or no attention in other books, including modified policy iteration, multichain models with average reward criterion, and sensitive optimality. In addition, a Bibliographic Remarks section in each chapter comments on relevant historic
[1]
R. Bellman.
Dynamic programming.
,
1957,
Science.
[2]
D. J. White,et al.
Finite Dynamic Programming: An Approach to Finite Markov Decision Processes
,
1978
.
[3]
Martin L. Puterman,et al.
Dynamic Programming and Its Application
,
1979
.
[4]
R. Hartley,et al.
Optimisation Over Time: Dynamic Programming and Stochastic Control:
,
1983
.
[5]
Liliane Pintelon,et al.
Handbooks in operations research and management science volume 2: Stochastic models
,
1991
.
[6]
Stephen P. Brooks,et al.
Markov Decision Processes.
,
1995
.