Optimal Stopping in a Partially Observable Markov Process with Costly Information

A problem of optimal stopping in a Markov chain whose states are not directly observable is presented. Using the theory of partially observable Markov decision processes, a model is developed that combines the classical stopping problem with sequential sampling at each stage of the decision process. Several results are given that characterize the optimal expected value function in terms of the model's parameters. An example indicates that the best action, as a function of the information currently available, may not be of the intuitively appealing control-limit type: the set of belief states at which it is optimal to purchase information need not be convex. The expected value of information, viewed as a function of the decision maker's knowledge, is related to such nonmonotone optimal policies.
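The model described above can be illustrated with a minimal sketch. The following is not the paper's formulation but an assumed two-state instance: a machine deteriorates from "good" to "bad", the belief b = P(bad) is the decision maker's knowledge, and at each stage one may stop (salvage value), continue blindly, or continue after purchasing a noisy observation at a cost. Value iteration over a discretized belief interval approximates the optimal expected value function; all parameter values are illustrative only.

```python
import numpy as np

# Illustrative parameters (assumptions, not from the paper).
q = 0.2          # P(good -> bad) in one transition
r = (1.0, -1.0)  # per-stage reward when continuing in (good, bad)
s = 0.0          # salvage value received on stopping
c = 0.3          # cost of purchasing one observation
acc = 0.9        # P(observation reports the true state)
beta = 0.95      # discount factor
N = 201
grid = np.linspace(0.0, 1.0, N)  # discretized belief b = P(bad)

def predict(b):
    """Belief after one unobserved transition of the chain."""
    return b + (1.0 - b) * q

def V_at(V, b):
    """Linear interpolation of the value function on the belief grid."""
    return np.interp(b, grid, V)

V = np.zeros(N)
for _ in range(500):
    Vn = np.empty(N)
    for i, b in enumerate(grid):
        reward = (1.0 - b) * r[0] + b * r[1]   # expected one-stage reward
        bp = predict(b)
        # Continue without buying information.
        cont = reward + beta * V_at(V, bp)
        # Buy a noisy signal, then update the belief by Bayes' rule.
        p_sig_bad = acc * bp + (1.0 - acc) * (1.0 - bp)
        b_if_bad = acc * bp / p_sig_bad
        b_if_good = (1.0 - acc) * bp / (1.0 - p_sig_bad)
        info = reward - c + beta * (
            p_sig_bad * V_at(V, b_if_bad)
            + (1.0 - p_sig_bad) * V_at(V, b_if_good)
        )
        Vn[i] = max(s, cont, info)             # stop / continue / buy info
    if np.max(np.abs(Vn - V)) < 1e-9:
        V = Vn
        break
    V = Vn
```

In such a sketch one can inspect, for each belief b, which of the three terms attains the maximum; as the abstract notes, the region where purchasing information is optimal need not be a single interval of beliefs.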
