State information lag markov decision process with control limit rule
暂无分享,去创建一个
In the framework of a discrete Markov decision process with state information lag, this article suggests a way for selecting an optimal policy using the control limit rule. The properties sufficient for an optimal decision rule to be contained in the class of control limit rules are also studied. The degradation in expected reward from that of the perfect information process provides a measure of the potential value of improving the information system.
[1] C. Derman. On Sequential Decisions and Markov Chains , 1962 .
[2] W. Rudin. Principles of mathematical analysis , 1964 .
[3] S. S. Chitgopekar. Continuous Time Markovian Sequential Control Processes , 1969 .