Optimal Value of Information in Dynamic Bayesian Networks

Decision-making based on probabilistic reasoning often involves selecting a subset of expensive observations, that best predict the system state. Krause and Guestrin described two problems of non-myopically selecting observations in graphical models to optimize the value of information (VoI), namely, selection of an optimal subset of observations, and generation of an optimal conditional observation plan. They showed that these problems are intractable in general, but gave polynomial-time dynamic programming algorithms, called VoIDP, for chain graphical models. In this paper, we consider the general setting of Dynamic Bayesian Networks (DBNs), and formulate these problems in terms of finding optimal policies in Markov Decision Processes (MDPs). The time complexities of the resulting algorithms are exponential in general, but polynomial for chain models. Given a chain model, our algorithms compute the same subset, or plan, as VoIDP. Interestingly, despite their generality, our algorithms have significantly better time complexities for chain models compared to VoIDP. We also present an outline of how to use our framework to formulate an approximate, nonmyopic VoI optimization technique, with absolute a posteriori guarantees on approximation, that can handle arbitrary DBNs efficiently.

[1]  Andreas Krause,et al.  Optimal Nonmyopic Value of Information in Graphical Models - Efficient Algorithms and Theoretical Limits , 2005, IJCAI.

[2]  Valentina Bayer-Zubek Learning diagnostic policies from examples by systematic search , 2004, UAI 2004.

[3]  Andreas Krause,et al.  Near-optimal Nonmyopic Value of Information in Graphical Models , 2005, UAI.

[4]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[5]  L. van der Gaag,et al.  Selective evidence gathering for diagnostic belief networks , 1993 .

[6]  Adnan Darwiche,et al.  Algorithms and Applications for the Same-Decision Probability , 2014, J. Artif. Intell. Res..

[7]  Finn Verner Jensen,et al.  Myopic Value of Information in Influence Diagrams , 1997, UAI.

[8]  Peter D. Turney Cost-Sensitive Classification: Empirical Evaluation of a Hybrid Genetic Decision Tree Induction Algorithm , 1994, J. Artif. Intell. Res..

[9]  Ronald A. Howard,et al.  Information Value Theory , 1966, IEEE Trans. Syst. Sci. Cybern..

[10]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[11]  Ronald A. Howard,et al.  Influence Diagrams , 2005, Decis. Anal..

[12]  Andreas Krause,et al.  Intelligent light control using sensor networks , 2005, SenSys '05.

[13]  Andreas Krause,et al.  Optimal Value of Information in Graphical Models , 2009, J. Artif. Intell. Res..

[14]  Vijay S. Mookerjee,et al.  Sequential Decision Models for Expert System Optimization , 1997, IEEE Trans. Knowl. Data Eng..