Submodularity and its applications in optimized information gathering

Where should we place sensors to efficiently monitor natural drinking water resources for contamination? Which blogs should we read to learn about the biggest stories on the Web? These problems share a fundamental challenge: How can we obtain the most useful information about the state of the world, at minimum cost? Such information gathering, or active learning, problems are typically NP-hard, and were commonly addressed using heuristics without theoretical guarantees about the solution quality. In this article, we describe algorithms which efficiently find provably near-optimal solutions to large, complex information gathering problems. Our algorithms exploit submodularity, an intuitive notion of diminishing returns common to many sensing problems: the more sensors we have already deployed, the less we learn by placing another sensor. In addition to identifying the most informative sensing locations, our algorithms can handle more challenging settings, where sensors need to be able to reliably communicate over lossy links, where mobile robots are used for collecting data, or where solutions need to be robust against adversaries and sensor failures. We also present results applying our algorithms to several real-world sensing tasks, including environmental monitoring using robotic sensors, activity recognition using a built sensing chair, a sensor placement challenge, and deciding which blogs to read on the Web.

[1]  D. Lindley On a Measure of the Information Provided by an Experiment , 1956 .

[2]  Ronald A. Howard,et al.  Information Value Theory , 1966, IEEE Trans. Syst. Sci. Cybern..

[3]  E. J. Sondik,et al.  The Optimal Control of Partially Observable Markov Decision Processes. , 1971 .

[4]  Edward J. Sondik,et al.  The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..

[5]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[6]  Satoru Fujishige,et al.  Polymatroidal Dependence Structure of a Set of Random Variables , 1978, Inf. Control..

[7]  A. McBratney,et al.  DPTIMAL INTERPOLATION AND ISARITHMIC MAPPING OF SOIL PROPERTIES. IV SAMPLING STRATEGY , 1981 .

[8]  László Lovász,et al.  Submodular functions and convexity , 1982, ISMP.

[9]  A. McBratney,et al.  Optimal interpolation and isarithmic mapping of soil properties: V. Co-regionalization and multiple sampling strategy , 1983 .

[10]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[11]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[12]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[13]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[14]  R. Deyo,et al.  The effectiveness of four interventions for the prevention of low back pain. , 1994, JAMA.

[15]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[16]  David M. Smith Pressure Ulcers in the Nursing Home , 1995, Annals of Internal Medicine.

[17]  K. Chaloner,et al.  Bayesian Experimental Design: A Review , 1995 .

[18]  U. Feige A threshold of ln n for approximating set cover , 1998, JACM.

[19]  Lewis A. Rossman,et al.  The EPANET Programmer's Toolkit for Analysis of Water Distribution Systems , 1999 .

[20]  Alexander Schrijver,et al.  Theory of linear and integer programming , 1986, Wiley-Interscience series in discrete mathematics and optimization.

[21]  Héctor H. González-Baños,et al.  A randomized art-gallery algorithm for sensor placement , 2001, SCG '01.

[22]  Alex Pentland,et al.  A sensing chair using pressure distribution sensors , 2001 .

[23]  Feng Zhao,et al.  Information-driven dynamic sensor collaboration , 2002, IEEE Signal Process. Mag..

[24]  Feng Zhao,et al.  Information-Driven Dynamic Sensor Collaboration for Tracking Applications , 2002 .

[25]  M. de Looze,et al.  Please Scroll down for Article Ergonomics Sitting Comfort and Discomfort and the Relationships with Objective Measures Sitting Comfort and Discomfort and the Relationships with Objective Measures , 2022 .

[26]  Hong Z. Tan,et al.  Template-based Recognition of Static Sitting Postures , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[27]  Nicholas Roy,et al.  Global A-Optimal Robot Exploration in SLAM , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[28]  Andreas Krause,et al.  Near-optimal Nonmyopic Value of Information in Graphical Models , 2005, UAI.

[29]  Ronald E. Parr,et al.  Non-Myopic Multi-Aspect Sensing with Partially Observable Markov Decision Processes , 2005 .

[30]  Robert D. Nowak,et al.  Faster Rates in Regression via Active Learning , 2005, NIPS.

[31]  D. Wiens Robustness in spatial studies II: minimax design , 2005 .

[32]  Wolfram Burgard,et al.  Information Gain-based Exploration Using Rao-Blackwellized Particle Filters , 2005, Robotics: Science and Systems.

[33]  Himanshu Gupta,et al.  Connected sensor cover: self-organization of sensor networks for efficient query execution , 2003, IEEE/ACM Transactions on Networking.

[34]  C. Guestrin,et al.  Near-optimal sensor placements: maximizing information while minimizing communication cost , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[35]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[36]  Lawrence Carin,et al.  Nonmyopic Multiaspect Sensing With Partially Observable Markov Decision Processes , 2007, IEEE Transactions on Signal Processing.

[37]  Andreas Krause,et al.  Robust, low-cost, non-intrusive sensing and recognition of seated postures , 2007, UIST.

[38]  Andreas Krause,et al.  Nonmyopic active learning of Gaussian processes: an exploration-exploitation approach , 2007, ICML '07.

[39]  H. Robbins Some aspects of the sequential design of experiments , 1952 .

[40]  Abhimanyu Das,et al.  Algorithms for subset selection in linear regression , 2008, STOC.

[41]  Jonathan W. Berry,et al.  A FACILITY LOCATION APPROACH TO SENSOR PLACEMENT OPTIMIZATION , 2008 .

[42]  Maxim A. Batalin,et al.  NIMS-AQ: A novel system for autonomous sensing of aquatic environments , 2008, 2008 IEEE International Conference on Robotics and Automation.

[43]  H. B. McMahan,et al.  Robust Submodular Observation Selection , 2008 .

[44]  Avi Ostfeld,et al.  The Battle of the Water Sensor Networks (BWSN): A Design Challenge for Engineers and Algorithms , 2008 .

[45]  Andreas Krause,et al.  Efficient Sensor Placement Optimization for Securing Large Water Distribution Networks , 2008 .

[46]  Andreas Krause,et al.  Optimizing Sensing: From Water to the Web , 2009, Computer.

[47]  Andreas Krause,et al.  Simultaneous placement and scheduling of sensors , 2009, 2009 International Conference on Information Processing in Sensor Networks.

[48]  Andreas Krause,et al.  Nonmyopic Adaptive Informative Path Planning for Multiple Robots , 2009, IJCAI.

[49]  Andreas Krause,et al.  Efficient Informative Sensing using Multiple Robots , 2014, J. Artif. Intell. Res..

[50]  Dafna Shahaf,et al.  Turning down the noise in the blogosphere , 2009, KDD.

[51]  Andreas Krause,et al.  Online Learning of Assignments , 2009, NIPS.

[52]  Andreas Krause,et al.  Optimal Value of Information in Graphical Models , 2009, J. Artif. Intell. Res..

[53]  Andreas Krause,et al.  Online distributed sensor selection , 2010, IPSN '10.

[54]  Andreas Krause,et al.  SFO: A Toolbox for Submodular Function Optimization , 2010, J. Mach. Learn. Res..

[55]  Andreas Krause,et al.  Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization , 2010, COLT 2010.