Active Learning for Level Set Estimation

Many information gathering problems require determining the set of points, for which an unknown function takes value above or below some given threshold level. We formalize this task as a classification problem with sequential measurements, where the unknown function is modeled as a sample from a Gaussian process (GP). We propose LSE, an algorithm that guides both sampling and classification based on GP-derived confidence bounds, and provide theoretical guarantees about its sample complexity. Furthermore, we extend LSE and its theory to two more natural settings: (1) where the threshold level is implicitly defined as a percentage of the (unknown) maximum of the target function and (2) where samples are selected in batches. We evaluate the effectiveness of our proposed methods on two problems of practical interest, namely autonomous monitoring of algal populations in a lake environment and geolocating network latency.

[1]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[2]  Frédéric Galland,et al.  Synthetic aperture Radar oil spill segmentation by stochastic complexity minimization , 2004, IEEE Geoscience and Remote Sensing Letters.

[3]  Gaurav S. Sukhatme,et al.  Adaptive sampling for environmental robotics , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[4]  Chris Bailey-Kellogg,et al.  Gaussian Processes for Active Data Mining of Spatial Aggregates , 2005, SDM.

[5]  Larry A. Wasserman,et al.  Active Learning For Identifying Function Threshold Boundaries , 2005, NIPS.

[6]  A. Singh,et al.  Active learning for adaptive mobile sensing networks , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[7]  Gaurav S. Sukhatme,et al.  Detecting and Tracking Level Sets of Scalar Fields using a Robotic Sensor Network , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[8]  Krithi Ramamritham,et al.  ACE in the Hole: Adaptive Contour Estimation Using Collaborating Mobile Sensors , 2008, 2008 International Conference on Information Processing in Sensor Networks (ipsn 2008).

[9]  Jeff G. Schneider,et al.  Actively learning level-sets of composite functions , 2008, ICML '08.

[10]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[11]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[12]  Andreas Krause,et al.  Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.

[13]  Alan Fern,et al.  Batch Bayesian Optimization via Simulation Matching , 2010, NIPS.

[14]  Nando de Freitas,et al.  A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning , 2010, ArXiv.

[15]  Andreas Krause,et al.  Parallelizing Exploration-Exploitation Tradeoffs with Gaussian Process Bandit Optimization , 2012, ICML.

[16]  Roland Siegwart,et al.  Autonomous Inland Water Monitoring: Design and Application of a Surface Vessel , 2012, IEEE Robotics & Automation Magazine.

[17]  Roman Garnett,et al.  Bayesian Optimal Active Search and Surveying , 2012, ICML.

[18]  Shai Shalev-Shwartz,et al.  Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..

[19]  Andreas Krause,et al.  Active Learning for Multi-Objective Optimization , 2013, ICML.