Near Optimal Bayesian Active Learning for Decision Making

How should we gather information to make effective decisions? We address Bayesian active learning and experimental design problems, where we sequentially select tests to reduce uncertaintyaboutasetofhypotheses. Instead ofminimizinguncertaintyperse,weconsidera set of overlappingdecision regions of these hypotheses. Our goal is to drive uncertainty into a single decision region as quickly as possible. We identify necessary and sucient conditionsforcorrectlyidentifyingadecisionregion that contains all hypotheses consistent with observations. We develop a novel Hyperedge Cutting (HEC) algorithm for this problem, and prove that is competitive with the intractable optimal policy. Our ecient implementation of the algorithm relies on computingsubsetsofthecompletehomogeneoussymmetric polynomials. Finally, we demonstrate its eectiveness on two practical applications: approximate comparison-based learning and activelocalizationusingarobotmanipulator.

[1]  Jeff A. Bilmes,et al.  Average-Case Active Learning with Costs , 2009, ALT.

[2]  K. Chaloner,et al.  Bayesian Experimental Design: A Review , 1995 .

[3]  Leslie Pack Kaelbling,et al.  Robust Belief-Based Execution of Manipulation Programs , 2008 .

[4]  Raymond Séroul Programming for Mathematicians , 2000 .

[5]  D. Lindley On a Measure of the Information Provided by an Experiment , 1956 .

[6]  I. G. MacDonald,et al.  Symmetric functions and Hall polynomials , 1979 .

[7]  Mukesh K. Mohania,et al.  Decision trees for entity identification: approximation algorithms and hardness results , 2007, TALG.

[8]  Yury Lifshits,et al.  Disorder inequality: a combinatorial approach to nearest neighbor search , 2008, WSDM '08.

[9]  Peter M. Will,et al.  An Experimental System for Computer Controlled Mechanical Assembly , 1975, IEEE Transactions on Computers.

[10]  Nicholas Roy,et al.  Efficient Optimization of Information-Theoretic Exploration in SLAM , 2008, AAAI.

[11]  Robert D. Nowak,et al.  Noisy Generalized Binary Search , 2009, NIPS.

[12]  Teresa M. Przytycka,et al.  On an Optimal Split Tree Problem , 1999, WADS.

[13]  Andreas Krause,et al.  Adaptive Submodularity: Theory and Applications in Active Learning and Stochastic Optimization , 2010, J. Artif. Intell. Res..

[14]  Wolfram Burgard,et al.  Active Markov localization for mobile robots , 1998, Robotics Auton. Syst..

[15]  John Langford,et al.  Agnostic active learning , 2006, J. Comput. Syst. Sci..

[16]  Igor Kononenko,et al.  Machine learning for medical diagnosis: history, state of the art and perspective , 2001, Artif. Intell. Medicine.

[17]  Amin Karbasi,et al.  Comparison-Based Learning with Rank Nets , 2012, ICML.

[18]  Ronald A. Howard,et al.  Information Value Theory , 1966, IEEE Trans. Syst. Sci. Cybern..

[19]  Sanjoy Dasgupta,et al.  Analysis of a greedy active learning strategy , 2004, NIPS.

[20]  Andreas Krause,et al.  Near-Optimal Bayesian Active Learning with Noisy Observations , 2010, NIPS.

[21]  Siddhartha S. Srinivasa,et al.  Efficient touch based localization through submodularity , 2012, 2013 IEEE International Conference on Robotics and Automation.