An Information-Theoretic Framework for Unifying Active Learning Problems

This paper presents an information-theoretic framework for unifying active learning problems: level set estimation (LSE), Bayesian optimization (BO), and their generalized variant. We first introduce a novel active learning criterion that subsumes an existing LSE algorithm and achieves state-of-the-art performance in LSE problems with a continuous input domain. Then, by exploiting the relationship between LSE and BO, we design a competitive information-theoretic acquisition function for BO that has interesting connections to the upper confidence bound and max-value entropy search (MES) acquisition functions. The latter connection reveals a drawback of MES that has important implications not only for MES but also for other MES-based acquisition functions. Finally, our unifying information-theoretic framework can be applied to solve a generalized problem of LSE and BO involving multiple level sets in a data-efficient manner. We empirically evaluate the performance of our proposed algorithms on synthetic benchmark functions, a real-world dataset, and hyperparameter tuning of machine learning models.
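To make the LSE/BO connection concrete, the sketch below builds a standard Gaussian process surrogate and shows the two views side by side: an LSE-style rule that classifies inputs against a level h using confidence bounds, and a GP-UCB acquisition value that a BO loop would maximize. This is a minimal illustration under assumed settings (RBF kernel, 1-D inputs, hypothetical helper names `rbf_kernel`, `gp_posterior`, `classify_lse`), not the paper's actual algorithm.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=0.2, variance=1.0):
    """Squared-exponential kernel on 1-D inputs (illustrative choice)."""
    d = A[:, None] - B[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)

def gp_posterior(X, y, Xs, noise=1e-4):
    """Exact GP posterior mean and variance at test points Xs."""
    K = rbf_kernel(X, X) + noise * np.eye(len(X))
    L = np.linalg.cholesky(K)
    Ks = rbf_kernel(X, Xs)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    mu = Ks.T @ alpha
    v = np.linalg.solve(L, Ks)
    var = rbf_kernel(Xs, Xs).diagonal() - np.sum(v ** 2, axis=0)
    return mu, np.maximum(var, 1e-12)

def classify_lse(mu, var, h, beta=3.0):
    """LSE view: +1 if confidently above level h, -1 if below, 0 undecided."""
    sd = np.sqrt(var)
    return np.where(mu - beta * sd > h, 1,
                    np.where(mu + beta * sd < h, -1, 0))

# Toy 1-D example: noisy observations of sin(3x) on [0, 1], level h = 0.
rng = np.random.default_rng(0)
X = np.array([0.1, 0.4, 0.6, 0.9])
y = np.sin(3.0 * X) + 0.01 * rng.standard_normal(len(X))
Xs = np.linspace(0.0, 1.0, 101)

mu, var = gp_posterior(X, y, Xs)
labels = classify_lse(mu, var, h=0.0)     # LSE view of the posterior
ucb = mu + 2.0 * np.sqrt(var)             # BO view: GP-UCB acquisition
next_query = Xs[np.argmax(ucb)]           # point a BO loop would query next
```

Both views are driven by the same posterior mean and variance, which is the structural link the paper's framework exploits: changing the level h (or using multiple levels) moves between LSE, BO, and their generalized variant.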
