论文信息 - Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling - 字舞流文

Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling

There is significant interest in learning and optimizing a complex system composed of multiple sub-components, where these components may be agents or autonomous sensors. Among the rich literature on this topic, agentbased and domain-specific simulations can capture complex dynamics and subgroup interaction, but optimizing over such simulations can be computationally and algorithmically challenging. Bayesian approaches, such as Gaussian processes (GPs), can be used to learn a computationally tractable approximation to the underlying dynamics but typically neglect the detailed information about subgroups in the complicated system. We attempt to find the best of both worlds by proposing the idea of decomposed feedback, which captures group-based heterogeneity and dynamics. We introduce a novel decomposed GP regression to incorporate the subgroup decomposed feedback. Our modified regression has provably lower variance – and thus a more accurate posterior – compared to previous approaches; it also allows us to introduce a decomposed GP-UCB optimization algorithm that leverages subgroup feedback. The Bayesian nature of our method makes the optimization algorithm trackable with a theoretical guarantee on convergence and no-regret property. To demonstrate the wide applicability of this work, we execute our algorithm on two disparate social problems: infectious disease control in a heterogeneous population and allocation of distributed weather sensors. Experimental results show that our new method provides significant improvement compared to the state-of-the-art.

Milind Tambe | Bistra Dilkina | Bryan Wilder | Sze-chuan Suen | Kai Wang | Milind Tambe | Kai Wang | B. Dilkina | B. Wilder | S. Suen

[1] Gergely Neu,et al. An Efficient Algorithm for Learning with Semi-bandit Feedback , 2013, ALT.

[2] Xiaofeng Meng,et al. Wind Power Forecasts Using Gaussian Processes and Numerical Weather Prediction , 2014, IEEE Transactions on Power Systems.

[3] Kirthevasan Kandasamy,et al. High Dimensional Bayesian Optimisation and Bandits via Additive Models , 2015, ICML.

[4] Peter Auer,et al. Using Confidence Bounds for Exploitation-Exploration Trade-offs , 2003, J. Mach. Learn. Res..

[5] M. Alcoforado,et al. Perception of temperature and wind by users of public outdoor spaces: relationships with weather parameters and personal characteristics , 2011, International journal of biometeorology.

[6] D. H. Lee. Seventy-five years of searching for a heat index. , 1980, Environmental research.

[7] Peter Vrancx,et al. Efficient Evaluation of Influenza Mitigation Strategies Using Preventive Bandits , 2017, AAMAS Workshops.

[8] Maurice Bluestein,et al. The New Wind Chill Equivalent Temperature Chart. , 2005 .

[9] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[10] Jonas Mockus,et al. On Bayesian Methods for Seeking the Extremum , 1974, Optimization Techniques.

[11] Bo An,et al. Multi-objective optimization for security games , 2012, AAMAS.

[12] Vianney Perchet,et al. Gaussian Process Optimization with Mutual Information , 2013, ICML.

[13] D. Kleinbaum,et al. Applied Regression Analysis and Other Multivariate Methods , 1978 .

[14] Milind Tambe,et al. Preventing Infectious Disease in Dynamic Populations Under Uncertainty , 2018, AAAI.

[15] Nando de Freitas,et al. A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning , 2010, ArXiv.

[16] Kian Hsiang Low,et al. Decentralized High-Dimensional Bayesian Optimization with Factor Graphs , 2017, AAAI.

[17] Nakul Chitnis,et al. Mathematical models of contact patterns between age groups for predicting the spread of infectious diseases. , 2013, Mathematical biosciences and engineering : MBE.

[18] Andreas Krause,et al. Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.

[19] Tomasz P. Michalak,et al. Repeated Dollar Auctions: A Multi-Armed Bandit Approach , 2016, AAMAS.

[20] Harold J. Kushner,et al. A New Method of Locating the Maximum Point of an Arbitrary Multipeak Curve in the Presence of Noise , 1964 .

[21] Bo An,et al. Agent-mediated multi-step optimization for resource allocation in distributed sensor networks , 2011, AAMAS.

[22] G. Laschewski,et al. The perceived temperature – a versatile index for the assessment of the human thermal environment. Part A: scientific basics , 2011, International Journal of Biometeorology.

[23] Jan Medlock,et al. Optimizing the impact of low-efficacy influenza vaccines , 2018, Proceedings of the National Academy of Sciences.

[24] Philippe Rigo,et al. A review on simulation-based optimization methods applied to building performance analysis , 2014 .

[25] Volkan Cevher,et al. High-Dimensional Bayesian Optimization via Additive Models with Overlapping Groups , 2018, AISTATS.

[26] Bolei Zhou,et al. Optimization as Estimation with Gaussian Processes in Bandit Settings , 2015, AISTATS.

[27] Kian Hsiang Low,et al. Multi-robot active sensing of non-stationary gaussian process-based environmental phenomena , 2014, AAMAS.

[28] M. van Boven,et al. Variation in loss of immunity shapes influenza epidemics and the impact of vaccination , 2017, BMC Infectious Diseases.

[29] Donald R. Jones,et al. Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[30] Oleg V. Khamisov. Objective Function Decomposition in Global Optimization , 2017, LION.

[31] Jasper Snoek,et al. Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[32] W John Edmunds,et al. Estimating the impact of childhood influenza vaccination programmes in England and Wales. , 2008, Vaccine.