An Efficient Bandit Algorithm for Realtime Multivariate Optimization

Optimization is commonly employed to determine the content of web pages, such as to maximize conversions on landing pages or click-through rates on search engine result pages. Often the layout of these pages can be decoupled into several separate decisions. For example, the composition of a landing page may involve deciding which image to show, which wording to use, what color background to display, etc. Such optimization is a combinatorial problem over an exponentially large decision space. Randomized experiments do not scale well to this setting, and therefore, in practice, one is typically limited to optimizing a single aspect of a web page at a time. This represents a missed opportunity in both the speed of experimentation and the exploitation of possible interactions between layout decisions. Here we focus on multivariate optimization of interactive web pages. We formulate an approach where the possible interactions between different components of the page are modeled explicitly. We apply bandit methodology to explore the layout space efficiently and use hill-climbing to select optimal content in realtime. Our algorithm also extends to contextualization and personalization of layout selection. Simulation results show the suitability of our approach to large decision spaces with strong interactions between content. We further apply our algorithm to optimize a message that promotes adoption of an Amazon service. After only a single week of online optimization, we saw a 21% conversion increase compared to the median layout. Our technique is currently being deployed to optimize content across several locations at Amazon.com.
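
To make the approach described above concrete, the following is a minimal sketch, assuming a Gaussian Bayesian linear-regression reward model with one-hot main effects and pairwise interaction features; the function names (featurize, select_layout, update_posterior) and all parameters are illustrative assumptions, not the paper's implementation. Each request samples a weight vector from the posterior (Thompson sampling), hill-climbs over slots to choose a layout, and the posterior is updated with the observed reward.

import itertools
import numpy as np

# Hypothetical sketch, not the paper's implementation: a Bayesian linear
# reward model with pairwise interaction features, Thompson sampling for
# exploration, and greedy hill-climbing over slots to pick a layout.

rng = np.random.default_rng(0)

def featurize(layout, n_values):
    """One-hot main effects per slot plus pairwise slot-interaction features."""
    onehots = [np.eye(n_values)[v] for v in layout]
    parts = list(onehots)
    for a, b in itertools.combinations(range(len(layout)), 2):
        parts.append(np.outer(onehots[a], onehots[b]).ravel())
    return np.concatenate(parts)

def select_layout(mu, cov, n_slots, n_values, n_passes=2):
    """Thompson sampling plus coordinate-ascent (hill-climbing) layout search."""
    theta = rng.multivariate_normal(mu, cov)                 # posterior sample
    layout = list(rng.integers(0, n_values, size=n_slots))   # random start
    for _ in range(n_passes):
        for slot in range(n_slots):
            scores = [theta @ featurize(layout[:slot] + [v] + layout[slot + 1:], n_values)
                      for v in range(n_values)]
            layout[slot] = int(np.argmax(scores))
    return layout

def update_posterior(mu, cov, x, reward, noise_var=1.0):
    """Conjugate Gaussian update after observing (layout features, reward)."""
    prec = np.linalg.inv(cov)
    new_prec = prec + np.outer(x, x) / noise_var
    new_cov = np.linalg.inv(new_prec)
    new_mu = new_cov @ (prec @ mu + x * reward / noise_var)
    return new_mu, new_cov

if __name__ == "__main__":
    n_slots, n_values = 3, 4
    d = n_slots * n_values + (n_slots * (n_slots - 1) // 2) * n_values ** 2
    mu, cov = np.zeros(d), np.eye(d)
    layout = select_layout(mu, cov, n_slots, n_values)       # layout to serve
    x = featurize(layout, n_values)
    mu, cov = update_posterior(mu, cov, x, reward=1.0)       # observed conversion

The coordinate-ascent step is what avoids enumerating the exponential layout space: with D slots of N values each, one pass evaluates only D x N candidate layouts rather than N^D, while the interaction features let the model capture dependencies between slots.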
