Adaptive RiskAware Bidding with Budget Constraint in Display Advertising

Real-time bidding (RTB) has become a major paradigm of display advertising. Each ad impression generated from a user visit is auctioned in real time, where demand-side platform (DSP) automatically provides bid price usually relying on the ad impression value estimation and the optimal bid price determination. However, the current bid strategy overlooks large randomness of the user behaviors (e.g., click) and the cost uncertainty caused by the auction competition. In this work, we explicitly factor in the uncertainty of estimated ad impression values and model the risk preference of a DSP under a specific state and market environment via a sequential decision process. Specifically, we propose a novel adaptive risk-aware bidding algorithm with budget constraint via reinforcement learning, which is the first to simultaneously consider estimation uncertainty and the dynamic risk tendency of a DSP. We theoretically unveil the intrinsic relation between the uncertainty and the risk tendency based on value at risk (VaR). Consequently, we propose two instantiations to model risk tendency, including an expert knowledge-based formulation embracing three essential properties and an adaptive learning method based on self-supervised reinforcement learning. We conduct extensive experiments on public datasets and show that the proposed framework outperforms state-of-the-art methods in practical settings.

[1]  A. Vries Value at Risk , 2019, Derivatives.

[2]  Daochen Zha,et al.  Experience Replay Optimization , 2019, IJCAI.

[3]  Keping Yang,et al.  Deep Session Interest Network for Click-Through Rate Prediction , 2019, IJCAI.

[4]  Priyanka Bhatt,et al.  Robust Factorization Machines for User Response Prediction , 2018, WWW.

[5]  Hao Wang,et al.  Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising , 2018, CIKM.

[6]  Tat-Seng Chua,et al.  Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks , 2017, IJCAI.

[7]  Guorui Zhou,et al.  Deep Interest Network for Click-Through Rate Prediction , 2017, KDD.

[8]  Yunming Ye,et al.  DeepFM: A Factorization-Machine based Neural Network for CTR Prediction , 2017, IJCAI.

[9]  Jun Wang,et al.  Managing Risk of Bidding in Display Advertising , 2017, WSDM.

[10]  Jun Wang,et al.  Real-Time Bidding by Reinforcement Learning in Display Advertising , 2017, WSDM.

[11]  Dong Yu,et al.  Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features , 2016, KDD.

[12]  Jun Wang,et al.  Deep Learning over Multi-field Categorical Data - - A Case Study on User Response Prediction , 2016, ECIR.

[13]  Ming-Syan Chen,et al.  Predicting Winning Price in Real Time Bidding with Censored Data , 2015, KDD.

[14]  Jun Wang,et al.  Real-Time Bidding: A New Frontier of Computational Advertising Research , 2015, WSDM.

[15]  Weinan Zhang,et al.  Optimal real-time bidding for display advertising , 2014, KDD.

[16]  Olivier Chapelle,et al.  Modeling delayed feedback in display advertising , 2014, KDD.

[17]  David Lo,et al.  Predicting response in mobile advertising with hierarchical importance-aware factorization machine , 2014, WSDM.

[18]  Martin Wattenberg,et al.  Ad click prediction: a view from the trenches , 2013, KDD.

[19]  Jun Wang,et al.  Real-time bidding for online advertising: measurement and analysis , 2013, ADKDD '13.

[20]  Wentong Li,et al.  Estimating conversion rate in display advertising from past erformance data , 2012, KDD.

[21]  Foster J. Provost,et al.  Bid optimizing and inventory scoring in targeted online advertising , 2012, KDD.

[22]  Joaquin Quiñonero Candela,et al.  Web-Scale Bayesian Click-Through rate Prediction for Sponsored Search Advertising in Microsoft's Bing Search Engine , 2010, ICML.

[23]  Sergei Vassilvitskii,et al.  Adaptive bidding for display advertising , 2009, WWW '09.

[24]  Philippe Artzner,et al.  Coherent Measures of Risk , 1999 .

[25]  E. Maasland,et al.  Auction Theory , 2021, Springer Texts in Business and Economics.

[26]  R. Rockafellar,et al.  Optimization of conditional value-at risk , 2000 .